METHODOLOGICAL PRINCIPLES FOR CONSTRUCTING LANGUAGE CORPORA BASED ON MEDIA DISCOURSE

loading.default
thumbnail.default.alt

item.page.date

item.page.journal-title

item.page.journal-issn

item.page.volume-title

item.page.publisher

Western European Studies

item.page.abstract

The digital revolution has transformed media into a primary source of linguistic data. However, the transient and heterogeneous nature of media texts, namely, spanning news reports, social media posts, and multimedia broadcasts, requires a structured methodological approach. This article explores the core principles of corpus design, focusing on representativeness, sampling, metadata enrichment, and ethical considerations. By adhering to these principles, researchers can create robust datasets capable of supporting diachronic and synchronic linguistic analysis.

item.page.description

item.page.citation

item.page.collections

item.page.endorsement

item.page.review

item.page.supplemented

item.page.referenced