The National Corpus of the Russian Language as a Source Base for Social and Humanitarian Researches

Abstract: 
The title of the article directly reflects its purpose: to present the National Corpus of the Russian Language (NCRL, Corpus) as an information base for research in the socio-humanitarian profile. Currently, the Corpus has a volume of over two billion words usages and is an information and reference system based on a collection of Russian-language texts created from the beginning of the 18th century to 2010, representing the Russian language within the specified time limits. The NCRL reflects the whole variety of genres, styles, social and territorial variants of our language, covering fiction and scientific literature, essays, journalism, public speaking, etc. The texts of the second half of the 20th — early 21st centuries are presented in the Corpus most fully and diversely. The article highlights two main areas of modern scientific research using NCRS both as a source of empirical information and at the same time as a research tool. The first has a distinct linguistic orientation and is associated with turning to the Corpus when teaching Russian and foreign languages, in the analysis of dialects and sociolects, when immersed in the field of theory and practice of language translation. The corpus also makes it possible to track changes in the language norm and other language parameters over a certain period of time. The second direction is formed by studies of a clearly social profile, associated with the diagnosis of social consciousness and its transformations. With the help of the NCRS, its huge, in the thousands and tens of thousands of units of analysis, data arrays, equipped with the necessary statistics, the conclusions obtained in sociological surveys, discourse studies, associative experiments performed on fairly limited samples are reliably verified. Autonomous appeals to the Corpus are also of indisputable value, providing the most important information about the attitude of the bearers of Russian linguistic culture to various social institutions, processes, phenomena and subjects. The article provides examples of such studies associated with a number of key political concepts, such as officials, law enforcement agencies, corruption, politicians, the elite and compatriots.
edn: 
DGOEFK

For citation: Kornienko A.V. The National Corpus of the Russian Language as a Source Base for Social and Humanitarian Researches. St. Petersburg Sociology Today. 2023. N. 21. P. 58-71. DOI:10.25990/socinstras.pss-21.8sqp-bb68

References: 
  • Barash R. E. Kontseptsiya “sootechestvennikov” — smysl i znacheniye v kontekste rastushchego etnicheskogo natsionalizma v Rossii. Monitoring obshchestvennogo mneniya. 2011, no. 6 (106), pp. 15–24. (In Russ.)
  • Bol'shoy tolkovyy slovar' russkogo yazyka. Sost. i gl. red. S. A. Kuznetsov. St.-Petersburg, Norint, 1998, 1536 p. (In Russ.)
  • Zakharov V. P. Korpusnaya lingvistika: Uchebno-metodicheskoye posobiye. St.-Petersburg, Izdatel'stvo SPbGU, 2005, 48 p. (In Russ.)
  • Zolyan S. T. Sootechestvenniki — semanticheskiy portret po dannym Natsional'nogo korpusa russkogo yazyka. Politicheskaya lingvistika, 2021, no. 2 (86), pp. 166–180. (In Russ.)
  • Irgizova K. V. Korpusnaya lingvistika v otechestvennom i zarubezhnom yazykoznanii na sovremennom etape. [Elektronnyy resurs]. 2019. URL: https://cyberleninka.ru/article/n/korpusnaya-lingvistika-v-otechestvenno... (access date: 11.07.2023). (In Russ.)
  • Krysin L. P. Problema sotsial'noy differentsiatsii yazyka v sovremennoy lingvistike Voprosy sotsial'noy lingvistiki. Red. A. V. Desnitskaya, V. M. Zhirmunskiy. Leningrad, Nauka, 1969, pp. 73–91. (In Russ.)
  • Kul'tura russkoy rechi: Entsiklopedicheskiy slovar'- spravochnik. Pod red. L. Yu. Ivanova, A. P. Skovorodnikova, Ye. N. Shiryayeva i dr. Moscow, Flinta, Nauka, 2003, 840 p. (In Russ.)
  • Osnovnoy korpus NKRYA. [Elektronnyy resurs]. 2023. URL: https://ruscorpora.ru/corpus/main (access date: 19.07.2023). (In Russ.)
  • Parallel'nyy korpus NKRYA. [Elektronnyy resurs]. 2023. URL: https://ruscorpora.ru/corpus/para-eng (access date: 21.07.2023). (In Russ.)
  • Plungyan V. A. Korpus kak instrument i kak ideologiya: o nekotorykh urokakh sovremennoy korpusnoy lingvistiki. Russkiy yazyk v nauchnom osveshchenii. 2008, no. 2 (16), pp. 7–20. (In Russ.)
  • Plungyan V. A. Pochemu sovremennaya lingvistika dolzhna byt' lingvistikoy korpusov. [Elektronnyy resurs]. 2009. URL: http://www.polit.ru/lectures/2009/10/23/corpus.html (access date: 14.07.2023). (In Russ.)
  • Sostav i struktura NKRYA. [Elektronnyy resurs]. 2023. URL: https://ruscorpora.ru/page/corpora-structure/ (access date: 19.07.2023). (In Russ.)
  • Surikova T. I. Vlast' i obshchestvo: chto s nami proiskhodit [Elektronnyy resurs]. G. Ya. Solganik. Yazyk SMI i politika. Moscow, Izdatel'stvo MGU; Fakul'tet zhurnalistiki MGU, 2012. URL: https://lib.sale/jurnalistika-knigi/vlast-obschestvo-chto-nami-71590.html (access date: 11.07.2023). (In Russ.)
  • Chto takoye natsional'nyy korpus russkogo yazyka? [Elektronnyy resurs]. 2023. URL: https://ruscorpora.ru/media/uploads/2022/04/21/instruction-main.pdf (access date: 18.07.2023). (In Russ.)
  • Chudinov A. P., Segal N. A., Mishchenko A. N. Pravookhranitel'nyye organy v zerkale postsovetskogo mediaprostranstva. Politicheskaya lingvistika. 2023, no. 2 (98), pp. 104–108. (In Russ.)
  • Shveytser A. D., Nikol'skiy L. B. Vvedeniye v sotsiolingvistiku. Uchebnoye posobiye. Moscow, Vysshaya shkola, 1978, 216 p. (In Russ.)