The article presents the results of a linguistic and statistical analysis of Yevheniia Kuznetsova’s novel Drabyna (The Ladder) and its Polish translation. The corpus manager AntConc was used to process both texts and compile frequency dictionaries; lemmatization and part-of-speech tagging of the resulting frequency lists were then carried out in MS Excel. Yule’s characteristic was calculated as a measure of vocabulary diversity, along with indices of exclusiveness, stereotypy, and lexical concentration, text entropy, redundancy, and other quantitative parameters. The original and the translation are nearly identical in length; however, the original shows greater lexical richness, more hapax legomena, and lower lexical stereotypy. The Automated Readability Index (ARI), which estimates textual complexity, is higher for the translation (14 versus 12), suggesting that the translated text is somewhat more difficult to read. The morpho-statistical analysis confirms that nouns and verbs dominate the vocabulary of both texts, while function words prevail in the running text: despite their relatively low share of the vocabulary, function words play an active role in ensuring syntactic cohesion and textual connectivity. The preservation of psycholinguistic statistical parameters (such as average sentence length, verb density coefficient, and logical coherence index) indicates an adequate reproduction of the author’s style in the translation.
The results characterize the quantitative properties of literary texts and highlight the potential of parallel corpora for comparative stylistic, translational, and psycholinguistic studies. They also offer a methodological framework for further research on the interaction between lexical choice, syntactic patterns, and psycholinguistic parameters in literary translation.
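Several of the quantitative parameters named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' procedure (the study relied on AntConc frequency lists and MS Excel). The tokenizer, the function names, the redundancy definition R = 1 - H / log2(V), and the use of the standard English-calibrated ARI coefficients are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercased runs of letters and digits.
    # No lemmatization; apostrophes and hyphens split words.
    return re.findall(r"[^\W_]+", text.lower())


def lexical_stats(tokens):
    """Frequency-based measures over a list of word tokens."""
    freq = Counter(tokens)
    n = len(tokens)   # text length in tokens
    v = len(freq)     # vocabulary size in types
    hapax = sum(1 for c in freq.values() if c == 1)

    # Frequency spectrum: m -> number of types occurring exactly m times.
    spectrum = Counter(freq.values())
    # Yule's K = 10^4 * (sum(m^2 * V_m) - N) / N^2
    yule_k = 1e4 * (sum(m * m * vm for m, vm in spectrum.items()) - n) / (n * n)

    # Shannon entropy of the word distribution, in bits per token.
    entropy = -sum((c / n) * math.log2(c / n) for c in freq.values())
    # Assumed redundancy definition: 1 - H / H_max, with H_max = log2(V).
    h_max = math.log2(v) if v > 1 else 0.0
    redundancy = 1 - entropy / h_max if h_max else 0.0

    return {
        "tokens": n,
        "types": v,
        "hapax": hapax,
        "yule_k": yule_k,
        "entropy": entropy,
        "redundancy": redundancy,
    }


def ari(text):
    """Automated Readability Index with the standard coefficients."""
    # Naive sentence split on terminal punctuation (an assumption).
    sentences = [s for s in re.split(r"[.!?…]+", text) if s.strip()]
    words = tokenize(text)
    chars = sum(len(w) for w in words)
    return 4.71 * chars / len(words) + 0.5 * len(words) / len(sentences) - 21.43
```

Calling `lexical_stats(tokenize(original))` and `lexical_stats(tokenize(translation))` on two hypothetical text strings gives values that can be compared side by side, as can `ari(original)` and `ari(translation)`. Absolute values depend heavily on tokenization and on whether word forms or lemmas are counted.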
- Buk, S. N. (2021). Corpus-lexicographic and linguistic-statistical measurements of Ivan Franko’s major prose works: dictionary and text. (Doctoral dissertation, Taras Shevchenko National University of Kyiv, Kyiv, Ukraine).
- Darchuk, N. P. (1976). Statistical characteristics of vocabulary as a reflection of text structure. Linguistic studies, 97-102.
- Demska-Kulchytska, O. (2005). Fundamentals of the national corpus of the Ukrainian language. Kyiv: Institute of Ukrainian language, National Academy of Sciences of Ukraine.
- Kamińska-Szmaj, I. (1988). Parts of speech in the dictionary and text of five functional styles of written Polish (based on frequency dictionary data). Bulletin of the Polish Linguistic Society, 127-136.
- Kamińska-Szmaj, I. (1990). Lexical differences between functional styles of written Polish: statistical analysis based on frequency dictionary data. Wrocław: Wrocław University Press.
- Kapranov, Ya. V. (2021). AntConc Corpus Manager and its capabilities for determining the frequency of keywords in different languages. In Knowledge engineering as a factor in intercultural cooperation between Ukraine, Japan, China, and South Korea: II International Scientific and Practical Conference (pp. 100-102). Kyiv: KNLU Publishing Center.
- Kotsuik, L. M., & Prokip, L. I. (2016). A corpus-based study of the terminology system of surgery. Scientific Bulletin of the Lesya Ukrainka Eastern European National University. Series: Philological studies, 6(331), 15-25.
- Kuznietsova, Ye. (2023). The Ladder. Lviv: The Old Lion Publishing House.
- Kuznietsowa, Ye. (2024). The Ladder (I. Boruszkowska, Trans.). Kraków: Znak Publishing House. (Original work published 2023)
- Nohovska, S. H. (2022). Peculiarities of English language toning drinks advertising texts reproduction in Ukrainian: corpus-based experimental research. Transcarpathian Philological Studies, 22(2), 182-189. https://doi.org/10.32782/tps2663-4880/2022.22.2.33
- Perebyinis, V. I. (2002). Statistical methods for linguists. Vinnytsia: Nova Knyha.
- TextAttributor. (2024). Methods. Mova.info. Retrieved October 3, 2025, from http://ta.mova.info/methods
- Zasiekin, S. V. (2012). Psycholinguistic universals of literary translation. Lutsk: Lesya Ukrainka Volyn National University.