Controlled Synthesis and Hierarchical Structuring of Ukrainian Datasets
The article addresses the urgent scientific and practical problem of overcoming the "cold start" effect in the development and deployment of Natural Language Processing (NLP) systems aimed at monitoring public opinion and sentiment analysis of Ukrainian-language content. The critical shortage of representative, balanced, and pre-labeled datasets that accurately reflect the specifics of social, economic, and political processes in modern Ukrainian society is identified as the main barrier to integrating advanced neural network solutions.