parsing

Parsing the text of terminology dictionaries

The article outlines the tasks, approaches, and stages involved in developing a parsing technology for the text of a multilingual explanatory terminology dictionary. The research was conducted on the “Dictionary of Ukrainian Biological Terminology”. This dictionary was chosen from among the many kinds of dictionaries because terminology dictionaries provide a lexical-semantic basis for building systems for the intelligent processing of professional texts, which provide information on specific subject areas.

The linguistic analysis method for a Ukrainian commercial content

The article addresses the scientific and practical problem of automatically detecting meaningful keywords and categorizing Ukrainian content in Internet systems on the basis of linguistic analysis of text information. It presents a theoretical and experimental substantiation of linguistic analysis methods for Ukrainian content using Porter stemming.

A method for formally determining post quality on specialized sites

The article considers an algorithm for assessing post quality based on a set of chosen parameters. To solve the problem, the following tools are used: the Java library Jsoup for parsing HTML code, and Matlab tools for building the decision tree that assesses post quality.