Information Technology Intelligent Search of Content in E-commerce Systems

2023;
: pp. 142 - 166
1
Lviv Polytechnic National University, Information Systems and Networks Chair
2
Lviv Polytechnic National University, Information Systems and Networks Department; Osnabrück University, Institute of Computer Science
3
Osnabrück University, International Economic Policy Chair
4
Lviv Polytechnic National University, Mathematics Department

The article describes the process of developing intelligent search technology for content for the implementation of the module of e-commerce systems for forming a list of recommendations for regular users. Intelligent search of content is based on methods of linguistic analysis, modern algorithms for parsing and finding words, and recommendations based on user preferences. The main components of such a search are the parsing of text strings, the selection of keywords, the spelling check, the recognition of common abbreviations and acronyms, the semantic analysis of the text, the search by relevance with the extraction of synonyms, filters and sorting. A web application based on Java and Elasticsearch was developed with the implementation of a recommender system based on a collaborative filtering algorithm. The purpose of the work is to develop the technology of intelligent product search with the formation of a list of recommendations for the user. The object of the research is the processes of intelligent search with the possibility of generating recommendations for users in the field of any e-commerce without reference to the categorization of goods/services, etc. The subject of research is the methods and means of intelligent search of recommender systems based on the Collaborative Filtering algorithm for the formation of product recommendations for users, which is oriented on general coincidences of the choices of similar users. During the experimental testing of the developed system, a number of search queries were conducted with and without the NLP algorithm, the results of which demonstrated an improvement in system performance within the range of 15–95 % depending on the keyword and the presence/absence of errors in the search words. A comparison of the speed of execution of requests with already existing systems was also carried out. Yes, the amount of data in the storage may differ (error when comparing 60–70 ms). For example, a query that consists of 1 or 2 words will be found much faster by 20–70 ms compared to its counterparts. But for 3 and more, results are about the same – 9–20 ms faster.

  1. Eileen Pangu. The Search Backend (2020). URL: https://eileen-code4fun.medium.com/system-design- interview-mini-google-search-6fd319cd66ca.
  2. Traditional Database (Forward Indexes) vs Search Engines (Inverted Index) (2020). URL: https://dev.to/search?q=%28Forward%20Indexes%29%20vs%20Search%20Engines. https://dev.to/im_bhatman/introduction- to-inverted-indexes-l04
  3. Alex Franz, Thorsten Brants (2006). All Our N-gram are Belong to You, August.
  4. David Guthrie (2017). A Closer Look at Skip-gram. NLP Research group, Department of Computer Science.
  5. Levenshtein V. I. (1986). Binary codes with correction of dropouts, insertions and substitutions, 845–848.
  6. Navarro Gonzalo (2006). A guided tour to approximate string matching. Department of Computer Science, University of Chile, 31–88. DOI: 10.1145/375360.375365
  7. The Levenshtein Algorithm. Example (2021). URL: https://www.cuel.ogic.com/blog/the-levenshtein- algorithm.
  8. Levenshtein distance (2011). URL: https://habr.com/ru/post/114997/.
  9. Ghazanfar, Mustansar Ali; Prügel-Bennett, Adam; Szedmak, Sandor (2012). KernelMapping Recommender system algorithms. School of Electronics and Computer Science, 81–104. DOI: 10.1016/j.ins.2012.04.012
  10. Ricci F., Rokach L., Shapira B. (2011). Introduction to Recommender Systems Handbook, Recommender Systems Handbook, Springer. Faculty of Computer Science, 32–35. DOI: 10.1007/978-0-387-85820-3_1
  11. Linden, Gregory D., Brent Russell Smith, Nida K. Zada  (2017). Automated detection and exposure of behavior-based relationships between browsable items, 40–42.
  12. Terveen, Loren Hill (2011). Beyond Recommender Systems: Helping People Help Each Other items. University of Minnesota Twin Cities, February.
  13. Chaitanya         Belhekar.         Collaborative           Filtering          Recommender           System         (2020).         URL: https://medium.com/@chaitanyarb619/recommendationsystems-a-walk-trough-33587fecc195.
  14. Introducing intelligence in relevance search (2020). URL: https://powerapps.microsoft.com/.
  15. Robert           P.         Hanrahan.           The          IDEF           Family           of         Methods            (2020).           URL: http://www.sba.oakland.edu/faculty/mathieson/mis524/resources/readings/idef/idef.html.
  16. How search algorithms work. (2022). URL: https://www.google.com/intl/uk/search/howsearchworks/algorithms/.
  17. DFD (2020). URL: https://habr.com/ru/company/trinion/blog/340064.
  18. What is an Elasticsearch index? (2022). URL: https://www.elastic.co/what-is/elasticsearch.
  19. 5 reasons to choose mysql. (2017). URL: https://dataconomy.com/2017/04/5-reasons-challenges-mysql/.
  20. Java (2023). URL: https://techterms.com/definition/java.
  21. Why would you choose Java programming language over others? (2008). URL: https:// stackoverflow.com/questions/209555/why-would-you-choose-the-javaprogramming-language-over-others.
  22. A Collaborative Filtering Recommendation System in Java (2022). URL: https://www.baeldung.com/java- collaborative-filtering-recommendations.
  23. Shi, Yue; Larson, Martha; Hanjalic, Alan (2014). Collaborative filtering beyond the user-item matrix: A survey of the state of the art and future challenges. ACM Computing Surveys, 47. DOI: 10.1145/2556270
  24. Figueroa, Alejandro; Atkinson, John (2012). Contextual Language Models for Ranking Answers to Natural Language Definition Questions. Department of Computer Sciences, Universidad de Concepcion, Concepcion, Chile, November, 528–548. DOI: 10.1111/j.1467-8640.2012.00426.x
  25. Features in the Java language 8 (2014). URL: https://javarush.com/groups/posts/1037-osobennosti-java-8--maksimaljhnoe-rukovodstvo-chastjh-1.
  26. Liquibase allows you to perform the following (2022). URL: https://medium.com/search?q=Liquibase.
  27. N Gram (2020). URL: https://stackoverflow.com/questions/18193253/what-exactly-is-an-n-gram.
  28. Raza S., Ding C. (2022). News recommender system: a review of recent progress, challenges, and opportunities. Artif. Intell. Rev., 55, 749–800. DOI: 10.1007/s10462-021-10043-x.
  29. Zuo Y., Zeng J., Gong M., Jiao L. (2016). Tag-aware recommender systems based on deep neural networks. Neurocomputing, 204, 51–60. DOI: 10.1016/j.neucom.2015.10.134.
  30. Lytvyn V., et. al. (2019). Design of a recommendation system based on Collaborative Filtering and machine learning considering personal needs of the user. Eastern-European Journal of Enterprise Technologies, 4(2), 6–28. DOI: 10.15587/1729-4061.2019.175507.
  31. Balush I., Vysotska V., Albota S. (2021). Recommendation System Development Based on Intelligent Search, NLP and Machine Learning Methods. CEUR Workshop Proceedings,  584–617.
  32. Demchuk A., Lytvyn V., Vysotska V., Dilai M. (2020). Methods and Means of Web Content Personalization for Commercial Information Products Distribution. Lecture Notes in Computational Intelligence and Decision Making, Vol. 1020. Springer, Cham. DOI: 10.1007/978-3-030-26474-1_24.
  33. Tulashvili Y., Turbal Y., Abd Alkaleg D., Pasichnyk V., Kunanets N. (2020). The Optimal Tour Problem in Smart Tourism Recommender Systems. In IEEE 15th International Conference on Computer Sciences and Information Technologies (CSIT), Vol. 2,  246–250. DOI: 10.1109/CSIT49958.2020.9322043