Database Indexing Using Deep Learning Algorithms

2026;
: pp. 18 - 26
ISSN: 2524-065Х (рrint); 2663-0001 (оnline)

https://doi.org/10.23939/sisn2026.19.018
Received: November 05, 2025
Accepted: April 22, 2026
1
Lviv Polytechnic National University, Information Systems and Networks Department, Lviv, Ukraine
ORCID: 0009-0001-3676-3566
2
Lviv Polytechnic National University, Information Systems and Networks Department, Lviv, Ukraine
ORCID: 0000-0003-2029-7270

Summary. Automation of database indexing is a crucial component of modern database management systems that enhances search performance, scalability, and relevance in large-scale data environments. This paper explores the application of deep learning algorithms for building and optimizing vector indexes capable of automatic adaptation to changes in data structure and query patterns. An experimental comparison was conducted between traditional indexing methods (B-Tree, GIN in PostgreSQL) and vector-based indexing using Sentence-BERT embeddings implemented in FAISS and Milvus systems.

The study involved constructing a corpus of 1–5 million text records and performing a series of tests to evaluate performance metrics, including average query time, latency (95th percentile), index size, memory usage, and the search completeness metric (recall@10) relative to the BM25 baseline search. The results revealed that vector indexing achieved up to 0.94 recall@10 with comparable query latency, significantly outperforming traditional approaches in semantic search efficiency.

The practical contribution of this work lies in demonstrating the feasibility of hybrid DBMS architectures that combine traditional SQL indexing for fast filtering with vector-based methods for semantic refinement. This approach lays the foundation for the development of intelligent, self-optimizing databases capable of autonomous adaptation. Future research should focus on reinforcement learning for dynamic index restructuring, optimization of computational costs, and the integration of vector indexing into cloud-based and distributed systems.

  1. BERT. (n.d.). Huggingface.co. https://huggingface.co/docs/transformers/model_doc/bert
  2. Briggs, J. (2023). Introduction to Facebook AI Similarity Search (Faiss). Pinecone.io; Pinecone. https://www.pinecone.io/learn/series/faiss/faiss-tutorial
  3. Chen, Y., Liang, G., & Wang, Q. (2025). Research on SQL Injection Detection Technology Based on Content Matching and Deep Learning. Computers, Materials & Continua/Computers, Materials & Continua (Print), 0(0), 1–10. https://doi.org/10.32604/cmc.2025.063319
  4. Chhabra, L. (2019, December 7). Search Engine Using Machine Learning and NLP - Analytics Vidhya - Medium. Medium; Analytics Vidhya. https://medium.com/analytics-vidhya/search-engine-using-machine-learning...
  5. E. James, B., & P.O, A. (2017). Hybrid Database System for Big Data Storage and Management. International Journal of Computer Science, Engineering and Applications, 7(3/4), 15–27. https://doi.org/10.5121/ ijcsea.2017.7402
  6. Gao, A., Ai, T., Yu, H., Xiao, T., Chen, Y., Li, J., & Huang, H. (2024). A vector-based coastline shape classification approach using sequential deep learning model. International Journal of Applied Earth Observation and Geoinformation, 129, 103810–103810. https://doi.org/10.1016/j.jag.2024.103810
  7. Introduction to Facebook AI Similarity Search (Faiss) | Pinecone. (n.d.). Www.pinecone.io. https://www.pinecone. io/learn/series/faiss/faiss-tutorial/
  8. Kamal Berahmand, Fatemeh Daneshfar, Elaheh Sadat Salehi, Li, Y., & Xu, Y. (2024). Autoencoders and their applications in machine learning: a survey. Artificial Intelligence Review, 57(2). https://doi.org/10.1007/s10462-023-10662-6
  9. Milvus, a highly performant distributed vector database for AI apps. (2025). Milvus.io. https://milvus.io/intro
  10. Obaid, L., Hamad, K., Al-Ruzouq, R., Dabous, S. A., & Ismail, K. (2025). Automating the Estimation of Turning Movement Rates at Multilane Roundabouts Using Unmanned Aerial Vehicles and Deep Learning. Green Energy and Intelligent Transportation, 100340. https://doi.org/10.1016/j.geits.2025.100340
  11. Peyman Jafary, Davood Shojaei, Abbas Rajabifard, & Ngo, T. (2024). Automated land valuation models: A comparative study of four machine learning and deep learning methods based on a comprehensive range of influential factors. Cities, 151, 105115–105115. https://doi.org/10.1016/j.cities.2024.105115
  12. Sinha, M., & Paul, S. (2025). Automated Detection of Craters on the Lunar Surface Using Deep Learning: A Review with Insights from Chandrayaan-2 TMC-2 Data. Results in Earth Sciences, 100094–100094. https://doi.org/ 10.1016/j.rines.2025.100094
  13. Theodoropoulos, N., Andronas, D., Kampourakis, E., & Makris, S. (2025). Automated pallet loading of irregularly shaped objects: A deep reinforcement learning and multi-criteria optimization method. Journal of Manufacturing Systems, 80, 916–932. https://doi.org/10.1016/j.jmsy.2025.04.014
  14. What is a Vector Database & How Does it Work? Use Cases + Examples | Pinecone. (2023). Www.pinecone.io. https://www.pinecone.io/learn/vector-database
  15. Zhou, Q., Sun, T., Shen, T., & Xue, Y. (2025). A high-performance tensor computing unit for deep learning acceleration. Chip, 4(2), 100145. https://doi.org/10.1016/j.chip.2025.100145