A New Hybrid Search Approach to Optimize the Retrieval of Information from the Website at the Universidad Politécnica Salesiana

Juan P. Salgado-Guerrero, Diego F. Quisi-Peralta, Martin Lopez-Nores, Luis D. Paguay-Palaguachi, Jordan F. Murillo-Valarezo, Gabriela Cajamarca-Morquecho

Producción científica: Capítulo del libro/informe/acta de congresoContribución de conferenciarevisión exhaustiva

Resumen

This paper presents a novel hybrid search approach to improve information retrieval from the Salesian Polytechnic University website, addressing the challenge of efficiently managing and accessing the growing volume of information. Leveraging virtual assistant technology, the study combines vector similarity and keyword-based techniques to optimize data retrieval. The methodology involves a structured process, including information gathering, architecture design, search execution and analysis of the results. The system architecture consists of three key layers: the intelligent layer, which uses the OpenAI API for query processing; the data layer, which uses the Qdrant database for storage; and the logic layer, responsible for query execution. Two search methods are applied: Vector similarity search, which retrieves data based on contextual relevance, and keyword search with BM25, which sorts documents by keyword relevance. Testing and analysis confirm that the hybrid search method significantly improves the efficiency and accuracy of information retrieval. The results show a significant improvement in the request measures obtained, where the 4 highest percentages were selected to obtain the context from which the answer is derived. The highest similarity values were 5.56, followed by 3.84, the effectiveness of this method in various knowledge areas of the university website. In conclusion, the hybrid search approach presented in this paper offers a promising solution to efficiently retrieve information from the Salesian Polytechnic University website, improve accessibility and ultimately improve user satisfaction.

Idioma originalInglés
Título de la publicación alojadaInformation Technology and Systems - ICITS 2024
EditoresAlvaro Rocha, Jorge Hochstetter Diez, Carlos Ferras, Mauricio Dieguez Rebolledo
EditorialSpringer Science and Business Media Deutschland GmbH
Páginas247-257
Número de páginas11
ISBN (versión impresa)9783031542343
DOI
EstadoPublicada - 2024
EventoInternational Conference on Information Technology and Systems, ICITS 2024 - Temuco, Chile
Duración: 24 ene. 202426 ene. 2024

Serie de la publicación

NombreLecture Notes in Networks and Systems
Volumen932 LNNS
ISSN (versión impresa)2367-3370
ISSN (versión digital)2367-3389

Conferencia

ConferenciaInternational Conference on Information Technology and Systems, ICITS 2024
País/TerritorioChile
CiudadTemuco
Período24/01/2426/01/24

Nota bibliográfica

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.

Huella

Profundice en los temas de investigación de 'A New Hybrid Search Approach to Optimize the Retrieval of Information from the Website at the Universidad Politécnica Salesiana'. En conjunto forman una huella única.

Citar esto