Building an On-Premises Knowledge Repository with Large Language Models for Instant Information Access

Burak Dobur; Engin Bıçakcı; Asli Terim; Cemal Arık

doi:10.56038/oprd.v5i1.545

Research Article · Open Access · Orclever Native

Affiliations: Burak Dobur, Engin Bıçakcı, Asli Terim, and Cemal Arık are all affiliated with Procat.

Published: December 31, 2024

Vol. 5, No. 1 · pp. 261–273

Abstract

This project designs and develops a live knowledge library that uses large language models (LLMs) to give users access to up-to-date information across multiple domains. The system is deployed on-premises and responds to user queries instantly, streamlining information retrieval. By drawing on the natural language processing (NLP) capabilities of LLMs, the project seeks to improve decision-making and operational efficiency within organizations. It addresses the growing need for rapid information access by returning precise answers to user inquiries and reducing the delays inherent in traditional search methods. A user-friendly interface with short response times makes information retrieval more intuitive. The project also aims to improve internal knowledge flow by supporting communication and collaboration across departments. The solution is designed to scale and to adapt to different sectors. By continuously learning from and adapting to new data, the system keeps its information current, which reduces reliance on manual updates and the human error that accompanies them. Ultimately, the project aims to raise productivity, support effective decision-making, and give organizations a competitive advantage through AI-driven knowledge management.

Keywords: Knowledge Library, Large Language Models, AI, Real-time Information Retrieval, Decision-making

Cite This Article

Dobur, B., Bıçakcı, E., Terim, A., Arık, C. (2024). Building an On-Premises Knowledge Repository with Large Language Models for Instant Information Access. *Orclever Proceedings of Research and Development*, 5(1), 261-273. https://doi.org/10.56038/oprd.v5i1.545

Bibliographic Info

Journal: Orclever Proceedings of Research and Development

Volume: 5

Issue: 1

Pages: 261–273

Published: December 31, 2024

eISSN: 2980-020X