Semantic Orientation for Indoor Navigation System using Large Language Models | Institute of Theoretical and Applied Informatics, Polish Academy of Sciences

Title	Semantic Orientation for Indoor Navigation System using Large Language Models
Publication Type	Journal Article
Year of Publication	2025
Authors	Halama M, Nowak S, Połys K
Journal	Scientific Reports
Issue	(in review)
Abstract	Autonomous robots play an important role in modern indoor navigation, but existing systems often struggle with seamless human interaction and semantic understanding of their environments. This paper presents an Artificial Intelligence (AI)-driven object recognition system enhanced by Large Language Models (LLMs), such as GPT-4 Vision and Gemini, to bridge this gap. Our approach combines vision-based mapping techniques with natural language processing and interaction to enable intuitive collaboration in solving navigation tasks. By leveraging multimodal input and vector space analysis, our system achieves enhanced object recognition, semantic embedding, and context-aware responses, setting a new standard for autonomous indoor navigation. This approach provides a novel framework for improving spatial understanding and dynamic interaction, making it suitable for complex indoor environments.
