Semantic Orientation for Indoor Navigation System using Large Language Models

TitleSemantic Orientation for Indoor Navigation System using Large Language Models
Publication TypeJournal Article
Year of Publication2025
AuthorsHalama M, Nowak S, Połys K
JournalScientific Reports
Issue(in review)
Abstract

Autonomous robots play an important role in modern indoor navigation, but existing systems often struggle with seamless human interaction and semantic understanding of environments. This paper presents an Artificial Intelligence (AI)-driven object recognition system enhanced by Large Language Models (LLMs), such as GPT-4 Vision and Gemini, to bridge this gap. Our approach combines vision-based mapping techniques with natural language processing and interactions to enable intuitive collaboration in solving navigation tasks. By leveraging multimodal input and vector space analysis, our system achieves enhanced object recognition, semantic embedding, and context-aware responses, setting a new standard for autonomous indoor navigation. This approach provides a novel framework for improving spatial understanding and dynamic interaction, making it suitable for complex indoor environments.

Historia zmian

Data aktualizacji: 18/12/2024 - 13:35; autor zmian: Konrad Połys (kpolys@iitis.pl)