Part 3/10:
One of the headline advances comes from Google, in collaboration with UC Berkeley: the debut of LM-Nav, a self-navigating robot driven by multimodal AI. The robot can interpret natural-language instructions, such as "please clean up" or "go to the kitchen," and carry out the corresponding tasks autonomously.
What makes LM-Nav particularly impressive is its integration of three AI models: a large language model (GPT-3 in the original work) that parses an instruction into a sequence of landmarks, a vision-language model (CLIP) that matches those landmarks against the robot's camera observations, and a visual navigation model (ViNG) that plans and executes the route. This synergy lets the robot set its objectives, monitor progress, steer around obstacles, and adapt dynamically, all from natural-language input.
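To make the hand-off between those three stages concrete, here is a minimal Python sketch of how such a pipeline could be wired together. Everything in it is a hypothetical stand-in: `parse_instruction`, `ground_landmark`, and `plan_route` are toy placeholders for the LLM, vision-language, and navigation models, not LM-Nav's actual code or API.

```python
from dataclasses import dataclass


@dataclass
class Landmark:
    """One landmark phrase extracted from the instruction."""
    description: str


def parse_instruction(instruction: str) -> list[Landmark]:
    """LLM stage: split a free-form instruction into ordered landmark
    phrases. A trivial comma/'then' split stands in for a real LLM."""
    text = instruction.replace(" then ", ",")
    return [Landmark(p.strip()) for p in text.split(",") if p.strip()]


def ground_landmark(landmark: Landmark, captions: list[str]) -> int:
    """Vision-language stage: pick the stored observation that best
    matches the landmark. A real system scores image-text similarity
    (e.g. CLIP embeddings); word overlap over captions fakes it here."""
    words = set(landmark.description.lower().split())
    scores = [len(words & set(c.lower().split())) for c in captions]
    best = max(range(len(captions)), key=scores.__getitem__)
    return best if scores[best] > 0 else -1  # -1: landmark not found


def plan_route(waypoints: list[int]) -> list[int]:
    """Navigation stage: turn grounded waypoints into an executable
    route. A real planner searches a topological graph of the
    environment; here we simply visit matched waypoints in order."""
    return [w for w in waypoints if w >= 0]


if __name__ == "__main__":
    # Captions standing in for images the robot has already observed.
    captions = ["a red stop sign", "a tall blue building", "an open gate"]
    instruction = "go past the stop sign, then head to the blue building"
    landmarks = parse_instruction(instruction)
    route = plan_route([ground_landmark(lm, captions) for lm in landmarks])
    print("waypoint order:", route)  # -> waypoint order: [0, 1]
```

The design point the sketch preserves is that raw language never reaches the planner directly: the instruction is first reduced to landmarks, the landmarks to grounded waypoints, and only then is a route executed.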