RE: LeoThread 2025-03-17 08:15

Part 4/10:

The technology uses a vision language action (VLA) model, which merges reasoning with spatial awareness and dexterous control. This means that robots can comprehend real-world challenges—like folding laundry or organizing items—effectively mimicking human fine motor skills. Google's assertions include that Gemini models can execute "zero-shot" and "few-shot" learning, rapidly accelerating their learning processes and making them significantly more efficient in training and operation.

Core Features: Generality, Interactivity, and Dexterity

Gemini robotics emphasizes three pivotal aspects: generality, interactivity, and dexterity.

Generality ensures that the AI is versatile, capable of performing a wide range of functions across various settings, be it in homes or factories.