RE: LeoThread 2025-11-09 22-46

Part 3/11:

Video Generation Model: Built with 30 billion parameters, this transformer-based model is designed specifically for high-quality video synthesis.
Audio Model: A 13 billion parameters system handles audio creation, ensuring sound aligns perfectly with visual activity.

Training these models involved an extensive dataset comprising over 100 million video-text pairs and more than 1 billion image-text pairs, covering a broad spectrum of content including landscapes, animals, human interactions, and object motion. This diverse training ensures the AI can generate a wide array of scenarios with impressive realism.

RE: LeoThread 2025-11-09 22-46

Innovative Techniques Powering MovieGenen