Part 3/12:
One major disappointment was the negligible emphasis on multimodal functions. Despite ongoing industry debates about the importance of combining text, images, and videos in AI models, GPT-5's presentation lacked robust demonstrations or even discussions on these fronts. Voice capabilities saw some improvements, but video and image functionalities were notably missing, suggesting that OpenAI may not have prioritized multimodality in this generation.