Part 2/9:
The introduction of new voice agents demonstrates OpenAI's commitment to creating natural human interfaces. According to the OpenAI team, voice capabilities are underutilized in current AI applications, and with improvements in text-to-speech (TTS) and speech-to-text (STT) technologies, developers are now presented with new opportunities to enhance user experience through voice.
New Model Offerings
During their live stream, members of the OpenAI team announced the release of three new models designed for robust voice experiences:
- Two New Speech-to-Text Models: These models have shown to outperform previous versions, such as Whisper, across various languages.