ChatGPT Plus is getting Advanced Voice Mode


OpenAI has announced Advanced Voice Mode for its ChatGPT Plus users. It offers more natural, real-time conversations, allows users to interrupt anytime, and senses and responds to your emotions.

The mode will utilize four preset voices for the output, ensuring people’s privacy. We’ve trained the model to only speak in the four preset voices, and we built systems to block outputs that differ from those voices, said the company.

With this new advanced mode, OpenAI is trying to offer its users with a human-like conversation with the AI model. Understanding our senses with the voice is the highlight here, letting the model detect users’ emotions and reply appropriately.

The company also confirmed to be working on video and screen-sharing capabilities of ChatGPT, which should be released later.

Availability

The rollout of the Advanced Voice Mode is in the alpha stage. Eligible ChatGPT Plus users would have received an email with instructions and a message in their mobile app. The company has also promised to add more people later.

The official rollout should happen between September and November 2024.

Regarding the safety and quality of these voice conversations, OpenAI said,

Since we first demoed advanced Voice Mode, we’ve been working to reinforce the safety and quality of voice conversations as we prepare to bring this frontier technology to millions of people.

We tested GPT-4o’s voice capabilities with 100+ external red teamers across 45 languages. To protect people’s privacy, we’ve trained the model to only speak in the four preset voices, and we built systems to block outputs that differ from those voices. We’ve also implemented guardrails to block requests for violent or copyrighted content.

We plan to share a detailed report on GPT-4o’s capabilities, limitations, and safety evaluations in early August.