OpenAI recently rolled out ChatGPT’s advanced voice mode to a select group of ChatGPT Plus subscribers, and the reactions so far have been overwhelmingly positive.
The new functionality allows ChatGPT to imitate accents, sing, correct pronunciations of different languages and even narrate stories, demonstrating the growing potential of artificial intelligence in communication.
Read too:
This guy created a TARS from the movie Interstellar using ChatGPT
OpenAI will block full access to ChatGPT in China
New ChatGPT Voice Mode Brings More Immersion
Several clips have surfaced online demonstrating ChatGPT’s impressive capabilities. One of the highlights was a demo in which X user @nickfloats asked ChatGPT to tell a story as if he were an airline pilot talking to passengers during a flight.
Guys im never talking to any of you ever again once gpt voice is released. I won’t need friends anymore. AI will tell me whatever I need to hear in any voice I want & it wont talk back or get mad when I interrupt it. Might even fuck around & fall in lovepic.twitter.com/GIRyhZYj9j
— Nick St. Pierre (@nickfloats) July 31, 2024
Not only did the chatbot respond immediately, but it also adjusted the audio to sound like it was coming from an airplane intercom. While ChatGPT still has limitations when it comes to handling more complex requests, such as adding engine sounds, the clarity and expressiveness of the voice are remarkable, and the chatbot handles user interruptions well.
Multilingualism and pronunciation correction
Another interesting detail about the advanced voice mode is ChatGPT’s ability to operate in “dozens of languages.” In a YouTube chat, the chatbot mentioned that the exact number of languages it can process depends on how dialects and regional variations are counted.
In one clip, ChatGPT demonstrated its ability to correct the pronunciation of French words, offering specific tips on inflection.
In another demonstration, ChatGPT narrated an emotional story in Turkish, with the ability to react emotionally at key points in the narrative. Although some native Turkish users noted that the accent did not sound native, the chatbot was able to deliver the story coherently and expressively.
Diversity of accents and musical styles
In addition to languages, ChatGPT is also capable of imitating regional accents in the US, including New York, Boston, Wisconsin, and the typical “valley girl” accent.
ChatGPT Advanced Voice Mode attempting various US regional accents pic.twitter.com/UvDeQUNHLp
— Cristiano Giardina (@CrisGiardina) July 31, 2024
The ability to sing in different musical styles was another feature that caught people’s attention. In one demonstration, ChatGPT performed a blues-style version of “Happy Birthday” and playfully tried to imitate what animals, such as frogs and cats, would sound like singing the same song.
ChatGPT Availability and Expansion
The advanced voice mode is currently only available to a small group of ChatGPT Plus subscribers, but OpenAI has announced that it plans to make the feature available to all ChatGPT Plus subscribers in the fall, which corresponds to the months of September, October and November.
This represents a significant step in expanding ChatGPT’s capabilities and offering new ways to interact with AI.
Fonte: The Verge
Source: https://www.hardware.com.br/noticias/novo-modo-de-voz-avancado-do-chatgpt-impressiona-ouca-como-soa.html