OpenAI’s announcement on Monday revealed that ChatGPT has undergone a significant update, granting it the ability to comprehend spoken language, respond using a synthetic voice, and process images. This marks the most substantial enhancement to the chatbot since the introduction of GPT-4.
Users can now voice chat via the ChatGPT mobile app and choose from five different voices that the bot will respond to. Additionally, users will be able to share images with ChatGPT, allowing them to identify specific areas of interest or request verification, such as verification of cloud type.
OpenAI, plans to implement these changes for paying users within the next two weeks. Notably, the voice functionality will be accessible exclusively through the iOS and Android apps, while image processing capabilities will be available across all platforms.

The significant feature expansion coincides with the escalating competition in the artificial intelligence race involving leading chatbot companies like OpenAI, Microsoft, Google, and Anthropic.
These tech giants are in a race to not only introduce new chatbot applications but also roll out fresh functionalities, particularly during the current summer season. Google, for instance, has unveiled a series of updates for its Bard chatbot, while Microsoft has incorporated visual search capabilities into Bing.
OpenAI Investments and Worries About Fake Voices
Earlier this year, Microsoft’s substantial additional investment of $10 billion in OpenAI marked it as the largest AI investment of the year, as reported by PitchBook.
In April, OpenAI reportedly concluded a share sale worth $300 million, valuing the company at approximately $27 billion to $29 billion, with financial support coming from prominent firms like Sequoia Capital and Andreessen Horowitz.
Experts have expressed concerns about synthetic voices produced by artificial intelligence; In this case, this voice can provide users with more accurate information and can also open the door to many beliefs.
Cybersecurity threat actors and researchers have begun investigating the possibility of using deep-rooted vulnerabilities to compromise cybersecurity systems.
In a statement released on Monday, OpenAI confirmed that the audio links were “created in collaboration with artists with whom we have a working relationship” and not by strangers.
The press release also includes safeguards to protect that information if used. As such, there is no information about the intended use of OpenAI’s extensive customer voice feedback content. OpenAI’s terms of service state that customers retain ownership of their ideas “to the extent permitted by law.”
OpenAI has developed voice instructions. An interface stating that OpenAI does not store audio recordings and that the audio clips themselves are stored. It is not used in model development.
However, it is worth noting that the manual states that the notes are classified as ideas and will be used to improve the performance of their models. general language.
Conclusion
The latest update to OpenAI’s ChatGPT marks a major advance by adding speech recognition and conversational responses. Microsoft’s major investment increases interest in the smart chatbot competition. But concerns remain about fraud caused by AI-generated voices. OpenAI addresses some concerns but leaves open the question of data use, highlighting the changing nature of AI.
Master AI Before It Masters You
Learn how to leverage AI to boost your productivity and accelerate your career.