OpenAI’s ChatGPT announced a major upgrade will be available shortly that will change the way you interact with the generative AI tool.
The new functionality will be available under its subscription plan ChatGPT Plus.
In the next few weeks, OpenAI will add voice and listening capabilities that will allow the user to talk to the AI and have it respond with speech.
While the ability to listen and respond is not new territory, other tools like Alexa and Siri have limited power to respond creatively the way that ChatGPT can.
OpenAI offers a selection of voices to choose from carefully trained by hired voice actors.
ChatGPT will be merged with existing technology called Whisper which can listen to users and transcribe voice conversations into text.
The new software will be added to take ChatGPT’s answers and convert them into spoken words.
But the wildest addition to the new release has to be adding “sight” to the tool.
Using the new DALL-E image creation tool, ChatGPT will be able to look at pictures and recognize and react to them.
For instance, a user can upload a picture of the Eiffel Tower and ask ChatGPT to tell the user about what is in the image.
ChatGPT will recognize the image and be able to give a history of the tower without any additional prompts.
A parent can take a picture of their child’s math homework and circle a problem to have ChatGPT help them find an answer.
Perhaps one of the coolest use cases is through a partnership with BeMyEyes, an app that helps blind and low-vision people, “see” the world around them via a video connection to volunteer helpers.
But OpenAI is using ChatGPT’s seeing feature to plug directly into the app and help immediately without waiting for a helper to pick up.