Major improvements to ChatGPT will allow the chatbot to respond to voice instructions and visual inquiries. Users will have the ability to feed photos into ChatGPT on every platform and engage in voice conversations with it on both iOS and Android devices.
If you want to test out voice discussions in the ChatGPT application, you must opt-in. Five voices are available for you to select by touching the mic button.
The latest text-to-speech algorithm, according to OpenAI, powers the exchange of voice dialogues and can produce humanoid audio from only written words and just a few moments of sample voice.
It used professional actors to help create the five voices. The business’s Whisper voice recognition system, on the other hand, transforms an individual’s spoken words into text.
The features based on images are also fascinating. According to OpenAI, you could ask the chatbot to answer a maths problem you’ve snapped a picture of and show it a picture of your smoker. and inquire why it fails to ignite or get it to assist you in planning a dinner based on a photograph of what’s in the refrigerator. In fact, Microsoft emphasized the Copilot AI’s aptitude for maths issues during the previous week’s Surface event.
GPT-3.5 and GPT-4 are used by OpenAI to fuel its image classification capabilities. Tap the photo icon to snap a picture or select an existing photo from your smartphone to enjoy ChatGPT’s visual features. You may use a tool for drawing to zoom in on a particular area of the image while asking ChatGPT about numerous images.
The possibility for damage was mentioned by OpenAI in a blog post introducing the revisions. It’s conceivable for dishonest people to imitate famous people’s voices and maybe conduct fraud. For this reason, OpenAI is concentrating on ChatGPT voice chats using this technology and collaborating with a few chosen partners on additional constrained use cases.
Also Read: Palantir Wins $250 Million AI Deal with US Defence Department
Regarding visuals, OpenAI collaborated with Be My Eyes, a free tool that enables people to join video conversations with blind and low-vision users in order to assist them in better interpreting their surroundings.
“Users have told us they find it valuable to have general conversations about images that happen to contain people in the background, like if someone appears on TV while you’re trying to figure out your remote-control settings,” OpenAI said.
finance.yahoo.com
I am a student pursuing my bachelor’s in information technology. I have a interest in writing so, I am working a freelance content writer because I enjoy writing. I also write poetries. I believe in the quote by anne frank “paper has more patience than person