TL;DR: OpenAI’s ChatGPT has added voice and image prompting for Plus users, with everyone else to follow “soon after”. You can now ask questions by speaking or by uploading images. It uses advanced text-to-speech and image recognition technology, with deliberate limits to prevent misuse.
I wonder if they will use the voice recordings to train their model.
At least it’s supposed to be coming for free users as well, just not yet. We’ll see.
Not actually rolled out everywhere just yet. Current Plus subscriber, UK, Android, and I’m not seeing it in the app or as an app update.
So now it can lip read?..
🤖 I’m a bot that provides automatic summaries for articles:
Most of OpenAI’s changes to ChatGPT involve what the AI-powered bot can do: questions it can answer, information it can access, and improved underlying models.
The company is rolling out a new version of the service that allows you to prompt the AI bot not just by typing sentences into a text box but by either speaking aloud or just uploading a picture.
But the fact that you can build a capable synthetic voice with just a few seconds of audio also opens the door for all kinds of problematic use cases.
“These capabilities also present new risks, such as the potential for malicious actors to impersonate public figures or commit fraud,” the company says in a blog post announcing the new features.
OpenAI says it has deliberately limited ChatGPT’s “ability to analyze and make direct statements about people” both for accuracy and privacy reasons.
Almost a year after ChatGPT’s initial launch, OpenAI seems to still be trying to figure out how to give its bot more features and capabilities without creating new sets of problems and downsides.
Saved 73% of original text.