Researchers say an AI-powered transcription tool used in hospitals invents things no one ever said

Yuritopiaposadism [none/use name]@hexbear.net · 3 days ago

Researchers say an AI-powered transcription tool used in hospitals invents things no one ever said

TankieTanuki [he/him]@hexbear.net · edit-2 3 days ago

I’ve been using Whisper with TankieTube and I’m curious whether these errors were made with the Large-v2 or the Large-v3 model. I suspect it was the latter, because its dataset includes output from the other.

The Whisper large-v3 model was trained on 1 million hours of weakly labeled audio and 4 million hours of pseudo-labeled audio collected using Whisper large-v2.

Snake eating its own tail, etc.

gay_king_prince_charles [she/her, he/him]@hexbear.net · 3 days ago

In your experience, has whisper large c3 been much worse than vo2?

TankieTanuki [he/him]@hexbear.net · edit-2 3 days ago

I haven’t done any comparing; I just went with the apparent consensus, which is that v2 was more accurate and hallucinated less.

gay_king_prince_charles [she/her, he/him]@hexbear.net · 3 days ago

In your experience, has whisper large c3 been much worse than vo2?