Those experts said some of the invented text — known in the industry as hallucinations — can include racial commentary, violent rhetoric and even imagined medical treatments.
Those experts said some of the invented text — known in the industry as hallucinations — can include racial commentary, violent rhetoric and even imagined medical treatments.
I’ve been using Whisper with TankieTube and I’m curious whether these errors were made with the Large-v2 or the Large-v3 model. I suspect it was the latter, because its dataset includes output from the other.
Snake eating its own tail, etc.
In your experience, has whisper large c3 been much worse than vo2?
I haven’t done any comparing; I just went with the apparent consensus, which is that v2 was more accurate and hallucinated less.
In your experience, has whisper large c3 been much worse than vo2?