OpenAI transcribed over a million hours of YouTube videos to train its LLMs, Google engaged in same practice [TechSpot]

View Article on TechSpot

In order to access more reputable English language-based text on the internet in 2021, OpenAI researchers created a speech recognition tool called Whisper, reports The New York Times. It was designed to transcribe audio from YouTube videos, giving the company a trove of data to train its LLMs.

Read Entire Article