YouTuber launches first-of-its-kind class action lawsuit against OpenAI

The reckoning over companies like OpenAI using creators’ content without their permission might finally be at hand.

Since OpenAI dropped the first version of ChatGPT in November 2022, it’s faced scrutiny over what data it uses to make its generative AI products, with many creators concerned their videos have been scraped and dumped into the slush. Creators understandably want to keep control of the content they make, and while it’s too late to extract their videos from already-scraped datasets, it may be possible for them to receive compensation for the violation of their ownership rights–and set a precedent that could prevent other companies from taking what’s theirs in the future.

That’s the goal of a lawsuit from David Millette, a Massachusetts man who’s now opened a class action lawsuit against OpenAI seeking $5 million in damages for himself and other creators.

Subscribe to get the latest creator news

Millette, who’s had a YouTube account since 2009, alleges OpenAI has engaged in the “surreptitious, non-consensual transcription of millions of YouTube users’ videos […] to train Defendants’ AI software products,” and that it’s “profited significantly” by doing so. The suit specifically refers to allegations that OpenAI created a speech recognition model, Whisper, to transcribe audio, then used Whisper to transcribe millions of hours of YouTube content. Those transcriptions were reportedly used to train GPT-4.

The lawsuit alleges that by scraping creators’ videos, OpenAI violated copyright law, since creators retain ownership rights to any videos they upload thanks to YouTube’s terms of service.

“Much of the material in OpenAI’s training datasets […] comes from works–including videos created and uploaded by Plaintiff–that were copied by OpenAI without consent, without credit, and without compensation,” the suit alleges.

As TechCrunch points out, the reason makers of large language models (LLMs) like ChatGPT have turned to using video transcriptions for training is because they’ve already scraped everything they can from the rest of the internet, and because more and more text-based websites are now installing blockers to keep future scrapes from happening. Over 35% of the world’s top 1,000 websites have those protections in place.

If you’re wondering whether YouTube is looking into solutions like that to prevent external scrapings, we’re not sure. But there’s a bigger concern with YouTube: it talks a big game about keeping creators safe in the advent of genAI, but it’s allegedly also scraping transcriptions of creators’ videos and using them to train Google‘s own AI products.

Millette’s lawsuit is a civil case, but it does ask the presiding judge to state that OpenAI violated copyright laws, something that could expose the company to future criminal charges. And, like we mentioned above, Millette’s also seeking $5 million in damages–which, since this is a class action lawsuit, would be split between him and any other affected creators in the event that things are decided in his favor.

If Millette wins his case, creators may receive some cash. But their data will still be part of potentially dozens of genAI products because there are no established protections for creators against having their videos, writing, art, and more scraped and subsumed into training sets. Until those protections are in place, creators like Millette have to fight for themselves, and hope judgments in their favor will deter companies who want to use their content without permission.

Published by

James Hale

Tags: copyright lawcreatorsgenaiopenaiYouTube

2 years ago

Top 5 Branded Videos of the Week: Big views
'Tis the season for festive holiday beverages, and some of YouTube's biggest channels are raising…
Tubi looks to build on momentum by hunting for creators at VidCon
Within the streaming industry, Tubi continues to raise its profile, and creators are a big…
Creators are popping up all over India. A college program is training them.
India's growing class of professional creators is getting access to a new training program. At MICA,…

Tubi looks to build on momentum by hunting for creators at VidCon

Within the streaming industry, Tubi continues to raise its profile, and creators are a big…

6 hours ago

News

Creators are popping up all over India. A college program is training them.

India's growing class of professional creators is getting access to a new training program. At MICA,…

8 hours ago

News

Are usernames WhatsApp’s path toward becoming a creator hub?

On June 29, Meta asked its users to get a handle on their handles, because usernames are…

9 hours ago

Homepage Feature

Expedia’s newest campaign tells travelers they can go global just like IShowSpeed

"I want to travel the whole world one day. Through space, to the moon, different…

1 day ago

News

Instagram invites users to tweak the algorithm with categorical tags

In recent years, social media companies have explored an interesting query: Do individual users understand…