It was inevitable. Nearly all publicly available data on the web is being used to train AI models by the AI giants, with the help of third-party dataset generators and without any permission. Proof News reveals that even your YouTube videos are being used for that purpose, without your consent.
Everybody’s work is being exposed to AI dataset generators
As stated by Proof News: “Apple, Nvidia, Anthropic, and other big tech companies used thousands of swiped YouTube videos to train AI. Creators claim their videos were used without their knowledge”. The research site claims that tech companies are turning to controversial tactics to feed their data-hungry artificial intelligence models, vacuuming up books, websites, photos, and social media posts, often unbeknownst to the creators.
Shrouded in secrecy
Proof News adds that AI companies are generally secretive about their sources of training data, but its investigation found that some of the wealthiest AI companies in the world have used material from thousands of YouTube videos to train AI. Companies did so despite YouTube’s rules against harvesting materials from the platform without permission. “Our investigation found that subtitles from 173,536 YouTube videos, siphoned from more than 48,000 channels, were used by Silicon Valley heavyweights, including Anthropic, Nvidia, Apple, and Salesforce,” the site adds. Proof News also found material from YouTube megastars, including MrBeast (289 million subscribers, two videos taken for training), Marques Brownlee (19 million subscribers, seven videos taken), Jacksepticeye (nearly 31 million subscribers, 377 videos taken), and PewDiePie (111 million subscribers, 337 videos taken). Some of the material used to train AI also promoted conspiracies such as the flat-Earth theory.
Layers on layers of data: Irreversible process
It’s important to emphasize that training on this data is effectively irreversible. Once a model has been trained, every parameter reflects contributions from many pieces of data at once. In practice, an AI model cannot simply “untrain” a specific video: its influence is entangled with everything else the model learned, and removing it would mean retraining from scratch.
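A toy illustration of why this is so (a minimal sketch, nowhere near how production models are actually trained): gradient updates from every training example flow into the same shared weights, so one example’s contribution cannot be cleanly subtracted afterwards — you can only retrain without it and get a different model.

```python
def train(data, lr=0.1, steps=200):
    """Fit y = w * x by plain gradient descent on squared error."""
    w = 0.0
    for _ in range(steps):
        for x, y in data:
            grad = 2 * (w * x - y) * x  # gradient of (w*x - y)^2
            w -= lr * grad
    return w

# Two "sources" (think: subtitles from two different videos)
# nudge the very same weight during training.
combined = [(1.0, 2.0), (2.0, 3.0)]
w_all = train(combined)

# Retraining without the second source yields a different weight.
# There is no cheap operation that recovers it from w_all alone.
w_without = train([(1.0, 2.0)])
print(abs(w_all - w_without) > 0.01)  # True: contributions are entangled
```

Scale that single weight up to billions of parameters and the point stands: deleting one creator’s videos from a finished model is not a lookup-and-remove operation.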
YouTube and Sora
Moreover, OpenAI executives have repeatedly declined to publicly answer questions about whether the company used YouTube videos to train its AI product Sora, which creates videos from text prompts. Earlier this year, a reporter with The Wall Street Journal put the question to Mira Murati, OpenAI’s chief technology officer. “I’m actually not sure about that,” Murati replied. That non-answer speaks for itself. So next time you upload a video of your Porsche to YouTube, be aware that Sora may well be trained on it, without your consent. According to those dataset generators, the use of YouTube content for AI training purposes qualifies as “fair use.” Yeah, you heard right: AI companies think that taking your videos to train on them is fair use. Oh, and without any compensation, meaning you get nothing for it. I have a question: WHERE ARE THE LAWYERS?
Possible solution: Marking and money!
First, YouTube needs to address this ASAP by notifying creators and explaining to them when their videos have been used for training without permission. Second, videos used for training should be marked, as should AI-generated imagery: every AI video must be labeled “Made by AI.” Third, creators should be compensated twice: by the dataset generators, and by the AI giants who train on those datasets. Only when these “voluntary” dataset generators have to pay money will they understand the consequences of harming the creators who feed them. It’s time to stop this circus. Take an example from Blackmagic.