‘Impossible’ to create AI tools like ChatGPT without copyrighted material, OpenAI says

L4sBot@lemmy.world · 10 months ago

‘Impossible’ to create AI tools like ChatGPT without copyrighted material, OpenAI says

S410@lemmy.ml · 10 months ago

Every work is protected by copyright, unless stated otherwise by the author.
If you want to create a capable system, you want real data and you want a wide range of it, including data that is rarely considered to be a protected work, despite being one.
I can guarantee you that you’re going to have a pretty hard time finding a dataset with diverse data containing things like napkin doodles or bathroom stall writing that’s compiled with permission of every copyright holder involved.

Exatron@lemmy.world · 10 months ago

How hard it is doesn’t matter. If you can’t compensate people for using their work, or excluding work people don’t want users, you just don’t get that data.

There’s plenty of stuff in the public domain.

afraid_of_zombies@lemmy.world · 10 months ago

And artists are being compensated now fairly?

Exatron@lemmy.world · 10 months ago

Previous wrongs don’t make this instance right.

afraid_of_zombies@lemmy.world · 10 months ago

now

beckerist@lemmy.world · 10 months ago

deleted by creator

Fisk400@feddit.nu · 10 months ago

Sounds like a OpenAI problem and not an us problem.

HelloThere@sh.itjust.works · edit-2 10 months ago

I never said it was going to be easy - and clearly that is why OpenAI didn’t bother.

If they want to advocate for changes to copyright law then I’m all ears, but let’s not pretend they actually have any interest in that.

deweydecibel@lemmy.world · 10 months ago

I can guarantee you that you’re going to have a pretty hard time finding a dataset with diverse data containing things like napkin doodles or bathroom stall writing that’s compiled with permission of every copyright holder involved.

You make this sound like a bad thing.