this post was submitted on 29 Sep 2023
429 points (93.7% liked)

Technology

59235 readers
3262 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Authors using a new tool to search a list of 183,000 books used to train AI are furious to find their works on the list.

you are viewing a single comment's thread
view the rest of the comments
[–] Touching_Grass@lemmy.world 8 points 1 year ago* (last edited 1 year ago) (1 children)

What exactly was not permitted by the license? Reading?

[–] sab@lemmy.world 13 points 1 year ago (4 children)

Using it to (create a tool to) create derivatives of the work on a massive scale.

[–] SirGolan@lemmy.sdf.org 7 points 1 year ago

Wikipedia: In copyright law, a derivative work is an expressive creation that includes major copyrightable elements of a first, previously created original work.

I think you may be off a bit on what a derivative work is. I don't see LLMs spouting out major copyrightable elements of books. They can give a summary sure, but Cliff Notes would like to have a word if you think that's copyright infringement.

[–] FaceDeer@kbin.social 7 points 1 year ago* (last edited 1 year ago)

An AI model is not a derivative work. It does not contain the copyrighted expression, just information about the copyrighted expression.

[–] lloram239@feddit.de 4 points 1 year ago (1 children)

Better tell that Google and their search index, book scanning project and knowledge graph.

[–] sab@lemmy.world 0 points 1 year ago* (last edited 1 year ago)

I didn't know those were LLMs, TIL.

[–] Touching_Grass@lemmy.world -3 points 1 year ago (1 children)

Well when that happens we have laws. So no problems

[–] sab@lemmy.world 2 points 1 year ago (2 children)

Would you be okay with applying that argument for any crime?

[–] FaceDeer@kbin.social 2 points 1 year ago

I would be, and I don't understand why you think this would be a problem. I wouldn't want the government to be preventing activities that there weren't any actual laws prohibiting.

[–] Touching_Grass@lemmy.world 0 points 1 year ago (1 children)

Ever heard of the early 21st century classic Minority Report

[–] sab@lemmy.world 4 points 1 year ago

You're missing the point. I'll make your example more specific.

Well when fraud/rape/murder happens we have laws. So no problems.

Those things happen. Creating a LLM based on copyrighted material without permission happens - it's not a hypothetical. But even then, giving a punishment after the fact does not make the initial crime "no problem", as you put it.