this post was submitted on 11 Oct 2024
708 points (99.4% liked)
Technology
Maybe it's a strange form of activism that's trying to poison new AI models
Which would not work, since all the tech giants have already archived the pre-AI internet
Ah, so the AI version of the Chewbacca defense.
I have to wonder if intentionally shitting on LLMs with plausible nonsense is effective.
Like, you watch for certain user agents and change what data you actually send the bot vs what a real human might see.
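Roughly what I mean, as a minimal sketch and not anything a real site is known to run. The marker list and both function names are made up for illustration; GPTBot and CCBot are real crawler tokens, but an actual list would need to be longer and kept up to date.

```python
# Hypothetical substrings to look for; illustrative only, not a complete list.
BOT_MARKERS = ("gptbot", "ccbot", "bot", "crawler", "spider")


def looks_like_bot(user_agent: str) -> bool:
    """Return True if the User-Agent header contains any known bot marker."""
    ua = user_agent.lower()
    return any(marker in ua for marker in BOT_MARKERS)


def choose_body(user_agent: str, real_text: str, decoy_text: str) -> str:
    """Pick the decoy text for suspected bots and the real text for everyone else."""
    return decoy_text if looks_like_bot(user_agent) else real_text
```

You'd call `choose_body` with the request's User-Agent header before sending the page. Anything that sends a normal browser string gets the real text, so this only catches bots that announce themselves.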
I suspect it would be difficult to generate enough data to intentionally change a dataset. There are certainly little holes, like the glue pizza thing, but finding and exploiting them would be difficult, while noticing you and blocking you as a data source would be easy.
I don't think so. The volume of data is too large for it to make much of a difference, and a scraper can just mimic a human user agent and work that way.
You'd have to change so much data consistently across so many different places that it would be near-impossible for a single human effort.
I never said I think it is smart…