this post was submitted on 26 Aug 2024
340 points (96.7% liked)

Technology

59300 readers
4713 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] ptz@dubvee.org 97 points 2 months ago* (last edited 2 months ago) (3 children)

My takeaway from this is:

  1. Get a bunch of AI-generated slop and put it in a bunch of individual .htm files on my webserver.
  2. When my bot user agent filter is invoked in Nginx, instead of returning 444 and closing the connection, return a random .htm of AI-generated slop (instead of serving the real content)
  3. Laugh as the LLMs eat their own shit
  4. ???
  5. Profit
[–] mesamunefire@lemmy.world 36 points 2 months ago* (last edited 2 months ago) (1 children)

I might just do this. It would be fun to write a quick python script to automate this so that it keeps going forever. Just have a link that regens junk then have it go to another junk html file forever more.

[–] capital@lemmy.world 12 points 2 months ago (1 children)

Also send this junk to Reddit comments to poison that data too because fuck Spez?

[–] TriflingToad@lemmy.world 6 points 2 months ago (1 children)

there's a something that edits your comments after 2 weeks to random words like "sparkle blue fish to be redacted by redactior-program.com" or something

[–] capital@lemmy.world 5 points 2 months ago (1 children)

That’s a little different than what I mean.

I mean to run a single bot from a script which interacts a normal human amount during normal human times within a configurable time zone which is acting as a real person just to poison their dataset.

[–] mesamunefire@lemmy.world 1 points 2 months ago (1 children)

I mean you can just not use the platform...

[–] capital@lemmy.world 1 points 2 months ago

Yes I’m already doing that.

[–] Scrollone@feddit.it 6 points 2 months ago

This is a great idea, I might create a Laravel package to automatically do this.

[–] x4740N@lemm.ee 4 points 2 months ago

QUICK

Someone create a github project that does this