this post was submitted on 12 Jul 2024

123 points (100.0% liked)

TechTakes

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

founded 2 years ago

MODERATORS

dgerard@awful.systems

LLM vendors are incredibly bad at responding to security issues (pivot-to-ai.com)

submitted 11 months ago by dgerard@awful.systems to c/techtakes@awful.systems

[–] kbal@fedia.io 44 points 11 months ago* (last edited 11 months ago) (2 children)

If models are trained on data that it would be a security breach for them to reveal to their users, then the real breach occurred at training.

[–] dgerard@awful.systems 27 points 11 months ago

now you know that and i know that,

[–] Cube6392@beehaw.org 18 points 11 months ago

The big LLMs everyone's talking about and using are just advanced forms of theft

[–] sailor_sega_saturn@awful.systems 32 points 11 months ago* (last edited 11 months ago) (4 children)

Sloppy LLM programming? Never!

In completely unrelated news I've been staring at this spinner icon for the past five minutes after asking an LLM to output nothing at all:

[–] self@awful.systems 22 points 11 months ago

same energy as “your request could not be processed due to the following error: Success”

[–] earthquake@lemm.ee 19 points 11 months ago (1 children)

What are the chances that the front end was not programmed to handle the LLM returning an empty string?

[–] sailor_sega_saturn@awful.systems 16 points 11 months ago

Quite likely yeah. There's no way they don't have a timeout on the backend.
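
A minimal sketch of the failure mode being guessed at here, assuming a hypothetical `render_reply` helper in the UI layer (nothing in this thread confirms how the real front end works). The idea is simply that an empty or whitespace-only reply is treated as a finished response rather than as "still waiting":

```python
from typing import Optional

# Hypothetical placeholder text; a real UI would pick its own wording.
EMPTY_REPLY_PLACEHOLDER = "(the model returned an empty reply)"


def render_reply(raw: Optional[str]) -> str:
    """Return the text the UI should show for a completed model reply.

    None, "" and whitespace-only strings all count as a finished (empty)
    reply, so the caller can stop the spinner instead of waiting forever.
    """
    if raw is None or raw.strip() == "":
        return EMPTY_REPLY_PLACEHOLDER
    return raw
```

With something like this in place, "output nothing at all" ends in a visible placeholder instead of a spinner, whatever the backend timeout happens to be.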

[–] dgerard@awful.systems 10 points 11 months ago (1 children)

boooo Gemini now replies "I'm just a language model, so I can't help you with that."

[–] froztbyte@awful.systems 9 points 11 months ago (1 children)

"what would a reply with no text look like?" or similar?

[–] dgerard@awful.systems 8 points 11 months ago (1 children)

> what would a reply with no text look like?

nah it just described what an empty reply might look like in a messaging app

they seem to have done quite well at making Gemini do mundane responses

[–] froztbyte@awful.systems 8 points 11 months ago

that's a hilarious response (from it). perfectly understand how it got there, and even more laughable

[–] casmael@lemm.ee 24 points 11 months ago (1 children)

LLM vendors are incredibly bad ~~at responding to security issues~~

[–] Tar_alcaran@sh.itjust.works 9 points 11 months ago (1 children)

They're surprisingly skilled at getting money from idiots.

[–] skillissuer@discuss.tchncs.de 6 points 11 months ago

their previous experience in crypto is shining

[–] corbin@awful.systems 19 points 11 months ago

My NSFW reply, including my own experience, is here. However, for this crowd, what I would point out is that this was always part of the mathematics, just like confabulation, and the only surprise should be that the prompt doesn't need to saturate the context in order to approach an invariant distribution. I only have two nickels so far, for this Markov property and for confabulation from PAC learning, but it's ~~completely expected~~ weird that it's happened twice.

[–] motor_spirit@lemmy.world 9 points 11 months ago

Lol that's like expecting gold rushers to be squared away with OSHA, I hope nobody's surprised here

[–] sunzu@kbin.run 9 points 11 months ago

These guys got barely enough staff to run the model lol

[–] 0laura@lemmy.world -5 points 11 months ago (1 children)

Not really a security issue I'd say. The AI speaking gibberish when you try to make it speak gibberish isn't really that big of an issue.

[–] froztbyte@awful.systems 14 points 11 months ago (2 children)

sure hope you're not in charge of security anywhere

[–] blakestacey@awful.systems 25 points 11 months ago (1 children)

Correction: I sure hope they're in charge of security at some place I don't like.

[–] froztbyte@awful.systems 11 points 11 months ago

.......okay fine I'll take it

[–] 0laura@lemmy.world -5 points 11 months ago (3 children)

How is it inherently a security issue when an LLM speaks gibberish? Genuine question.

[–] froztbyte@awful.systems 9 points 11 months ago* (last edited 11 months ago) (1 children)

it "speaking gibberish" is not the problem. the answer to your question is literally in the third paragraph in the article.

if you do not comprehend what it references or implies, then (quite seriously) if you are in any way involved in any security shit get the fuck out. alternatively read up some history about, well, literally any actual technical detail of even lightly technical systems hacking. and that's about as much free advice as I'm gonna give you.

[+] 0laura@lemmy.world -6 points 11 months ago* (last edited 11 months ago) (2 children)

[removed by mod]

[–] self@awful.systems 17 points 11 months ago (1 children)

> Genuine question.

> So rude, you didn’t answer my question at all.

yeah find me one single instance of someone doing this “genuine question” shit that doesn’t result in the most bad faith interpretation possible of the answers they get

> If I’m missing something obvious I’d love it if you told me.

- most security vulnerabilities look like they cause the targeted program to spew gibberish, until they’re crafted into a more targeted attack
- it’s likely that gibberish is the LLM’s training data, where companies are increasingly being encouraged to store sensitive data
- there’s also a trivial resource exhaustion attack where you have one or more LLMs spew garbage until they’ve either exhausted their paid-for allocation of tokens or cost their hosting organization a relative fuckload of cash
- either you knew all of the above already and just came here to be a shithead, or you’re the type of shithead who doesn’t know fuck about computer security but still likes to argue about it
- fuck off
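
The resource exhaustion point in the list above can be sketched as a hard output cap enforced outside the model. This is only an illustration under stated assumptions: `TokenBudget` and `stream_with_cap` are hypothetical names, and counting whitespace-separated words is a crude stand-in for a real tokenizer.

```python
from typing import Iterable, Iterator


class TokenBudget:
    """Tracks token spend against a fixed limit (hypothetical helper)."""

    def __init__(self, limit: int) -> None:
        self.limit = limit
        self.used = 0

    def charge(self, tokens: int) -> bool:
        """Record usage; return False and record nothing if it would exceed the limit."""
        if tokens < 0:
            raise ValueError("tokens must be non-negative")
        if self.used + tokens > self.limit:
            return False
        self.used += tokens
        return True


def stream_with_cap(chunks: Iterable[str], budget: TokenBudget) -> Iterator[str]:
    """Yield model output chunks until the budget would be exceeded, then stop."""
    for chunk in chunks:
        cost = len(chunk.split())  # assumption: word count approximates token count
        if not budget.charge(cost):
            break
        yield chunk
```

The cap lives in ordinary code rather than in the prompt, so a model that has been provoked into spewing garbage still gets cut off once the allocation is spent.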

[–] froztbyte@awful.systems 9 points 11 months ago

the amount of times I've had to clean shit up after someone like this "didn't think $x would matter"...

[–] froztbyte@awful.systems 14 points 11 months ago (1 children)

so you start by claiming that you don't think there's any problematic security potential, follow it up by clarifying that you actually have no fucking understanding of how any of it could work and might matter, and then you get annoyed at the response? so rude, indeed!

[–] V0ldek@awful.systems 8 points 11 months ago

User input doing unexpected stuff to the backend = Bad™

[–] kbal@fedia.io 2 points 11 months ago* (last edited 11 months ago) (1 children)

It's a reasonable question, and the answer is perhaps beyond my ken even though I've had substantial experience with both building machine learning models (mostly in pre-LLM times) and keeping computer systems secure. That a chatbot might tell someone “how to make a bomb” is probably not a great example of the dangers they pose. Bomb making instructions are more or less available to everyone who can find chemistry textbooks. The greater dangers that the LLM owners are trying to guard against might instead be more like having one advising someone that they should make a bomb. That sort of thing could be hazardous to the financial security of the vendor as well as the health of its users.

Finding an input that will make the machine produce gibberish is not directly equivalent to the kind of misbehaviour that often indicates exploitable bugs in software that "crashes" in more conventional ways. But it may be loosely analogous to it, in that it's an observation of unintended behaviour which might reveal flaws that would otherwise remain hidden, giving attackers something to work with.

[–] froztbyte@awful.systems 9 points 11 months ago* (last edited 11 months ago) (3 children)

so there's 3 immediately-suggestive paths that come to mind from this

the first is that gibbering prompts itself already means you've hit a boundary in the design of its execution space (or fucking around in the very edges of training data where its precision gets low), and that could mean you are beyond what the programmers thought of/handled. whether or not you can get reliable further behaviours in that mode/space will be extremely contingent on a lot of factors (model type, execution type, runtime, ...), but given how extremely rapidly and harshly oai (and friends) reacted to simple behavioural breaks I get the impression that they're more concerned with such cases than they might be letting on

the second fairly obvious vector is where everyone is trying to shove LLMs into everything without good safety boundaries. oh that handy chatbot on your doctor/airline/insurance/.... site that's pitched as "it can use your identification details and look up $x"[0], that means that system has access to places where to look up private data. so if you could break a boundary via whatever method, who's to say it can't go further. it's not like telling the prompt "do $x and only $x" will work, as many examples have shown

third path, and sort-of the one that ties the bow on the second a bit, is that most of these dipshits probably don't have proper isolation controls, just because it's hard and effortful. building actual multitenancy with strong inter-tenant separation is a lot of work. that's something that's just not done in bayfucker world unless it is specifically needed. so the more these things get shoved into various products and this segmentation work is not done thoroughly, the more likely that sort of shit becomes

[0] - couple years back (pre-llm) I worked on exactly this problem with a client. it's fantastically annoying to design, not half because humans are such wonderfully unpredictable input sources
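
A rough sketch of what the second and third paths argue for: the access check happens in the tool handler, keyed on the server-side session, never on what the model was told in the prompt. Everything here (`Session`, `RECORDS`, `lookup_record`, the tenant names) is hypothetical and for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Session:
    """Identity established by the server at login, not by the model."""
    tenant_id: str
    user_id: str


# Hypothetical in-memory data standing in for a real per-tenant datastore.
RECORDS = {
    ("clinic-a", "alice"): {"booking": "A-100"},
    ("clinic-b", "bob"): {"booking": "B-200"},
}


def lookup_record(session: Session, requested_tenant: str, requested_user: str) -> dict:
    """Tool handler: the model supplies requested_*, the server supplies session.

    The request is refused unless it matches the authenticated session, so a
    prompt that talks the model into asking for someone else's data still fails.
    """
    if (requested_tenant, requested_user) != (session.tenant_id, session.user_id):
        raise PermissionError("cross-tenant or cross-user lookup refused")
    return RECORDS[(requested_tenant, requested_user)]
```

This does not make the model trustworthy. It only means that breaking the prompt boundary does not automatically break the data boundary, which is the segmentation work described above as hard and often skipped.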

[–] kbal@fedia.io 8 points 11 months ago

Yeah, no doubt they will push to give the things built atop the shaky foundation of LLMs as much responsibility and access to credentials as they think they can get away with. Making the models trustworthy for such purposes has been the goal since DeepMind set off in that direction with such optimism. There are a lot of people eager to get there, and a lot of other people eager to give us the impression right now that they will get there soon. That in itself is one more reason they react with some alarm when the products are easily provoked into producing garbage.

I'm sure it will go wrong in many interesting ways. Seems to me there are risks they haven't begun to think about. There's a lot of focus on preventing the models producing output that's obviously morally offensive, very little thought given to the idea that output entirely within the bounds of what is thought acceptable might end up accidentally calibrated to reinforce and perpetuate the existing prejudices and misconceptions the machines have learned from us.

[–] barsquid@lemmy.world 6 points 11 months ago

Why would they bother with safety boundaries for AI? Companies leak millions of records of PII all the time and there are zero real consequences. Of course we will start seeing access level bypass exploits leaking customer data.

[–] Tar_alcaran@sh.itjust.works 4 points 11 months ago

> couple years back (pre-llm) I worked on exactly this problem with a client. it's fantastically annoying to design, not half because humans are such wonderfully unpredictable input sources

Oh don't worry, humans are amazingly unpredictable interfaces too, which is why social engineering works so well.