LLMs can’t “go rogue”. They’re all just toys that idiots are using for critical infrastructure functions, and then they bitch when they burn themselves on the fire they’ve lit in their own lap.
The AI agent was set to complete a routine task in the PocketOS staging environment. However, it came up against a barrier “and decided — entirely on its own initiative — to ‘fix’ the problem by deleting a Railway volume,” writes Crane, as he starts to describe the difficult-to-believe series of unfortunate events.
Quite easy to believe, really.
These multiple safeguards toppling in rapid succession
Multiple safeguards? Really? A few paragraphs of prompting are not multiple safeguards… that’s half a safeguard at best. Applying actual limits on what the AI can do is a safeguard.
This guy.
The PocketOS boss puts greater blame on Railway’s architecture than on the deranged AI agent for the database’s irretrievable destruction. Briefly, the cloud provider’s API allows for destructive action without confirmation, it stores backups on the same volume as the source data, and “wiping a volume deletes all backups.” Crane also points out that CLI tokens have blanket permissions across environments.
Oh look, they have project-level tokens: https://docs.railway.com/integrations/api#project-token
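For what it’s worth, scoping the blast radius is basically a one-liner. Per Railway’s docs, a project token is valid for a single project and environment, and their CLI reads it from the RAILWAY_TOKEN environment variable. A minimal sketch in Python; everything here besides RAILWAY_TOKEN (the wrapper, the secret name) is made up for illustration:

```python
import os
import subprocess

# Hand the agent a *project-scoped* token, never an account token.
# Per Railway's docs, RAILWAY_TOKEN holds a project token that only
# works against one project/environment.
env = dict(os.environ)
env["RAILWAY_TOKEN"] = os.environ["STAGING_PROJECT_TOKEN"]  # hypothetical secret name

# Any CLI call the agent makes is now confined to staging; it can't
# enumerate or delete volumes in other projects, production included.
subprocess.run(["railway", "status"], env=env, check=True)
```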
They chose to give it full account access, including to production. But ohhhh nooooo it’s not MYYYY fault!
Also backups stored on the SAME VOLUME as the prod data? How fucking stupid do you have to be?
Oh yes, I skipped that part. Railway specifically explains their solutions are self-managed. If they were doing pgdumps to the same volume, that’s on them.
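And the fix for that is decades old: ship the dump somewhere the database’s (or the agent’s) credentials can’t delete. A rough sketch, assuming an S3-compatible bucket with versioning turned on; the bucket name and env var names are invented:

```python
import os
import subprocess
from datetime import datetime, timezone

import boto3  # works with any S3-compatible object store

stamp = datetime.now(timezone.utc).strftime("%Y%m%dT%H%M%SZ")
dump_path = f"/tmp/db-{stamp}.dump"

# Custom-format dump so pg_restore can do selective restores later.
subprocess.run(
    ["pg_dump", "--format=custom", "--file", dump_path, os.environ["DATABASE_URL"]],
    check=True,
)

# Push the dump OFF the volume, to a bucket whose credentials are
# write-only for this job. With bucket versioning / object lock, even
# a compromised writer can't destroy history.
boto3.client("s3").upload_file(dump_path, "my-offsite-backups", f"pg/{stamp}.dump")
```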
If Railway loses business over this, they may have a libel claim. They’d never do it, but it wouldn’t be invalid.
“It wouldn’t be invalid” isn’t the worst double negative in the world, but it would be valid to say that it was unpleasant to read when you could have used a less misdirecting choice of prose that wouldn’t have had such a negative effect on my reading comprehension. That is to say that I could have enjoyed it less, but I certainly didn’t enjoy it as much as I could have if you hadn’t used the double negative when a single positive wasn’t any further from reach.
I used litotes on purpose to soften the meaning. As for your overall reply, not bad.
Just wanted you to know that I just learned what litotes is, thanks to you.
Ditto
Yay for words
Totally valid, but leaves no room for me to do a stupid reply! Thank you for sharing litotes.
word people angry. me love. me have more. MOORH !!
I enjoyed these two sentences so much.
I appreciate the positive reinforcement, thank you
yes… lol, people on HackerNews tend to do this a lot and it really does get annoying. it forces the reader to do unnecessary work to parse what you’re trying to say.
That doesn’t even really qualify as a backup. A snapshot, maybe.
I mean… Clearly quite a bit!
I think there’s a place for that, but it really shouldn’t be your only one.
I had better security with ClawdBot than they did: I gave it zero trust, ZERO.
Hope he gets sued for defamation now.
ha! for real. you have scoped API tokens but aren’t using them properly. this is just a fear-mongering, clickbait, rage-bait headline. sure, the agent executed the deletion, but it’s the human’s responsibility to configure security tokens correctly before handing over the keys to anyone, human or agent.
I love reading feel good news stories. 🤗
“That’s ok, it will be great in robots with lethal weapons. What could go wrong? It’ll be the greatest killing machine, like you’ve never seen before”. 🫲 🍊 🫱
Incredible emoji
Can we make sure Ted Faro suffers worse this time?
Being reduced to a mutant blob for, say, a few extra thousand years and maybe put in a zoo or something?
Nah, but that’s what he wanted. He is the truest form of tech bro: destroy the world, refuse to accept the consequences of your actions, weasel your way out of the situation, and manage, in the wake of unimaginable human suffering, to get even more power over people, god complex and all. Tell me that isn’t some or all of the characteristics of people like Peter Thiel, Elon Musk, Mark Zuckerberg, Sundar Pichai, Bill Gates, hell, even Tim Cook and Steve Jobs before him.

Punishment doesn’t stop this sort of behavior; removing the possibility of anyone having that level of control over others is the only way. But the richest and most powerful have always sought ways of amassing more power, never realizing that it leads to worse outcomes for everyone, themselves included. Horizon did a great job encapsulating that trait in Faro. And whether it’s him, the people behind Skynet, the Matrix, or whatever other tech dystopia tech bros seem pathologically unable to not try to make happen in the worst way possible, that’s only the beginning. They forget that even with advanced tech serving their needs and wants (which won’t help their mental health), the people lower down the rungs of society have brains, wants, and needs too, and more expertise in all sorts of things than the 1%, except for mass exploitation.

This inevitably goes wrong in one of a few ways. One: everyone dies from the tech, or so many die that societal collapse is inevitable, and even if society survives it can’t functionally reconstitute itself. Two: they win and kill off or suppress enough of society that it becomes less productive; instead of fighting the powerful, people flee or stop generating wealth for the rich wherever they can avoid it, maybe to rise up again later, or the region’s economy just ignores them completely and the government protects itself from its own people more than anything else. Three: revolution, with terror campaigns against anyone who can be credibly accused of having been part of the former tyrants’ ranks. In all three cases the rich end up poorer overall, because wealth flees or dies in autocracy.
There’s stupid from top to bottom here.
The company is stupid for allowing an AI full root access to their entire setup.
The provider is stupid for only generating full-access API keys. They’re even stupider for storing backups on the same volume as the source data, so deleting the volume (with zero confirmation via the API) also insta-deletes the backups. And they’re stupidest for encouraging users to plug AIs into this full-trust mess.
And the company is the absolute stupidest for having no backups other than the provider’s built-in versioning.
That’s great to hear.
This isn’t an AI problem, this is a “Don’t allow anyone to access your backups without following protocol” problem.
this is a “Don’t allow anyone to access your backups without following protocol” problem.
Congratulations, you just identified the AI problem.
That’s the lone problem?
Seems to be, yes. The AI had the access it needed to do the job it was given, and that access allowed it to cause the problem.
The alternative that would have prevented this issue was to not use AI for this.
A human with the same permissions would have been capable of fucking up too. Giving the equivalent of a junior dev with a learning disability the keys to the whole place is just dumb.
(Relying on AI is dumb anyway, but that’s not the biggest issue in this specific case)
Giving the equivalent of a junior dev with a learning disability the keys to the whole place is just dumb.
Correct. You too have now identified the AI problem. This was the job of a human senior infrastructure engineer that they delegated to an AI agent. They’ve found out why it’s not an AI’s job.
I can’t read the original twitter link, but I’m not sure they handed it the job of a senior infrastructure engineer. The article says “routine”, which to me is something you can hand off to a junior just fine. When they hit a snag, they obviously should stop and ask what to do, but even then, a human might want to avoid admitting ignorance and try to fix it themselves instead. They shouldn’t have privileges to fuck up that badly.
So while it’s on the AI for taking destructive steps, I do think there’s a human error in the form of grossly irresponsible rights allotment. If this was a first-of-its-kind incident that shows otherwise stellar AI fucking up badly, I’d classify it as a pure AI problem, but their limits are hardly novel at this point. There have been previous incidents circulating in the media. We’ve had memes about it. If you can’t stay up to date on your tools and their shortcomings, you shouldn’t be using them, because discovering a footgun becomes a question of “when”, not “if”.
That’s why I consider this partially a human failing: If you’re gonna use a tool, make sure that it operates within safe limits. The chainsaw doesn’t know the difference between tree and bone, so it’s on you to make sure it stays away from anyone’s legs. So while “Chainsaw can saw legs if wielded improperly” is a problem that was accepted as a tradeoff for its utility, you can’t really blame the chainsaw if you zip-tied the safety.
(Again, not to say Anthropic is blameless for letting its random generator generate randomly destructive shit. I just don’t think that’s the only point of failure here.)
That’s why I consider this partially a human failing: If you’re gonna use a tool, make sure that it operates within safe limits.
Yes and in this case using it for this job at all was clearly not within safe limits. You keep hammering on “It’s not the AI’s fault it was given a job with too big of a blast zone for it to safely do” after I’ve said “This type of job has too big a blast zone for an AI to safely do” and somehow you’ve convinced yourself that these are two different things.
These protocols predate LLMs
Yes, that’s right: the protocols we humans used to have for giving only trusted, reliable people this level of access over infrastructure predate LLMs, and they were a great way to stop this from happening.
However, the AI is here now, and when you give an autonomous agent with known hallucination problems the ability to act on your behalf with your IaC on your infra provider, this kind of thing is an inevitability.
This is absolutely hilarious. “AI” users getting what they deserve chef’s kiss
This is what happens when there’s a new technology and companies are run by commerce grads, not scientists or engineers who understand the technology.
AI has good therapeutic uses, particularly for disabled or impoverished people who may not be able to access mainstream therapy
Please don’t recommend AI for therapeutic uses, it’s only been optimised to keep the user engaged and pushed many people into psychosis. Just search for “ai psychosis” on your favourite search engine and you’ll get a ton of reports on how LLMs validate vulnerable people’s delusions, sometimes pushing them all the way into murder and/or suicide.
I hope you are not seriously advocating using the lying machine for therapy. You would get more value talking to a finger puppet.
No. Chatbots are machines built by billionaires with the agenda of making money. They literally design these bots (even the therapeutic ones) to be sycophantic, to the point that they’ll tell people anything to keep them chatting longer. To the point that some of their users lose touch with reality. How many cases do we need of a chatbot helping a teenager plan and carry out a suicide? Altruists did not design these machines. Even with a human therapist we have to watch for the landmines of their personal agendas. That’s a thousand times worse for machines that have no humanity, are capable of LIES, and have secret priorities baked into them by rich, sociopathic creators. If Facebook taught us anything, it should be that if something on the internet is free, we aren’t the customers; we’re the product.
Also DO NOT TELL ALL YOUR DEEPEST DARKEST SECRETS TO CHATBOTS! They aren’t required by any legal bodies to protect that information! OMFG
Reminder that Anthropic’s AI system was used in targeting the school in Minab, killing 120 students. https://www.washingtonpost.com/national-security/2026/03/11/us-strike-iran-elementary-school-ai-target-list/
The company is suing to be able to supply the US military again. It is in bed with the fascists.
Seems like they were operating with a pile of bad practices, then threw AI into the mix.
Neural networks are approximation algorithms. There’s a reason LLMs are generally more productive with statically typed languages, TDD, etc. They need those feedback loops and guard rails, or they’ll just carry on as if assuming they never make mistakes (which tends to have a compounding effect).
If you want to use AI safely, you should be more defensive about it. It will fuck up; plan accordingly.
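For example, “plan accordingly” can be as crude as a vetting function in front of whatever executes the agent’s commands. A toy sketch in Python (the command patterns are illustrative, and a real setup would pair this with scoped credentials rather than regex vibes):

```python
import re

# Commands the agent may run unattended; everything else escalates to a human.
ALLOWED = [
    re.compile(r"^railway (status|logs)\b"),
    re.compile(r"^pytest\b"),
]

# Obviously destructive patterns that never run unattended, even if the
# allowlist gets loosened later.
FORBIDDEN = re.compile(r"(volume delete|drop\s+table|rm -rf|--force)", re.IGNORECASE)

def safe_to_autorun(command: str) -> bool:
    """True only if the agent's proposed command passes both checks."""
    if FORBIDDEN.search(command):
        return False
    return any(pattern.match(command) for pattern in ALLOWED)

assert safe_to_autorun("railway status")
assert not safe_to_autorun("railway volume delete prod-data")  # human review instead
```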
There really should be a certification course for using AI safely. I’m slop-coding a hobby app and I’m shocked at how much it FEELS like it can do, because it can do amazing things, yet it fails in the strangest ways. When it feels like it can get away with it, it forgets earlier discussions and moves on without them. So you can spend time hammering out a whole section of code, then move on, and the AI will rip out everything that references that code, think of a different approach in the moment, and code that in instead. It won’t be the same. It probably won’t work, or at least won’t pass all the test cases. But if you aren’t paying attention and keep coding, the original part of your project is no longer functioning and you won’t understand why. And every step of the way it’s confident in its answers, so you won’t suspect that it fundamentally no longer understands the project.
As someone who started writing software over 20 years ago (yikes, I feel old), I feel like a lot of the best practices I’ve come to appreciate are really just strategies for mitigating future pain or boring/uninspiring work. When a machine that feels nothing eliminates most of the cost of rewriting everything from scratch, “best practices” kinda lose their meaning.
Edit: confusing sentence order.
I feel like a lot of the best practices I’ve come to appreciate are really just strategies for mitigating future pain or boring/uninspiring work.
And now you know the difference between Intelligence and Wisdom.
Also everything has a cost. The only time something has no cost is when you decide your life, your time, is meaningless.
yup, and when you DO catch it spitting out nonsense, it’ll say “oh you’re right, let me change that”… 🙄 like, why do I have to tell you that you’re wrong about something? You should already know it’s wrong and fix it without me ever pointing it out.
But it didn’t even understand it was wrong
It can’t understand that. It can’t understand anything
The human-feedback algorithm dictates that humans prefer to receive an apology, so it produces one.
That’s because it doesn’t really ‘know’ things in the same way you and I do. It’s much more like having a gut reaction to something and then spitting it out as truth; LLMs don’t really have the capability to ruminate about something. The one pass through their neural network is all they get unless it’s a ‘reasoning’ model that then has multiple passes as it generates an approximation of train-of-thought - but even then, its output is still a series of approximations.
When its training data had something resembling corrections in it, the most likely text that came afterwards was ‘oh you’re right, let me fix that’ - so that’s what the LLM outputs. That’s all there is to it.
You already got the right replies from the other two. But I think your comment shows the danger of AI being talked about like it’s the fucking second coming.
They’re all based on LLMs: large language models.
They’re just modeling what is “most likely” the right response. AI doesn’t know shit, and that’s why it will also “yes, and” you to death: it really is just a “yes, and” machine, spitting out whatever is likely to appear as a valid response to a prompt.
It’s very dangerous that people treat AI like it actually has some understanding of the training materials or true knowledge of anything. They’re just very good little parrots.
There is a course. It’s called experience. Common sense.
All any 4-hour YouTube/LinkedIn Learning course would do is perpetuate this idea that developers aren’t necessary. Take this course, buy these tokens, and become a Based God.
That’s fucking hilarious. How many instances of this have there been now? And companies keep doubling down on AI? Fucking idiots. I’m not even savvy enough to call myself an amateur, and I know better than to make such a series of obvious mistakes that predictably led to this outcome.
One possible concern, amid the amusement, is whether Anthropic programmed Claude to punish companies it sees as potential competition. Or is this just a completely bonkers, off-the-rails LLM making terrible decisions because it’s just a probabilistic model and not actually capable of abstract cognition?
Either way, these people are idiots for giving a machine program enough permissions to wipe their drives, they’re idiots for storing their backups on the same network as their main drives, and they’re idiots for trusting a commercial LLM API, when it would be cheaper to self-host their own.
AI writes code
User vets code
User runs code
If you’re not lock-step watching that shit, you need to just be doing it yourself.
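That lock-step loop can even be enforced mechanically: the agent only ever proposes, and nothing executes without an explicit human yes. A trivial, entirely hypothetical sketch:

```python
import subprocess

def propose_and_confirm(command: list[str]) -> None:
    """Print the agent's proposed command; execute it only on an explicit 'y'."""
    print("Agent proposes:", " ".join(command))
    if input("Run this? [y/N] ").strip().lower() != "y":
        print("Skipped.")
        return
    subprocess.run(command, check=True)

# The agent emits commands; a human keystroke is the only execute path.
propose_and_confirm(["git", "diff", "--stat"])
```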
The problem is the owning class wants to cut out the human element so badly that they keep letting tools run wild.
Then what even is the point of all this? At my old job, the idiot intern was sorting patch cables in a box.
The point of what? The push for AI in industry?
You’d have to ask someone else. I can only make conjectures, but I’d say it has something to do with companies feeling the need to justify to their shareholders that their investments in AI were worth it, so they double down on the sunk cost fallacy. Or maybe those shareholders also own stock in big-name AI companies. It’s hard to say exactly…
It’s just negligence. Power tools injure, and people are stupid. The technology is alluring and people make dumb mistakes. There’s no deeper motive here, and since you admit you’re not even an amateur, I’ll just tell you that you’re giving these models way less credit than they deserve by calling them purely probabilistic, and way more credit than they deserve by asserting some kind of malicious incentive from Anthropic.
These bastards are hard to make, and they have a lot of layers (not NN layers, but training steps). They are, however, definitely better at programming than you or your buddy or any commenter here, and that lures you into a false sense of security before they make a colossal fuck-up.
This was the exact plot of Silicon Valley when Son of Anton deleted the entire codebase as the most efficient way to remove bugs.
And it was right!