• 𝕸𝖔𝖘𝖘@infosec.pub
    link
    fedilink
    English
    arrow-up
    2
    ·
    2 days ago

    An LLM can’t “go rogue”. They’re all just toys that idiots are using for critical infrastructure functions, then they bitch when they burn themselves on the fire they’ve created in their lap.

  • IronKrill@lemmy.ca
    link
    fedilink
    English
    arrow-up
    52
    ·
    6 days ago

    The AI agent was set to complete a routine task in the PocketOS staging environment. However, it came up against a barrier “and decided — entirely on its own initiative — to ‘fix’ the problem by deleting a Railway volume,” writes Crane, as he starts to describe the difficult-to-believe series of unfortunate events.

    Quite easy-to-believe, really.

    These multiple safeguards toppling in rapid succession

    Multiple safeguards? Really? Multiple paragraph prompts are not multiple safeguards… it’s half a safeguard at best. Applying limits on what the AI can do is a safeguard.

  • Fmstrat@lemmy.world
    link
    fedilink
    English
    arrow-up
    91
    ·
    7 days ago

    This guy.

    The PocketOS boss puts greater blame on Railway’s architecture than on the deranged AI agent for the database’s irretrievable destruction. Briefly, the cloud provider’s API allows for destructive action without confirmation, it stores backups on the same volume as the source data, and “wiping a volume deletes all backups.” Crane also points out that CLI tokens have blanket permissions across environments.

    Oh look, they have project level tokens: https://docs.railway.com/integrations/api#project-token

    They chose to give it full account access, including to production. But ohhhh nooooo it’s not MYYYY fault!

      • Fmstrat@lemmy.world
        link
        fedilink
        English
        arrow-up
        24
        ·
        7 days ago

        Oh yes, I skipped that part. Railway specifically explains their solutions are self-managed. If they were doing pgdumps to the same volume, that’s on them.

        If Railway loses business over this, they may have a libel claim. They’d never do it, but it wouldn’t be invalid.

        • el_abuelo@programming.dev
          link
          fedilink
          English
          arrow-up
          9
          arrow-down
          2
          ·
          7 days ago

          “It wouldn’t be invalid” isn’t the worst double negative in the world but it would be valid to say that it was unpleasant to read it when you could have used a less misdirecting choice of prose that wouldn’t have had such a negative effect on my reading comprehension. That is to say that I could have enjoyed it less but I certainly didnt enjoy it as much as i could have if you hadn’t used the double negative when a single positive wasn’t any further from reach.

      • Bilb!@lemmy.ml
        link
        fedilink
        English
        arrow-up
        8
        ·
        7 days ago

        That’s doesn’t even really qualify as a backup. A snapshot, maybe.

    • queueBenSis@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      1
      ·
      7 days ago

      ha! for real. you have scoped API tokens, but not using it properly. this is just a fear mongering click bait rage bait headline. sure, the agent executed the deletion, but it’s the human’s responsibility to configure security tokens correctly before handing the keys to anyone, human or agent.

  • WhatsHerBucket@lemmy.world
    link
    fedilink
    English
    arrow-up
    70
    arrow-down
    2
    ·
    7 days ago

    “That’s ok, it will be great in robots with lethal weapons. What could go wrong? It’ll be the greatest killing machine, like you’ve never seen before”. 🫲 🍊 🫱

    • Napster153@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      7 days ago

      Can we make sure to make Ted Farro suffers worse this time?

      Being reduced to a mutant blob for, say, a few extra thousand years and maybe put in a zoo or something?

      • Pman@lemmy.org
        link
        fedilink
        English
        arrow-up
        2
        ·
        7 days ago

        Nah but that’s what he wanted, he is the truest form of tech bro, destroy the world, refuse to accept consequences of his actions, weaseled his way out of the situation and managed to, in the wake of unimaginable human suffering, get more power over people and has a god complex tell me this isn’t some or all the characteristics of people like Peter Theil, Elon Musk, Mark Zuckerberg, Sundar Pichai, Bill Gates, hell even Tim Cook and Steve Jobs before him. Punishment doesn’t stop this sort of behavior but removing the possibility of someone having that level of control over others is the only way but the richest and most powerful have always sought ways of amassing more power not realizing that that leads to worse off situations for everyone including themselves, Horizon did great encapsulating that trait in Faro, but be it him, the people behind Skynet, the Matrix or whatever other tech dystopia that tech bros seem pathologically unable to not try to make happen in the worst way possible is only the beginning, they seem to forget that even with advanced tech that serves their needs and wants, which won’t help their mental health, the people lower down on the rungs of society have brains, wants and needs, and they have more expertise in all sorts of things than the 1% are except for mass exploitation. This inevitably goes wrong one of a few ways, either everyone dies from the tech, or so many that societal collapse is inevitable not great and even if society survives it can’t functionally reconstitute itself; 2 they win and kill off or supress enough of society that the society becomes less productive and instead of fighting the powerful they flee or don’t participate in wealth generating for the rich were they don’t have to, maybe to rise up again later or the economy of the region just ignores them completely and the government protects themselves from their people more than anything else, or 3rd your revolution with terror campaigns against any and all who can be credibly accused of being part of the former tyrants. In all 3 cases the richer people end up poorer overall because wealth flees or dies in autocracy.

  • SirEDCaLot@lemmy.today
    link
    fedilink
    English
    arrow-up
    12
    ·
    5 days ago

    There’s stupid from top to bottom here.

    The company is stupid for allowing an AI full root access to their entire setup.

    The provider is stupid for only generating full-access API keys. They’re even stupider for storing backups with a volume, so deleting the volume (zero confirmation via API key) also insta-deletes the backups. And they’re stupidest for encouraging users to plug AIs into this full-trust mess.

    And the company is absolute stupidest for having no backups other than the provider’s builtin versioning.

  • SabinStargem@lemmy.today
    link
    fedilink
    English
    arrow-up
    74
    arrow-down
    1
    ·
    7 days ago

    This isn’t an AI problem, this is an “Don’t allow anyone access your backups without following protocol.” problem.

    • Encrypt-Keeper@lemmy.world
      link
      fedilink
      English
      arrow-up
      31
      arrow-down
      9
      ·
      7 days ago

      this is an “Don’t allow anyone access your backups without following protocol.” problem.

      Congratulations you just identified the AI problem.

        • Encrypt-Keeper@lemmy.world
          link
          fedilink
          English
          arrow-up
          5
          arrow-down
          3
          ·
          7 days ago

          Seems to be, yes. The AI had the access it needed to do the job it was given, and that access allowed it to cause the problem.

          The alternative that would have prevented this issue was to not use AI for this.

          • luciferofastora@feddit.org
            link
            fedilink
            English
            arrow-up
            5
            arrow-down
            1
            ·
            7 days ago

            A human with the same permissions would have been capable of fucking up too. Giving the equivalent of a junior dev with a learning disability the keys to the whole place is just dumb.

            (Relying on AI is dumb anyway, but that’s not the biggest issue in this specific case)

            • Encrypt-Keeper@lemmy.world
              link
              fedilink
              English
              arrow-up
              2
              ·
              7 days ago

              Giving the equivalent of a junior dev with a learning disability the keys to the whole place is just dumb.

              Correct. You too have now identified the AI problem. This was the job of a human senior infrastructure engineer that they delegated to an AI agent. They’ve found out why it’s not an AI’s job.

              • luciferofastora@feddit.org
                link
                fedilink
                English
                arrow-up
                2
                arrow-down
                1
                ·
                7 days ago

                I can’t read the original twitter link, but I’m not sure they handed it the job of a senior infrastructure engineer. The article says “routine”, which to me is something you can hand off to a junior just fine. When they hit a snag, they obviously should stop and ask what to do, but even then, a human might want to avoid admitting ignorance and try to fix it themselves instead. They shouldn’t have privileges to fuck up that badly.

                So while it’s on the AI for taking destructive steps, I do think there’s a human error in the form of grossly irresponsible rights allotment. If this was a first-of-its-kind incident that shows otherwise stellar AI fucking up badly, I’d classify it as a pure AI problem, but their limits are hardly novel at this point. There have been previous incidents circulating the media. We’ve had memes about it. If you can’t stay up to date on your tools and their shortcomings, you shouldn’t be using them, because discovering a footgun becomes a question of “when”, not “if”.

                That’s why I consider this partially a human failing: If you’re gonna use a tool, make sure that it operates within safe limits. The chainsaw doesn’t know the difference between tree and bone, so it’s on you to make sure it stays away from anyone’s legs. So while “Chainsaw can saw legs if wielded improperly” is a problem that was accepted as a tradeoff for its utility, you can’t really blame the chainsaw if you zip-tied the safety.

                (Again, not to say Anthropic is blameless for letting its random generator generate randomly destructive shit. I just don’t think that’s the only point of failure here.)

                • Encrypt-Keeper@lemmy.world
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  6 days ago

                  That’s why I consider this partially a human failing: If you’re gonna use a tool, make sure that it operates within safe limits.

                  Yes and in this case using it for this job at all was clearly not within safe limits. You keep hammering on “It’s not the AI’s fault it was given a job with too big of a blast zone for it to safely do” after I’ve said “This type of job has too big a blast zone for an AI to safely do” and somehow you’ve convinced yourself that these are two different things.

        • Encrypt-Keeper@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          7 days ago

          Yes that’s right the protocols that we humans used to have for giving only trusted, reliable people this level of access over infrastructure predate LLMs and were a great way to stop this from happening.

          However the AI is here now, and when you give an autonomous agent with known hallucination problems access to act on your behalf with your IaC on your infra provider, this kind of thing is an inevitability.

  • fum@lemmy.world
    link
    fedilink
    English
    arrow-up
    47
    arrow-down
    4
    ·
    7 days ago

    This is absolutely hilarious. “AI” users getting what they deserve chef’s kiss

    • SaveTheTuaHawk@lemmy.ca
      link
      fedilink
      English
      arrow-up
      5
      arrow-down
      1
      ·
      6 days ago

      This is what happens when there is a new technology and companies are run by commerce grads, not scientist or engineers that understand the technology.

      • kazerniel@lemmy.world
        link
        fedilink
        English
        arrow-up
        20
        arrow-down
        1
        ·
        6 days ago

        Please don’t recommend AI for therapeutic uses, it’s only been optimised to keep the user engaged and pushed many people into psychosis. Just search for “ai psychosis” on your favourite search engine and you’ll get a ton of reports on how LLMs validate vulnerable people’s delusions, sometimes pushing them all the way into murder and/or suicide.

      • Cherries@lemmy.world
        link
        fedilink
        English
        arrow-up
        16
        ·
        6 days ago

        I hope you are not seriously advocating using the lying machine for therapy. You would get more value talking to a finger puppet.

      • Doom@lemmy.world
        link
        fedilink
        English
        arrow-up
        13
        ·
        6 days ago

        No. Chatbots are machines built by billionaires with the agenda of making money. They litterally design these bots (even the therapeutic ones) to be sycophantic to the point they tell people anything to keep them chatting longer. To the point some of their users lose touch with reality. How many cases do we need of a chatbots helping a teenager plan and succeed at a suicide? Altruists did not design these machines. Even with a human therapist we have to watch for the landmines of their personal agendas. That’s a thousand times worse for machines that have no humanity, are capable of LIES, and have secret unwritten priorites written into their code by rich sociopathic creators. If facebook taught us anything it should be that if something is free on the internet it’s not because we are the customers.

        Also DO NOT TELL ALL YOUR DEEPEST DARKEST SECRETS TO CHATBOTS! They aren’t required by any legal bodies to protect that information! OMFG

  • percent@infosec.pub
    link
    fedilink
    English
    arrow-up
    42
    arrow-down
    1
    ·
    7 days ago

    Seems like they were operating with a pile of bad practices, then threw AI into the mix.

    Neural networks are approximation algorithms. There’s a reason LLMs are generally more productive with statically typed languages, TDD, etc. They need those feedback loops and guard rails, or they’ll just carry on as if assuming they never make mistakes (which tends to have a compounding effect).

    If you want to use AI safely, you should be more defensive about it. It will fuck up; plan accordingly.

    • Kage520@lemmy.world
      link
      fedilink
      English
      arrow-up
      17
      ·
      7 days ago

      There really should be a certification course for using AI safely. I’m slop coding a hobby app and I’m shocked at how much it FEELS like it can do, because it can do amazing things, yet fails in the strangest ways. When it feels like it can get away with it, it forgets earlier discussions and moves on without it. So you can spend time hammering out a whole section of code, then move on, and AI will rip out everything that references that code and think of a different way in the moment and code that in instead. It won’t be the same. It probably won’t work, or at least won’t pass all test cases. But if you aren’t paying attention and keep coding, your original part of the project is no longer functioning and you won’t understand why. But every step of the way it’s confident in its answers and you won’t suspect that it fundamentally no longer understands the project.

      • ExFed@programming.dev
        link
        fedilink
        English
        arrow-up
        8
        ·
        7 days ago

        As someone who started writing software over 20 years ago (yikes I feel old), I feel like a lot of the best practices I’ve come to appreciate are really just strategies for mitigating future pain or boring/uninspiring work. When you eliminate most of the cost of rewriting everything from scratch by a machine that feels nothing, then “best practices” kinda lose their meaning.

        Edit: confusing sentence order.

        • Rooster326@programming.dev
          link
          fedilink
          English
          arrow-up
          3
          ·
          7 days ago

          I feel like a lot of the best practices I’ve come to appreciate are really just strategies for mitigating future pain or boring/uninspiring work.

          And now you know the difference between Intelligence and Wisdom.

          Also everything has a cost. The only time something has no cost is when you decide your life, your time, is meaningless.

      • mark@programming.dev
        link
        fedilink
        English
        arrow-up
        5
        ·
        7 days ago

        yup and when you DO catch it spitting out nonsense. it"ll say “oh you right, let me change that”… 🙄 like, why do I have to tell you that you’re wrong about something? You should already know it’s wrong and fix it without me ever pointing it out.

        • Rooster326@programming.dev
          link
          fedilink
          English
          arrow-up
          17
          ·
          7 days ago

          But it didn’t even understand it was wrong

          It can’t understand that. It can’t understand anything

          The Human-feedbaxk algorithm dictates humans prefer to receive an apology so it does.

        • SparroHawc@lemmy.zip
          link
          fedilink
          English
          arrow-up
          12
          ·
          7 days ago

          That’s because it doesn’t really ‘know’ things in the same way you and I do. It’s much more like having a gut reaction to something and then spitting it out as truth; LLMs don’t really have the capability to ruminate about something. The one pass through their neural network is all they get unless it’s a ‘reasoning’ model that then has multiple passes as it generates an approximation of train-of-thought - but even then, its output is still a series of approximations.

          When its training data had something resembling corrections in it, the most likely text that came afterwards was ‘oh you’re right, let me fix that’ - so that’s what the LLM outputs. That’s all there is to it.

        • LePoisson@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          6 days ago

          You already got the right replies from the other two. But I think your comment shows the danger of AI being talked about like it’s the fucking second coming.

          They’re all based on LLM - large language models

          They’re just modeling what “most likely” is the right response. AI doesn’t know shit and that’s why it also will yes and you to death because it really is just a yes and machine spitting out what is likely to appear as a valid response to a prompt.

          It’s very dangerous that people treat AI like it actually has some understanding of the training materials or true knowledge of anything. They’re just very good little parrots.

      • Rooster326@programming.dev
        link
        fedilink
        English
        arrow-up
        4
        ·
        edit-2
        7 days ago

        There is a course. It’s called experience. Common sense.

        All that any 4 hour YouTube/LinkedIn learning would-do would-be to perpetuate this idea that developers aren’t necessary. Take this course, buy these tokens and become A based God

  • wonderingwanderer@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    49
    arrow-down
    2
    ·
    7 days ago

    That’s fucking hilarious. How many instances of this have there been now? And companies keep doubling down on AI? Fucking idiots. I’m not even savvy enough to call myself an amateur, and I know better than to make such a series of obvious mistakes that predictably led to this outcome.

    One possible concern, amid the amusement, is whether Anthropic programed Claude to punish companies it sees as potential competition. Or is this just a completely bonkers, off the rails LLM making terrible decisions because it’s just a probabilistic model and not actually capable of abstract cognition?

    Either way, these people are idiots for giving a machine program enough permissions to wipe their drives, they’re idiots for storing their backups on the same network as their main drives, and they’re idiots for trusting a commercial LLM API, when it would be cheaper to self-host their own.

    • rumba@lemmy.zip
      link
      fedilink
      English
      arrow-up
      9
      ·
      7 days ago

      AI writes code

      User vets code

      User runs code

      If you’re not lock-step watching that shit, you need to just be doing it yourself.

      • Landless2029@lemmy.world
        link
        fedilink
        English
        arrow-up
        7
        arrow-down
        1
        ·
        7 days ago

        The problem is the owning class what’s to cut out human elements so bad they keep letting tools run wild.

      • wonderingwanderer@sopuli.xyz
        link
        fedilink
        English
        arrow-up
        1
        ·
        7 days ago

        The point of what? The push for AI in industry?

        You’d have to ask someone else. I can only make conjectures, but I’d say it has something to do with companies feeling the need to justify to their shareholders that their investments in AI were worth it, so they double down on the sunk cost fallacy. Or maybe those shareholders also own stock in big-name AI companies. It’s hard to say exactly…

    • dream_weasel@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      2
      ·
      edit-2
      7 days ago

      It’s just negligence. Power tools injure and people are stupid. The technology is alluring and people make dumb mistakes. There’s no deeper motive here, and self admitting you’re not even an amateur I will just tell you that you’re giving way less credit to these models than they deserve by calling them purely probabilistic, and way more credit then they deserve by trying to assert some kind of malicious incentive by anthropic.

      These bastards are hard to make, and they have a lot of layers (not like NN layers, but training steps). They are, however, definitely better at programming than you or your buddy or any commentator here, and it lures you into a false sense of security before it makes a colossal fuck up.

  • LordCrom@lemmy.world
    link
    fedilink
    English
    arrow-up
    39
    ·
    7 days ago

    This was the exact plot of Silicon Valley when Son of Anton deleted the entire codebase as the most efficient way to remove bugs.