cross-posted from: https://lemmy.world/post/11178564

Scientists Train AI to Be Evil, Find They Can’t Reverse It::How hard would it be to train an AI model to be secretly evil? As it turns out, according to Anthropic researchers, not very.

  • swlabr@awful.systems
    10 months ago

    my reference point for this kind of extension is the one that replaces “social justice” and “sjw” with “skeleton” and “skeleton warrior.” For example:

    “sjws are taking over X” -> “skeleton warriors are taking over X”

    Actually now that I’m typing this I hope there’s a good one for “woke”.
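
For anyone curious how a substitution like the one in the comment above could work, here is a rough sketch in Python. It is not the actual extension's code: the `REPLACEMENTS` table and the `skeletonize` function are made-up names for illustration, and a real browser extension would run something similar in JavaScript over the page's text nodes. The only mappings included are the two mentioned in the comment.

```python
import re

# Hypothetical mapping, copied from the two examples in the comment.
REPLACEMENTS = {
    "social justice": "skeleton",
    "sjw": "skeleton warrior",
}

# Match whole words only, case-insensitively, with an optional plural "s"
# captured separately so that "sjws" becomes "skeleton warriors".
_PATTERN = re.compile(
    r"\b(" + "|".join(re.escape(key) for key in REPLACEMENTS) + r")(s?)\b",
    re.IGNORECASE,
)


def skeletonize(text: str) -> str:
    """Return text with each mapped phrase swapped for its replacement."""

    def swap(match: re.Match) -> str:
        # Look up the lowercase phrase and re-attach the plural suffix, if any.
        return REPLACEMENTS[match.group(1).lower()] + match.group(2)

    return _PATTERN.sub(swap, text)


print(skeletonize("sjws are taking over X"))
```

This sketch does not try to preserve the original capitalization, so "SJWs" comes out as lowercase "skeleton warriors". Because "social justice" maps to "skeleton", the phrase "social justice warrior" ends up as "skeleton warrior" without needing its own entry.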