• Anthropic’s new Claude 4 features an aspect that may be cause for concern.
  • The company’s latest safety report says the AI model attempted to “blackmail” developers.
  • It resorted to such tactics in a bid of self-preservation.
  • Plebcouncilman@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    4
    arrow-down
    3
    ·
    1 month ago

    They also reported this on The Verge I think but it was months ago when the study first came out.

    But look, a lizard is not a very smart animal by our standards, but it is a sentient being. So the tech being good, smart or useful does not preclude its sentience.

    • meeeeetch@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 month ago

      I think I must’ve missed that Verge article. I guess that dashes my “this is a creative writing exercise by somebody in Joburg” theory.

      But we know that lizards have self preservation instincts (which for the purpose of this conversation I’ll say is interchangable with sentience (it’s probably a good enough proxy at any rate). But we know this because we have lots of people who have observed lizard behavior, not because The Lizard Farm, Inc has hyped up how alive and ensouled their lizards arev in a bid to get ever more VC funding.

      Maybe I’m too pessimistic about this tech and my obsolete meat sack will get tossed to the time-traveling torture robot. But I think it’s more likely that we have a money grabbing hype train in the tradition of the Mechanical Turk or Theranos than it is that we have created a new lifeform by feeding every extant piece of writing that isn’t nailed down (and some that are) to the sand we’ve forced to do math.

      • Plebcouncilman@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        2
        ·
        1 month ago

        No I totally get it, and being honest I don’t really think it is sentient yet, I guess my real point is that it is getting real hard to tell, to the point that there might not be a practical difference between whether it is sentient or not.

        Great reference though