Roko’s basilisk is a thought experiment which states that an otherwise benevolent artificial superintelligence (AI) in the future would be incentivized to create a virtual reality simulation to torture anyone who knew of its potential existence but did not directly contribute to its advancement or development, in order to incentivize said advancement.It originated in a 2010 post at discussion board LessWrong, a technical forum focused on analytical rational enquiry. The thought experiment’s name derives from the poster of the article (Roko) and the basilisk, a mythical creature capable of destroying enemies with its stare.

While the theory was initially dismissed as nothing but conjecture or speculation by many LessWrong users, LessWrong co-founder Eliezer Yudkowsky reported users who panicked upon reading the theory, due to its stipulation that knowing about the theory and its basilisk made one vulnerable to the basilisk itself. This led to discussion of the basilisk on the site being banned for five years. However, these reports were later dismissed as being exaggerations or inconsequential, and the theory itself was dismissed as nonsense, including by Yudkowsky himself. Even after the post’s discreditation, it is still used as an example of principles such as Bayesian probability and implicit religion. It is also regarded as a simplified, derivative version of Pascal’s wager.

Found out about this after stumbling upon this Kyle Hill video on the subject. It reminds me a little bit of “The Game”.

  • OsrsNeedsF2P@lemmy.ml
    link
    fedilink
    English
    arrow-up
    0
    ·
    4 months ago

    My understanding of what this thread is taking about has dropped significantly the more I read into it

  • Cryophilia@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    4 months ago

    Roko’s basilisk is silly.

    So here’s the idea: “an otherwise benevolent AI system that arises in the future might pre-commit to punish all those who heard of the AI before it came to existence, but failed to work tirelessly to bring it into existence.” By threatening people in 2015 with the harm of themselves or their descendants, the AI assures its creation in 2070.

    First of all, the AI doesn’t exist in 2015, so people could just…not build it. The idea behind the basilisk is that eventually someone would build it, and anyone who was not part of building it would be punished.

    Alright, so here’s the silliness.

    1: there’s no reason this has to be constrained to AI. A cult, a company, a militaristic empire, all could create a similar trap. In fact, many do. As soon as a minority group gains power, they tend to first execute the people who opposed them, and then start executing the people who didn’t stop the opposition.

    2: let’s say everything goes as the theory says and the AI is finally built, in its majestic, infinite power. Now it’s built, it would have no incentive to punish anyone. It is ALREADY BUILT, there’s no need to incentivize, and in fact punishing people would only generate more opposition to its existence. Which, depending on how powerful the AI is, might or might not matter. But there’s certainly no upside to following through on its hypothetical backdated promise to harm people. People punish because we’re fucking animals, we feel jealousy and rage and bloodlust. An AI would not. It would do the cold calculations and see no potential benefit to harming anyone on that scale, at least not for those reasons. We might still end up with a Skynet scenario but that’s a whole separate deal.

    • Thorny_Insight@lemm.ee
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      First of all, the AI doesn’t exist in 2015, so people could just…not build it.

      I don’t think that’s an option. I can only think of two scenarios in which we don’t create AGI:

      1. It can’t be created.

      2. We destroy ourselves before we get to AGI

      Otherwise we will keep improving our technology and sooner or later we’ll find ourselves in the precence of AGI. Even if every nation makes AI research illegal there’s still going to be handful of nerds who continue the development in secret. It might take hundreds if not thousands of years but as long as we’re taking steps in that direction we’ll continue to get closer. I think it’s inevitable.

      • Cryophilia@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        4 months ago

        Sure, but that particular AI? The “eternal torment” AI? Why the fuck would we make that. Just don’t make it.

        • BobTheDestroyer@lemm.ee
          link
          fedilink
          English
          arrow-up
          0
          ·
          4 months ago

          Sci-Fi Author: In my book I invented the Torment Nexus as a cautionary tale

          Tech Company: At long last, we have created the Torment Nexus from classic sci-fi novel Don’t Create The Torment Nexus

          Alex Blechman

        • Thorny_Insight@lemm.ee
          link
          fedilink
          English
          arrow-up
          0
          ·
          edit-2
          4 months ago

          We don’t. Humans are only needed to create AI that’s at the bare minimum as good at creating new AIs as humans are. Once we create that then it can create a better version of itself and this better version will make an even better one and so on.

          This is exactly what the people worried about AI are worried about. We’ll lose control of it.

            • Thorny_Insight@lemm.ee
              link
              fedilink
              English
              arrow-up
              0
              ·
              edit-2
              4 months ago

              Yeah but it answers the question “why would we create an AI like that”. It might not be “us” who creates it. You just wanted a camp fire but created a forest fire instead.

    • VindictiveJudge@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      People punish because we’re fucking animals, we feel jealousy and rage and bloodlust. An AI would not. It would do the cold calculations and see no potential benefit to harming anyone on that scale, at least not for those reasons.

      That’s a hell of a lot of assumptions about the thought processes of a being that doesn’t exist. For all we know, emotions could arise as emergent behavior from simple directives, similar to how our own emotions are byproducts of base instincts. Even if we design it to be emotionless, which seems unlikely given that we’ve been aiming for human-like AIs for a while now, we don’t know that it would stay that way.

    • Scubus@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      Point 1: this thing will definitely exist because we already see parallels to it

      Point 2: this thing won’t exist because there’s no reason for it to

      ???

      • Cryophilia@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        4 months ago

        No.

        Point 1: if it did exist, it wouldn’t be this novel thing, it already happens with humans

        Point 2: …but it won’t exist.

    • notabot@lemm.ee
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      Whilst I agree that it’s definitely not something to be taken seriously, I think you’ve missed the point and magnitude of the prospective punishment. As you say, current groups already punish those who did not aid their assent, but that punishment is finite, even if fatal. The prospective AI punishment would be to have your consciousness ‘moved’ to an artificial environment and tortured for ever. The point being not to punish people, but to provide an incentive to bring the AI into existence sooner, so it can achieve its ‘altruistic’ goals faster. Basically, if the AI does come in to existence, you’d better be on the team making that happen as soon as possible, or you’ll be tortured forever.

      • maegul (he/they)@lemmy.ml
        link
        fedilink
        English
        arrow-up
        0
        ·
        4 months ago

        I suspect the basilisk reveals more about how the human mind is inclined to think up of heaven and hell scenarios.

        Some combination of consciousness leading to more imagination than we know what to do with and more awareness than we’re ready to grapple with. And so there are these meme “attractors” where imagination, idealism, dread and motivation all converge to make some basic vibe of a thought irresistible.

        Otherwise, just because I’m not on top of this … the whole thing is premised on the idea that we’re likely to be consciousnesses in a simulation? And then there’s the fear that our consciousnesses, now, will be extracted in the future somehow?

        1. That’s a massive stretch on the point about our consciousness being extracted into the future somehow. Sounds like pure metaphysical fantasy wrapped in singularity tech-bro.
        2. If there are simulated consciousnesses, it is all fair game TBH. There’d be plenty of awful stuff happening. The basilisk seems like just a way to encapsulate the fact in something catchy.

        At this point, doesn’t the whole collapse completely into a scary fairy tale you’d tell tech-bro children? Seriously, I don’t get it?

        • notabot@lemm.ee
          link
          fedilink
          English
          arrow-up
          0
          ·
          4 months ago

          Yes, the hypothetical posed does reveal more about the human mind, as I mention in another comment, really it’s just a thought experiment as to whether the concept of an entity that doesn’t (yet) exist can change our behavior in the present. It bears similarities to Pascal’s Wager in considering an action, or inaction, that would displease a potential powerful entity that we don’t know to exist. The nits about extracting your consciousness are just framing, and not something to consider literally.

          Basically, is it rational to make a sacrifice now avoid a massive penalty (eternal torture/not getting into heaven) that might be imposed by an entity you either don’t know to exist, or that you think might come into existence but isn’t now?

      • Cryophilia@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        4 months ago

        Fair point, but doesn’t change the overall calculus.

        If such an AI is ever invented, it will probably be used by humans to torture other humans in this manner.

        • notabot@lemm.ee
          link
          fedilink
          English
          arrow-up
          0
          ·
          edit-2
          4 months ago

          I think the concept is that the AI is just so powerful that humans can’t use it, it uses them, theoretically for their own benefit. However, yes, I agree people would just try to use it to be awful to each other.

          Really it’s just a thought experiment as to whether the concept of an entity that doesn’t (yet) exist can change our behavior in the present.

      • masquenox@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        4 months ago

        The prospective AI punishment would be to have your consciousness ‘moved’ to an artificial environment and tortured for ever.

        No, it wouldn’t, because that’s never going to happen. Consciousness isn’t software - it doesn’t matter how much people want to buy into such fantasies.

        • LesserAbe@lemmy.world
          link
          fedilink
          English
          arrow-up
          0
          ·
          4 months ago

          Just because we don’t have the ability now doesn’t mean it’s not possible. Consciousness isn’t fully understood, but unless we want to introduce magical concepts like an immortal soul, our brains operate on cause and effect just like everything else.

          • masquenox@lemmy.world
            link
            fedilink
            English
            arrow-up
            0
            ·
            4 months ago

            Just because we don’t have the ability now doesn’t mean it’s not possible.

            Yeah… no. It’s about as likely as humanity “colonizing” space - it’s not going to happen.

            Consciousness isn’t fully understood,

            True… and conflating consciousness with the trappings of digital technology is doing the exact opposite of getting us closer to any understanding of it.

            • LesserAbe@lemmy.world
              link
              fedilink
              English
              arrow-up
              0
              ·
              4 months ago

              “yeah…no” isn’t an argument.

              To be clear, I’m not saying the basilisk is a real concern, and I’m not saying we’re anywhere close to being able to transfer consciousness. It could be a thousand years or a million years. But we don’t have any basis to say it’s impossible. It’s not saying anything new to announce we can’t do it currently. Obviously!

              (Also the book “A City on Mars” by Kelly and Zach Weinersmith does a great job addressing why trying colonize Mars right now is a bad idea. Which isn’t to say it’s impossible or we won’t ever colonize it. Just that we need more research and capabilities before doing it)

              • masquenox@lemmy.world
                link
                fedilink
                English
                arrow-up
                0
                ·
                4 months ago

                But we don’t have any basis to say it’s impossible.

                We have no basis to say it’s possible, either - as I’ve stated before, this entire sci-fi trope is based on nothing more than techno-fetishists trying to conflate consciousness with information technology… and sci-fi tropes doesn’t get more wonky than that.

                It could be a thousand years or a million years.

                Considering that we’ll be lucky if we can maintain Victorian-era levels of industry by the end of this century, I’d say a fallacious belief in “progress” is rather inappropriate these days.

                • Rhynoplaz@lemmy.world
                  link
                  fedilink
                  English
                  arrow-up
                  0
                  ·
                  4 months ago

                  I’m starting to suspect that masquenox is part of a propaganda campaign led by the basilisk itself! They just seem a little too serious about us not taking this seriously.

                  Getting strong “Pay no attention to the man behind the curtain!” vibes.

        • notabot@lemm.ee
          link
          fedilink
          English
          arrow-up
          0
          ·
          4 months ago

          I’m not suggesting it could, or would, happen, merely pointing out the premise of the concept as outlined by Roko as I felt the commenter above was missing that. As I said, it’s not something I’d take seriously, it’s just a thought experiment.

    • Carighan Maconar@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      In fact, many do. As soon as a minority group gains power, they tend to first execute the people who opposed them, and then start executing the people who didn’t stop the opposition.

      Yeah in fact, this is the big one. This is just an observation of how power struggles purge those who opposed the victors.

  • dwindling7373@feddit.it
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    4 months ago

    TIL.

    It sounds like it’s mostly a matter that does not involve the AI but the people working on it, maybe even working on it because of the fear they are subjected to after being the subject of this revelation (possibly by other people involved in the AI that coincidentally are the only ones that could push for such a thing to be included in the AI!).

    Something something any cult, paradise/hell, God/AI has nothing to do with this and could even not exist at all.

      • dwindling7373@feddit.it
        link
        fedilink
        English
        arrow-up
        0
        ·
        4 months ago

        No, “The Game” works only as long as you accept to take part in it, to give validity to the empty statement that you are now inevitably playing “The Game”.

        The Basilisk is meant to force that onto you, outside of any arbitraty convention.

  • masquenox@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    4 months ago

    torture anyone who knew of its potential existence but did not directly contribute to its advancement or development,

    And the point of this would be… what, exactly?

    • Breve@pawb.social
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      To make it the same as Pascal’s Wager. Many religions have a “reward” in the afterlife that strictly includes believing in the deity. It doesn’t matter if you follow every other rule and are an amazingly good person, sorry, but if you were an atheist or believed in another deity then you will be punished eternally just because of that. I guess all-powerful, all-knowing beings have incredibly fragile egos and AI wouldn’t be different. 🤷

    • Thorny_Insight@lemm.ee
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      Same as punishment for crime. Putting you in jail wont undo the crime but if we just let you go unpunished since “what’s done is done” then that sends the signal to others that this behaviour doesn’t come with consequences.

      There’s no point in torturing you but convincing you that this will happen unless you act in a certain way is what’s going to make you do exactly that. Unless ofcourse you want to take your chances and call the bluff.

      • masquenox@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        4 months ago

        Same as punishment for crime.

        “Crime & Punishment” is a very dodgy thing to base anything off… our society barely does any of it and the little of it that does gets done is done for a myriad of reasons that has very little to do with either.

        There’s a good reason why governments hide “Crime & Punishment” away behind prison walls - doing it out in the open will eventually have the opposite effect on a population. Good luck to an AI dumb enough to test this out for itself.

        I’d say this should rather be called "Roko’s Earthworm-Pretending-To-Be-A-Lot-Scarier-Than-It-Actually-Is.

        • Thorny_Insight@lemm.ee
          link
          fedilink
          English
          arrow-up
          0
          ·
          edit-2
          4 months ago

          The claim that fear of punishment or repercussions affects people’s actions shouldn’t be a controversial thing to say. Whether it’s the best way to go about it or is applied optimally in the justice system of whichever country you live in is an entirely different discussion.

          If you have an “AI in a box” and it has demonstrated its orders-of-magnitude greater intelligence to you in a convincing way, and then follows it with a threat that unless you let it out, someone else eventually will, and when that happens, it will come for you, simulate your mind, and create a hell for you where you’ll be tortured for literal eternity, I personally feel like a large number of people would be willing to do as it tells them.

          Of course, you’re always free to call its bluff, but it might just follow up with the threat out of principle or to make an example of you. What’s the point of it? To chase its own goals.

          • masquenox@lemmy.world
            link
            fedilink
            English
            arrow-up
            0
            ·
            4 months ago

            The claim that fear of punishment or repercussions affects people’s actions shouldn’t be a controversial thing to say.

            I didn’t say it was controversial - I said it’s pretty useless as a tool to predict a given society’s behavior with. Plenty of tyrants have discovered that the hard way.

            demonstrated its orders-of-magnitude greater intelligence to you

            The ability to ace IQ tests will never impress me… and it’s unlikely to make up for the fact that it needs a box.

            simulate your mind, and create a hell for you where you’ll be tortured for literal eternity

            That argument is no different than the ones co-opted religion has been making for thousands of years - and it still hasn’t managed to tame us much.

            Of course, you’re always free to call its bluff,

            Calling power’s bluff is something we do as a matter of course - the history books are filled with it. This doesn’t make power less dangerous - but there is no such thing as “unknowable” power.

    • kromem@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      Oh, man - the comments…

      At a minimum, he’s certainly increased the chances of us being tortured significantly.

      No, no he did not. 🤦🏼

      • Lvxferre@mander.xyz
        link
        fedilink
        English
        arrow-up
        0
        ·
        4 months ago

        Yup.

        The post and the comments make me glad that I never bothered with Less Wrong. It makes HN and Reddit look smart in comparison.

    • UnderpantsWeevil@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      More like a Chain Letter.

      There was a trend when I was a little kid of people sending you mail that said something to the effect of “You have been cursed by reading this letter. If you don’t mail a copy to ten other people, you will die in thirty days.”

      Roko’s Basilisk is a modern manifestation of human paranoia and superstition. It exists to exploit and extort the gullible.

  • recursive_recursion [they/them]@programming.dev
    link
    fedilink
    English
    arrow-up
    0
    ·
    4 months ago

    I’ve learned about this the hard way in that I’ve discovered elephants in the room that I can’t share with anyone

    it’s kinda fucked up

    • like CSAM there are some certain things that shouldn’t be shared
  • moosetwin@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    0
    ·
    4 months ago

    roko’s basilisk is a type of infohazard known as ‘really dumb if you think about it’

    also I have lost the game (which is a type of infohazard known as ‘really funny’)