A shocking story was promoted on the “front page” or main feed of Elon Musk’s X on Thursday:

“Iran Strikes Tel Aviv with Heavy Missiles,” read the headline.

This would certainly be a worrying world news development. Earlier that week, Israel had conducted an airstrike on Iran’s embassy in Syria, killing two generals as well as other officers. Retaliation from Iran seemed like a plausible occurrence.

But, there was one major problem: Iran did not attack Israel. The headline was fake.

Even more concerning, the fake headline was apparently generated by X’s own official AI chatbot, Grok, and then promoted by X’s trending news product, Explore, on the very first day of an updated version of the feature.

  • cmnybo@discuss.tchncs.de
    3 months ago

    Oh, what a surprise. Another AI spat out some more bullshit. I can’t wait until companies finally give up on trying to do everything with AI.

    • Cosmic Cleric@lemmy.world
      3 months ago

      I can’t wait until companies finally give up on trying to do everything with AI.

      I don’t think that will ever happen.

      They’re accepting of AI-driven car accidents that cause harm. It’s all part of the learning / debugging process to them.

      • JackGreenEarth@lemm.ee
        3 months ago

        AI isn’t inherently bad. Once AI cars cause fewer accidents than human drivers (even if they still cause some accidents) it will be moral to use them on roads.

      • rottingleaf@lemmy.zip
        3 months ago

        The issue is that the process won’t ever stop. It won’t ever be debugged sufficiently

        EDIT: Due to the way it works. A bit like static error in control theory, you know that for different applications it may or may not be acceptable. The “I” in PID-regulators and all that. IIRC

        • Cosmic Cleric@lemmy.world
          3 months ago

          It won’t ever be debugged sufficiently

          It will, someday. Probably years and years down the road (pardon the pun), but it will.

          By the way, your reply to me seems very AI-ish. Are you a bot?

          • maynarkh@feddit.nl
            3 months ago

            I guess the argument is that this is what “innovation and disruption” looks like. When they finally iron things out so that chatbots won’t invent fake headlines, they will pile on a new technology that endangers us in a new way. This is the acceptable margin of error to them.

  • kadu@lemmy.world
    3 months ago

    I wonder how legislation is going to evolve to handle AI. Brazilian law would punish a newspaper or social media platform claiming that Iran just attacked Israel - this is dangerous information that could affect somebody’s life.

    If it were up to me, if your AI hallucinated some dangerous information and provided it to users, you’re personally responsible. I bet if such a law existed in less than a month all those AI developers would very quickly abandon the “oh no you see it’s impossible to completely avoid hallucinations for you see the math is just too complex tee hee” and would actually fix this.

    • rottingleaf@lemmy.zip
      3 months ago

      The legislation should work the way it did before. It’s not something new, like filesharing on the Internet was.

      Which means - punishment.

    • Ottomateeverything@lemmy.world
      3 months ago

      I bet if such a law existed in less than a month all those AI developers would very quickly abandon the “oh no you see it’s impossible to completely avoid hallucinations for you see the math is just too complex tee hee” and would actually fix this.

      Nah, this problem is actually too hard to solve with LLMs. They don’t have any structure or understanding of what they’re saying so there’s no way to write better guardrails… Unless you build some other system that tries to make sense of what the LLM says, but that approaches the difficulty of just building an intelligent agent in the first place.

      So no, if this law came into effect, people would just stop using AI. It’s too cavalier. And imo, they probably should stop for cases like this unless it has direct human oversight of everything coming out of it. Which also, probably just wouldn’t happen.

      • wizardbeard@lemmy.dbzer0.com
        3 months ago

        Yep. To add on, this is exactly what all the “AI haters” (myself included) are going on about when they say stuff like there isn’t any logic or understanding behind LLMs, or when they say they are stochastic parrots.

        LLMs are incredibly good at generating text that works grammatically and reads like it was put together by someone knowledgeable and confident, but they have no concept of “truth” or reality. They just have a ton of absurdly complicated technical data about how words/phrases/sentences are related to each other on a structural basis. It’s all just really complicated math about how text is put together. It’s absolutely amazing, but it is also literally and technologically impossible for that to spontaneously coalesce into reason/logic/sentience.

        Turns out that if you get enough of that data together, it makes a very convincing appearance of logic and reason. But it’s only an appearance.

        You can’t duct tape enough speak and spells together to rival the mass of the Sun and have it somehow just become something that outputs a believable human voice.


        For an incredibly long time, ChatGPT would fail questions along the lines of “What’s heavier, a pound of feathers or three pounds of steel?” because it had seen the normal variation of the riddle with equal weights so many times. It has no concept of one being smaller than three. It just “knows” the pattern of the “correct” response.

        It no longer fails that “trick”, but there’s significant evidence that OpenAI has set up custom handling for that riddle over top of the actual LLM, as it doesn’t take much work to find similar ways to trip it up by using slightly modified versions of classic riddles.

        A lot of supporters will counter “Well I just ask it to tell the truth, or tell it that it’s wrong, and it corrects itself”, but I’ve seen plenty of anecdotes in the opposite direction, with ChatGPT insisting that its hallucination was fact. It doesn’t have any concept of true or false.

        • cygon@lemmy.world
          3 months ago

          I love that example. Microsoft’s Copilot (based on GPT-4) immediately doesn’t disappoint:

          Microsoft Copilot: Two pounds of feathers and a pound of lead both weigh the same: two pounds. The difference lies in the material—feathers are much lighter and less dense than lead. However, when it comes to weight, they balance out equally.

          It’s annoying that for many things, like basic programming tasks, it manages to generate reasonable output that is good enough to goad people into trusting it, yet hallucinates very obviously wrong stuff or follows completely insane approaches on anything off the beaten path. Every other day, I have to spend an hour justifying to a coworker why I wrote code this way when the AI has given him another “great” suggestion, like opening a hidden window with a UI control to query a database instead of going through our ORM.

        • neatchee@lemmy.world
          3 months ago

          The shame of it is that despite this limitation LLMs have very real practical uses that, much like cryptocurrencies and NFTs did to blockchain, are being undercut by hucksters.

          Tesla has done the same thing with autonomous driving too. They claimed to be something they’re not (fanboys don’t @ me about semantics) and made the REAL thing less trusted and take even longer to come to market.

          Drives me crazy.

          • FlashMobOfOne@lemmy.world
            3 months ago

            Yup, and I hate that.

            I really would like to one day just take road trips everywhere without having to actually drive.

            • humorlessrepost@lemmy.world
              3 months ago

              For road trips (i.e. interstates and divided highways), GM’s Super Cruise is pretty much there unless you go through a construction zone. I just went from Atlanta to Knoxville without touching the steering wheel once.

            • neatchee@lemmy.world
              3 months ago

              Right? Waymo is already several times safer than humans and Tesla’s garbage, yet municipalities keep refusing them. Trust is a huge problem for them.

              And yes, haters, I know that they still have problems in inclement weather but that’s kinda the point: we would be much further along if it weren’t for the unreasonable hurdles they keep facing because of fear created by Tesla.

        • rottingleaf@lemmy.zip
          3 months ago

          but it is also literally and technologically impossible for that to spontaneously coalesce into reason/logic/sentience

          Yeah, see, one very popular modern religion (without official status or need for one to explicitly identify with it, but really influential) is exactly about “a wonderful invention” spontaneously emerging in the hands of some “genius” who “thinks differently”.

          Most people put this idea far above reaching your goal after making a myriad of small steps, not skipping a single one.

          They also want a magic wand.

          The fans of “AI” today are deep inside simply luddites. They want some new magic to emerge to destroy the magic they fear.

          • JackGreenEarth@lemm.ee
            3 months ago

            Lol, the AI haters are luddites, not the AI supporters. AI is the present and future, and just because it isn’t perfect doesn’t mean it’s not good enough for many things. And it will continue to get better, most likely.

            • rottingleaf@lemmy.zip
              3 months ago

              You should try and understand that it’s not magic, it’s a very specific set of actions aimed at a very specific result with very specific area of application. Every part of it is clear. There’s no uncharted area where we don’t know at all what happens. Engineering doesn’t work like that anywhere except action movies.

              By the same logic as that “it isn’t perfect” a plane made of grass by cargo cult members can suddenly turn into a real aircraft.

              And it won’t magically become something above it, if that’s what you mean by “get better”.

              For the same reason we still don’t have a computer virus which developed consciousness, and we won’t.

              And if you think otherwise then you are what I described.

      • kadu@lemmy.world
        3 months ago

        So no, if this law came into effect, people would just stop using AI. And imo, they probably should stop for cases like this unless it has direct human oversight of everything coming out of it.

        Then you and I agree. If AI can be advertised as a source of information but at the same time can’t provide safeguarded information, then there should not be commercial AI. Build tools to help video editing, remove backgrounds from photos, go nuts, but do not position yourself as a source of information.

        Though if fixing AI is at all possible, even if we predict it will only happen after decades of technology improvements, it for sure won’t happen if we are complacent and do not add such legislative restrictions.

      • rottingleaf@lemmy.zip
        3 months ago

        Unless you build some other system that tries to make sense of what the LLM says, but that approaches the difficulty of just building an intelligent agent in the first place.

        I actually think an attempt at such an agent would have to include the junk generator. And some logical structure with weights and feedbacks it would form on top of that junk would be something easier for me to call “AI”.

    • rayyy@lemmy.world
      3 months ago

      Another case of Musk cutting corners to the max and endangering lives, but why should he care? He is in control and that is the only thing that matters to him, even if he loses billions of dollars.

  • Nobody@lemmy.world
    3 months ago

    Beware, terminally incompetent interns everywhere. Doing something incredibly damaging to your company over social media on your first day is officially a job that’s been taken by AI.

  • style99@kbin.social
    3 months ago

    People who deploy AI should be held responsible for the slander and defamation the AI causes.

    • cygon@lemmy.world
      3 months ago

      I assume that Twitter still has tons of managers and team leads that allowed this and have their own part of the responsibility. However, Musk is known to be a choleric with a mercurial temper, someone who makes grand public announcements and then pushes his companies to release stuff that isn’t nearly ready for production. Often it’s “do or get fired”.

      So… an unshackled AI generating official posts, no human hired to curate the front page, headlines controlled through up-voting by trolls and foreign influence campaigns, all running unchecked in the name of “free speech” – that’s very much on brand for a Musk-run business, I’d say.

      • h3rm17@sh.itjust.works
        3 months ago

        Nah, it’s a bit like government. It’s only his responsibility if it is no one else’s responsibility. Like, they can have the most corrupt cabinets, most presidents do not resign/abdicate, whatever the word is.

          • h3rm17@sh.itjust.works
            3 months ago

            Ok, you just downvote and say no, but no explanation given. In my gov several cases of corruption arose during the last couple of years, and way more in the past. They affect high ranking ministers, and yet the president does not resign. Same with companies, they get paid the most, do the least, claim it is because they have “lots of responsibilities” but still never pay the price.

            • maynarkh@feddit.nl
              3 months ago

              Corporations are completely authoritarian, while most governments are not, or at least not completely. If there really is a “rogue engineer”, Musk can very easily fire them. Even if there was, it’s his responsibility to organize a company in such a way that this cannot happen, with people having oversight over other people.

              He is very clearly failing to do any of that.

            • baru@lemmy.world
              3 months ago

              but no explanation given

              You didn’t explain, so why should I? I did see you made things up.

  • Otter@lemmy.ca
    3 months ago

    I don’t really understand this headline

    The bot made it? So why was it promoted as trending?

    • Deceptichum@sh.itjust.works
      3 months ago

      It’s pretty simple: trending is based on . . . what’s trending among users.

      Or as the article explains for those who can’t comprehend what trending means.

      Based on our observations, it appears that the topic started trending because of a sudden uptick of blue checkmark accounts (users who pay a monthly subscription to X for Premium features including the verification badge) spamming the same copy-and-paste misinformation about Iran attacking Israel. The curated posts provided by X were full of these verified accounts spreading this fake news alongside an unverified video depicting explosions.

        • Deceptichum@sh.itjust.works
          3 months ago

          It does say it’s likely hyperbole, so they probably just tazed and arrested the earthquake.

          Also I’m impressed by the 50,000 to 1,000,000 range for officers deployed. It leaves little room for error.

          • PopShark@lemmy.world
            3 months ago

            I wonder if the wide margin is the AI trying to formulate logic and numbers in the story but it realizes it doesn’t know how many officers would be needed to shoot the earthquake since it would logically depend on the magnitude of the earthquake which the AI doesn’t know so it figures well alright tectonic plates are rather resistant to firearms discharge and other potential law enforcement tactics so it starts high at 50,000 but decides 1,000,000 is a reasonable cap as there just can’t be more than that many officers present in the state or country

  • IninewCrow@lemmy.ca
    3 months ago

    A similar thing happened with major newspapers about 100 / 150 years ago … governments realized that if any one group or company had control over all the information without regulation, businesses would quickly figure out ways to monetize information for the benefit of those with all the money and power. They then had to figure out how to start regulating newspapers and news media in order to maintain some sort of control and sanity in the entire system.

    But like the newspapers of old … no one will do anything about all this until it causes a major crisis or causes a terrible event … or events.

    In the meantime … big corporations controlling 99% of all media and news information will stay unregulated or regulated as little as possible until terrible things happen and society breaks down.

  • sik0fewl@lemmy.ca
    3 months ago

    To everyone that goes to “X” to get the “real”, unfiltered news, I hope you can see that it’s not that site anymore.

    • SeedyOne@lemm.ee
      3 months ago

      Yet, annoyingly, much of the press still uses it to disseminate news.

      I understand journalism is in a rough spot these days and many are there against their will but something needs to change abruptly. This slow exodus is too slow for democracy to survive '24.

      • rottingleaf@lemmy.zip
        3 months ago

        They could use Nostr. It’s a bit similar to going to a square and yelling. The downside is that you are not heard from every corner of it, but I just remembered that this exists and thought that the idea is actually very nice.

    • expr@programming.dev
      3 months ago

      In case you’re not familiar, https://en.m.wikipedia.org/wiki/Grok.

      It’s somewhat common slang in hacker culture, which of course Elon is shitting all over as usual. It’s especially ironic since the word roughly means “deep or profound understanding”, which their AI has anything but.

      • umbraroze@lemmy.world
        3 months ago

        Yup. Got also added to the Jargon File, which was an influential collection of hacker slang.

        If there’s one thing that Elon is really good at, it’s taking obscure beloved nerd tidbits and then pigeon-shitting all over them.