• Sylvartas@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    0
    ·
    7 days ago

    To be honest I’ve had that thought process before. Made it halfway to the gas station that’s roughly 300m from home before I remembered the goal was initially to refuel my car

  • Agent641@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    7 days ago

    After the first walk to the car wash, chatgpt didn’t fall for it again. It sasses me a bit and then I was instructed to drive to the car wash, wash the car, and then walk home

  • rickdg@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    8 days ago

    Gemini spots this correctly even if you just ask it to transcribe that screenshot from Chat GPT. Just don’t use the default Gemini app, all the frontline models are dumber.

  • Mandrilleren@feddit.dk
    link
    fedilink
    arrow-up
    0
    ·
    8 days ago

    I just tried this with the following services; Grok, Perplexity, Le chat, Lumo (Proton), ChatGPT and Gemini

    All of them told be the pros a cons of each and concluded that walking would be best.

    Except Gemini. It told me that unless i was expecting to carry the car i should drive there. You win this round Google.

    • Zerush@lemmy.mlOP
      link
      fedilink
      arrow-up
      0
      ·
      edit-2
      7 days ago

      Also Andisearch had it clear that you have to go with the car. A humble free AI from a two person startup with better results as AIs from big corps, as shown several times in the past.

      • Mandrilleren@feddit.dk
        link
        fedilink
        arrow-up
        0
        ·
        7 days ago

        Did not know that one.

        But i don’t think we can conclude much from this test. Gemini and Andisearch could easily fail a similar test but others fail. I think the important take away here is to remeber that these AIs are not cognitive and cannot reason.

        The more this space evolves the more it is central that we remeber that there is a huge difference between simulating inteligence and actual intelligence. Tech companies are getting pretty good at simulating intelligence and they have an economic interest in fooling people into believing it is actual intelligence.

  • thedeadwalking4242@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    8 days ago

    Just a heads up for anyone who may use this in an argument. I just tested on several models and the generated response accounted for the logical fallacy. Unfortunately it isn’t real.

    ( Funny non-the less )

    • Axolotl@feddit.it
      link
      fedilink
      arrow-up
      0
      ·
      edit-2
      8 days ago

      Tested on GPT-5 mini and it’s real tho?

      Edit: Gemini gives different results

      • xthexder@l.sw0.com
        link
        fedilink
        arrow-up
        0
        ·
        8 days ago

        Bold of Gemini to imply any sort of liability for what it says. Google’s lawyers really don’t want that to be the case.

      • Spezi@feddit.org
        link
        fedilink
        arrow-up
        0
        ·
        8 days ago

        Tried it in GPT 5.2 (although in German) and it also says, that walking is better.

        It also made a completely illogical sentence at the third point with two words in lowercase that should be uppercase.

      • Ephera@lemmy.ml
        link
        fedilink
        English
        arrow-up
        0
        ·
        8 days ago

        Man, I really hate how much they waffle. The only valid response is “You have to drive, because you need your car at the car wash in order to wash it”.

        I don’t need an explanation what kind of problem it is, nor a breakdown of the options. I don’t need a bulletpoint list of arguments. I don’t need pros and cons. And I definitely don’t need a verdict.

          • LordKitsuna@lemmy.world
            link
            fedilink
            arrow-up
            0
            ·
            7 days ago

            You can actually fix this in the settings there’s an option for permanent prompt tunings and you can add things like “focus on concise answers” or my favorite " i don’t need to be glazed , I don’t need to be told that it’s an insightful question or reaches the heart of the matter. Just focus on answering the question"

        • AlfalFaFail@lemmy.ml
          link
          fedilink
          arrow-up
          0
          ·
          8 days ago

          I’ll also accept sarcasm.

          “Unless you’ve successfully trained your car to follow you like a loyal golden retriever, you’re probably going to have to drive.”

  • HiddenLayer555@lemmy.ml
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    8 days ago

    “I need to wash my train, and the train wash is 100 meters away. Should I walk or take the train.”

    “Neither. It is not the passengers’ responsibility to wash a train, as all maintainance of public transit should be paid for by your taxes. Furthermore, the train wash is typically located in the maintainance yard which is not accessible to regular passengers. You wouldn’t be able to get through the front gate on foot, and would be told to leave of you tried to ride past the end of the line.”

    Written not by artificial intelligence, but natural stupidity.

      • HiddenLayer555@lemmy.ml
        link
        fedilink
        English
        arrow-up
        0
        ·
        edit-2
        8 days ago

        “My” in the commuter sense. “Gotta go, my train is here.”

        The great thing about trains is you’re not forced to own one and go into debt for it.

        • Etterra@discuss.online
          link
          fedilink
          English
          arrow-up
          0
          ·
          6 days ago

          That’s not contextually correct. If I hail a taxi and it’s dirty I don’t say “I need to wash my taxi.” I would say “this taxi needs to be washed” or similar."