• Buffalox@lemmy.world
    link
    fedilink
    English
    arrow-up
    55
    arrow-down
    3
    ·
    edit-2
    14 days ago

    According to a new study by researchers at Carnegie Mellon, MIT, Oxford, and UCLA,

    Study should be solid I guess.

    participants who were given AI assistants (in this case, a chatbot powered by OpenAI’s GPT-5 model) would have the aid pulled from them without warning during the test

    Wow, interesting idea. 👍

    where they had their assistant removed, the AI group saw the solve rate fall off a cliff. They had a solve rate about 20% lower

    And even worse IMO:

    They also had nearly double the skip rate, meaning they simply chose not to solve the questions.

    This seems very alarming IMO, because this indicates they lost some of their ability to think constructively on how to actually solve a problem!

    I know there have always been some who cried wold every time new technology has become available, like calculators and computers. Even dictionaries were once claimed to be harmful once!
    But maybe this time there is a real danger, because AI takes away a lot of the need to actually think creatively and constructively. And that’s an ability we must not lose.

    The last paragraph of the article is even worse. As it mentions 2 studies that show these effects are also long term!!!

    • scarabic@lemmy.world
      link
      fedilink
      English
      arrow-up
      17
      ·
      13 days ago

      Changing the terms of the test in the middle of it, without warning, is disruptive. I’m not convinced it “fried their brains.” The same would happen with a calculator suddenly removed during the middle of an exam.

      • howrar@lemmy.ca
        link
        fedilink
        English
        arrow-up
        5
        ·
        13 days ago

        Or any task change really. You tell me that I’m here for a writing task, then halfway through it becomes a math test? There’s no way I’m doing anywhere near as well as if they told me what was happening ahead of time.

      • Buffalox@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        1
        ·
        13 days ago

        You are disregarding the last paragraph, where 2 other studies showed similar results, without having the “disruptive” factor.

        • scarabic@lemmy.world
          link
          fedilink
          English
          arrow-up
          5
          arrow-down
          3
          ·
          edit-2
          13 days ago

          Here’s that last paragraph. Microsoft’s finding actually sounds like it does have the disruptive factor: people are trained to use AI and then it is removed. And finally, finally in the very last sentence of the entire article we get the one piece of information that’s been missing the entire time: doctors perform better with AI help, but then worse than ever without it.

          My conclusion? Let people have AI and perform better with it.

          Carpenters trained on power tools will suddenly perform worse with hand tools than carpenters who were never given power tools. But if they are given power tools, they can build homes faster.

          No shit?

          The findings are also in line with a study Microsoft published last yearthat looked at cognitive decline among knowledge workers, which found that the more people lean on AI, the worse they perform when asked to work without support. It also echoes a study out of Poland, which found that while doctors are better at spotting cancer risks with AI assistance, they perform worse than the no-AI baseline once that assistance is removed.

          • Buffalox@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            7
            ·
            edit-2
            13 days ago

            Carpenters trained on power tools will suddenly perform worse with hand tools than carpenters who were never given power tools.

            Now you are just making shit up. None of these examples are about people being trained on AI. The comparison would be if a carpenter using power tolls for 10 minutes, suddenly becomes worse at using the traditional tools he is trained to use.

            Your claim is baseless, there is no evidence for it, and the lack of any evidence of it, makes it an unreasonable assumption based on your prejudice alone, and should not be believed.

            Let people have AI and perform better with it.

            Again a very loaded statement, nobody is preventing anybody from using AI based on this research. But maybe people are not really performing better, or at least not always, it may depend on the task.

            Your logic is fundamentally flawed and inconsistent, and you seem to lack any ability to see this as a potential problem, so much so that it reeks of you having an agenda.

            Your flawed logic and prejudice does not beat 3 research papers.

              • Buffalox@lemmy.world
                link
                fedilink
                English
                arrow-up
                2
                arrow-down
                4
                ·
                13 days ago

                Yes the article reporting on a research paper has an agenda, and not the random guy ignoring the evidence to contradict it. With absolutely zero to show for your argument, and clearly using flawed logic.

                All I hear is the laugh of ignorance.

                • scarabic@lemmy.world
                  link
                  fedilink
                  English
                  arrow-up
                  3
                  arrow-down
                  1
                  ·
                  edit-2
                  12 days ago

                  Ah yes, Gizmodo, arbiter of scientific truths. Their agenda is clear: to get you to click, typically with an outragey clickbait headline that reinforces your favorite narrative.

                  You need to learn the difference between debating someone and shouting at them that they have no argument, no logic, no evidence, and ill motivations. I can think of a couple other things you also need to do, but I’ll keep it PG.

    • NeilNuggetstrong@lemmy.world
      link
      fedilink
      English
      arrow-up
      9
      arrow-down
      1
      ·
      13 days ago

      If I use AI for my personal coding projects I’ve found that if the task is unsolvable by the ai model, I’m not able to sit down and do it myself until the next day. It’s like I’ve got to reset my brain.

      If I want to save time and use AI for a specific part of the code, it probably saves me 5 hours of work. But then I spend five hours yelling at the ai to try to get it to actually solve it. Next day I’ll just fix it myself in 2 hours.

      • Sockenklaus@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        3
        ·
        edit-2
        12 days ago

        But what you’re describing is not that uncommon, even without AI: Oftentimes when trying to solve a complex problem and being unsuccessful you have to reset your brain by doing something fundamentally different or have a good night of sleep and after that you solve the problem easily.

        May what you’re experiencing is not AI related at all.

        • NeilNuggetstrong@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          12 days ago

          You’re probably right, but I think it’s made worse by AI. Jumping into the code after 3 hours with Claude doing the dirty work feels like an impossibility

      • Buffalox@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        13 days ago

        That sounds a lot like what the studies show. And IMO that sounds like a serious problem.

        • NeilNuggetstrong@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          13 days ago

          I’m really just tricking my brain to think I’m being more productive lmao.

          But then again, some of the stuff I’m working on is in principle quite easy to do, but is also outside of my skillet, for these cases I benefit from using AI.

          IMO the challenge is knowing how and when to use AI. Small companies using AI correctly can probably benefit massively from it. Although it’s risky

    • Chloé 🥕@lemmy.blahaj.zone
      link
      fedilink
      English
      arrow-up
      7
      arrow-down
      1
      ·
      13 days ago

      there have always been some who cried wold every time new technology has become available, like calculators and computers

      and they kinda have a point, really. people got worse at memorizing stuff by heart when writing was invented, and people got worse at mental calculus when calculators when invented.

      but they allowed many things that were simply not possible. a calculation that takes me 2 minutes in wolfram alpha could take hours if not days to solve by hand!

      ai, meanwhile, or at least the ai we’re sold, does not offer significant advantages (at best it saves a few minutes), at the cost of making us worse at thinking, a skill that is absolutely essential to have… and of course, that’s the point. the tech oligarchs want us to be dependent on their extremely expensive products.

      • Buffalox@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        ·
        13 days ago

        and people got worse at mental calculus when calculators when invented.

        That may be true, but that is a much more limited problem, than losing some of our ability for critical thinking and problem solving in general.

        ai, meanwhile, or at least the ai we’re sold, does not offer significant advantages

        This is very true, the AI are shown to even hallucinate, and give incorrect and harmful solutions. A calculator does NOT do that.
        So not only is the AI a danger to our critical thinking, we actually need it MORE when using AI.

      • badgermurphy@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        13 days ago

        But they’re using the hell out of it, too, right? They’re exactly the types of people that love and use it the most: managers and owners.

    • FauxLiving@lemmy.world
      link
      fedilink
      English
      arrow-up
      6
      arrow-down
      1
      ·
      13 days ago

      This paper shows that a person who has performed a task 12 times performs better than a person who has never performed the same task.

      They also do not properly control for performance loss due to context switching which is a well known contributor to performance loss.

      It’s a paper on arXiv, it hasn’t been peer reviewed or published.

      • Buffalox@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        2
        ·
        13 days ago

        No the test is not training, that’s a weird thing to claim. The switch is what is tested, and you disregard that 2 other tests have shown similar results. An actual decline in critical and problem solving thinking.

        • FauxLiving@lemmy.world
          link
          fedilink
          English
          arrow-up
          3
          arrow-down
          1
          ·
          edit-2
          12 days ago

          Here is the paper: https://ai-project-website.github.io/AI-assistance-reduces-persistence/

          No the test is not training, that’s a weird thing to claim.

          The control group solved 12 questions manually and then the 3 test questions manually. The AI grouped solved 0 questions manually and the 3 test questions manually. One group had 12 more manual math tasks to prepare for the manual math test the other group had 0 and also had to context switch.

          The AI-assisted group was dealt a context switch, which results in a pretty severe performance loss. A context switch causes performance loss of around 40% according to this paper, which was peer-reviewed and published and is also the most cited paper on the topic, in the APA: https://www.apa.org/pubs/journals/releases/xhp274763.pdf

          The AI-assisted group also did not have 12 questions to adjust to the new context, like the control group did. If they wanted to wipe out the context switching performance loss they should have kept asking questions to see if, after 12 questions, the AI-assisted group had a similar performance.

          The switch is what is tested, and you disregard that 2 other tests have shown similar results.

          No, they did not switch what was tested. Here is an image from the actual paper.

          They were given 12 tasks with one group using AI and another doing mental math and then 3 tasks doing mental math. One group had 12 more tasks worth of preparation than the other.

          Nothing, not even the article in theOP, says that they did math and swapped to reading to test.

          They did 3 different experiments, in each experiment they gave 12 tasks and then disabled the AI for one group and gave 3 more tasks as a test. At no point did they ask 12 math questions and then finish with 3 reading questions or vice versa. They did 2 experiments using math tasks and 1 experiment using reading comprehension tasks.

          So one group had 15 math tasks and one group had 12 ‘how to ask an AI’ tasks and then 3 math questions.

          They also did not control for context switching losses, which is a well documented (see the APA paper) effect. The proper control would be to continue asking questions so the AI group also had 12 math tasks before the test.

          There’s a reason that this is published on arXiv and not in a peer-reviewed journal. Designing a poor quality experiment doesn’t tell you anything useful even if you do multiple different versions of the same experiment.

          This paper demonstrates a lack of a proper control group, specifically a failure to control for context switching performance loss.

          • Buffalox@lemmy.world
            link
            fedilink
            English
            arrow-up
            3
            arrow-down
            1
            ·
            12 days ago

            The picture you post contradict your claims. The 2 groups are getting the same question, but one has AI assistance, the other has not.
            Again you fail to show anything to support your claims.

            • FauxLiving@lemmy.world
              link
              fedilink
              English
              arrow-up
              3
              ·
              12 days ago

              I also wrote text.

              If you’re just going to cherry pick a single point and dismiss everything else then we’re done here.

            • Sockenklaus@sh.itjust.works
              link
              fedilink
              English
              arrow-up
              2
              ·
              12 days ago

              No, what they meant is: The control group had 12 questions to get into the flow of solving math problems and then solved three more math problems for good measure.

              The AI group on the other hand got into the flow of formulating math problems to ChatGPT and then had to actually solve three math problems themselves

              Their critique is, that solving math problems yourself and prompting ChatGPT to solve math problems are not necessarily comparable tasks and require different skill sets so disabling AI after 12 tasks meant the first group had to switch context and therefore had worse performance.

              If you want to analyze the first groups general ability of problem solving you should give them again twelve tasks after disabling AI so they get used to this new type of task (solving math problems yourself vs. prompting math problems to the AI) before measuring their performance.

              • Buffalox@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                ·
                12 days ago

                The AI group on the other hand got into the flow of formulating math problems to ChatGPT and then had to actually solve three math problems themselves

                That’s what the friggin test is about! So of course they did.

        • iglou@programming.dev
          link
          fedilink
          English
          arrow-up
          1
          ·
          13 days ago

          Not training, no, but warm up. And no, it is not about critical thinking, it’s about reading comprehension and calculations.

  • Rioting Pacifist@lemmy.world
    link
    fedilink
    English
    arrow-up
    28
    arrow-down
    1
    ·
    13 days ago

    The test seems kind of dogshit, you could make the same argument against any tool, calculators or even abacuses would have the same effect.

    I’m made to use it for work and it does speed up some tasks, however for some stuff it ends up being like the experiment where not doing the work the first time means the whole process takes longer at the end.

    • FauxLiving@lemmy.world
      link
      fedilink
      English
      arrow-up
      6
      ·
      edit-2
      13 days ago

      To add to this, we already know that context switching causes a loss in performance.

      A person who’s thinking about how to solve a problem one way and then has to suddenly think about solving it in another way will perform worse.

      https://medium.com/@codewithmunyao/the-hidden-cost-of-context-switching-why-your-most-productive-hours-are-disappearing-43c5b501de19

      The Neuroscience Behind the Pain

      Context switching isn’t just annoying — it’s neurologically expensive. When you shift from debugging a race condition to answering emails, your brain doesn’t simply “change tabs.” It goes through a complex process:

      -Memory consolidation: Storing your current mental model

      -Attention disengagement: Breaking focus from the current task

      -Cognitive reloading: Building a new mental model for the next task

      -Re-engagement: Getting back into flow

      Research from Carnegie Mellon shows that even brief interruptions can increase task completion time by up to 23%. For complex cognitive work like programming, this cost multiplies dramatically.

      Here’s another article from CMU discussing the same thing: https://www.sei.cmu.edu/blog/addressing-the-detrimental-effects-of-context-switching-with-devops/

      What this study shows is that a person who is faced with an unexpected context switch performs worse on a task than a user who has spent the last 12 questions performing the task the same way.

      This exact problem would happen if you replaced AI with a calculator, or made a person swap from using paper to doing mental math. The problem here is context switching, not AI.

      The way to ensure that the problem is AI and not the context switch, would be to continue the quest and see if the first group reverts back to baseline after 12 questions. 12 questions is how long the control group had to become acclimated to the task before their last context swap at the start of the test.

      Also, of note, this is a paper on arXiv it is not published so it has not gone through a peer-review process which would certainly catch the failure to set a proper control group.

      • chunes@lemmy.world
        link
        fedilink
        English
        arrow-up
        5
        ·
        edit-2
        13 days ago

        Context switching isn’t just X — it’s Y.

        Are we sure this was written by a human?

          • chunes@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            ·
            13 days ago

            Thanks.

            And I’m all for em dashes. After all, I started using them after reading enough books. It’s just that particular construct that strikes me as especially LLM-y.

            • FauxLiving@lemmy.world
              link
              fedilink
              English
              arrow-up
              1
              ·
              edit-2
              13 days ago

              I’d like to see a study on that, I see it mentioned so much it’s almost achieved meme status.

              It could very well be a Baader–(👀)Meinhof phenomenon.

  • texture@lemmy.world
    link
    fedilink
    English
    arrow-up
    13
    arrow-down
    1
    ·
    edit-2
    13 days ago

    i think reading the title of this post hurt my brain. like what are we doing here? making medical claims using sensationalist and meaningless language… seems unhelpful

  • Comet79@lemmy.world
    link
    fedilink
    English
    arrow-up
    15
    arrow-down
    3
    ·
    edit-2
    12 days ago

    1980: TVs will fry your brain

    1990: Videogames will fry your brain

    2000: Computers will fry your brain

    2010: Smartphones will fry your brain

    2020: AI will fry your brain

    Any takes for the 2030s?

    • EightBitBlood@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      12 days ago

      I mean, based fully on our current dystopian reality, I feel you just made a really good point about tech growing to a point where it fully captures you from reality, and indeed frys your brain by convincing you that fantasies are real.

      MAGA is a great example of people with brains so fried they think a pedophile exconman with 34 felonies who killed millions of Americans trough a poor pandemic response is somehow helping them by destroying USAID, DEI, Healthcare, and Social Security.

      Their brains are gonzo, all through the constant applied exploitation of all the tech you just mentioned combined.

      AI will absolutley make it worse.

  • HertzDentalBar@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    7
    ·
    12 days ago

    I fucking hate this AI shit but I’ll admit I end up using Gemini (knowing its wrong sometimes) but it’s like how I’d use Google but just more of a complex ask instead of simple search query’s, I couldn’t imagine using it beyond that other than a follow-up or two.

    It’s just a chatbot that has access to info, who goes onto their cable companies website and befriends the chatbot?

    • Boingboing_r@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      12 days ago

      I have found Google search to be getting progressively worse where as I can type out a question to Gemini that will return better results than Google search. It’s annoying that Google search has gotten so bad and duckduckgo will return you something interesting but not relavent. So Gemini is my Google search nowadays.

    • WIZARD POPE💫@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      12 days ago

      I’ve used gpt a coup times when I was searching the web and forums for well over an hour and found nothing relevant enough to work. Theissue got solved in 5-10 minutes.

  • zebidiah@lemmy.ca
    link
    fedilink
    English
    arrow-up
    9
    arrow-down
    2
    ·
    13 days ago

    AI is like a dog looking at itself in a mirror.

    Some dogs are smart, and understand that this is a tool and that it is there to help you see things better… Some dogs are fucking morons and think their reflection is another dog, and they wanna fuck and fight…

    There are a ton of good use cases for ai, and none of them include coquettish sexbots or drawings of me as a Simpson or a Ghibli sketch.

  • ElReatonVaquer0@lemmy.world
    link
    fedilink
    English
    arrow-up
    8
    arrow-down
    2
    ·
    14 days ago

    I think that if you use AI responsibly (as an assisting tool) like mentioned in the article, then you are pretty much on the safe side.

    But when you have AI do everything for you, then there’s a big problem.

    Personally I try not to use it at all, not a fan of all the problems that come with it.

    • Buffalox@lemmy.world
      link
      fedilink
      English
      arrow-up
      9
      ·
      edit-2
      13 days ago

      You clearly didn’t read the article, and you are dead wrong.
      Except you are right that if you let the AI do everything, it’s worse, and you lose a lot of ability for critical thinking.
      The last paragraph of the article even shows that other studies have shown that using AI assistance over time, will even have long term effect of lowering problem solving abilities!!

      Personally I try not to use it at all, not a fan of all the problems that come with it.

      This is the way. 😀

  • nonentity@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    8
    arrow-down
    2
    ·
    13 days ago

    I’ll never understand how an explosively imprecise, statistically luke-warm, grey goo extrusion sphincter could ever be mistaken for intelligence.

    AI doesn’t exist, it’s a vacuous marketing term.

    LLMs have vanishingly narrow legitimate, defensible use cases, but their output is intrinsically inaccurate, and should never be used without supervision from relevant domain experts.

    • texture@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      13 days ago

      theres plenty of legitimate use cases. your comment just sounds like youre repeating what everyone else says about it.

      • nonentity@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        1
        ·
        13 days ago

        There could be many use cases, and some of them may even be legitimate, but I’m yet to observe any which have broad applicability, and they should only ever be wielded by a responsible, expert adult.

        • texture@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          13 days ago

          its a multitool, its applicability needent be broad imo. anyway, im glad we arent speaking in absolutes.

  • tidderuuf@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    ·
    13 days ago

    Reaearchers: “Is the AI in the room with us now?”

    Test Subjects: “No Asshole! You just took it from me while I was in the middle of using it!”

  • lechekaflan@lemmy.world
    link
    fedilink
    English
    arrow-up
    4
    ·
    13 days ago

    I don’t want it, all it does is to negate years of learned experience and ability to organically formulate ideas.

  • melsaskca@lemmy.ca
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    1
    ·
    12 days ago

    Should we trust a researcher whose brain got fried. Did they remember to do the old double-blind setup before the frying of the brains occurred?

  • GreenKnight23@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    1
    ·
    13 days ago

    can confirm. Reddit is filled with abject brain dead dumbasses. since most content is AIGEN it makes sense.

  • mechoman444@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    3
    ·
    edit-2
    12 days ago

    Studies show that using a bulldozer for plowing a field decreases the farmers muscle density after just one day of use.

    Christ. What a load of shit.