im personally excited for it.

  • steam@programming.devOP
    link
    fedilink
    arrow-up
    1
    arrow-down
    11
    ·
    12 hours ago

    openai for example claims chatgpt can perform close to or above phd level math. so your claim about it not passing math tests isn’t really correct.

    • trxxruraxvr@lemmy.world
      link
      fedilink
      arrow-up
      5
      ·
      12 hours ago

      so your claim about it not passing math tests isn’t really correct.

      You base this statement on claims from the company that’s trying to sell the product. Maybe try to make chatgpt do some sums first before believing these claims.

    • slazer2au@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      11 hours ago

      openai for example claims

      Claiming something doesn’t make it true. If it were true. It wouldn’t be a claim it would be a statement of fact which is directly opposed to the point of marketing.

      Seeing as you don’t support your claim that it can do permanent head damage maths, I will support my statement of they suck at maths.

      https://www.theregister.com/software/2026/02/26/ai-models-suck-slightly-less-at-math-than-they-did-last-year/5191967

    • brynden_rivers_esq@lemmy.ca
      link
      fedilink
      arrow-up
      3
      ·
      11 hours ago

      lol, sorry…the company selling a product made an outlandish claim and you believe it? I got a bridge to sell you!

      I think the fundamental problem is that you haven’t actually used these tools enough to see that the companies peddling them are snake oil salesmen. Nothing they say is true, they’re completely full of shit, and they’re terrified they’re not going to make any of the money they promised their investors they would make.

      And like…if you think about it for a moment it would make sense, given how these tools work, that they can’t do much in the way of reasoning. They just get better and better at replicating the language they’re supposed to mimic, which sometimes even carries the right content! Wow! But that’s only a secondary effect, right?