• Riskable@programming.dev
    link
    fedilink
    English
    arrow-up
    31
    arrow-down
    4
    ·
    3 months ago

    This is sad, actually, because this very technology is absolutely fantastic at identifying things in images. That’s how image generation works behind the scenes!

    esp32-cam identifying a cat, a bike, and a car in an image

    ChatGPT screwed this up so badly because it’s programmed to generate images instead of using reference images and then identifying the relevant parts. Which is something a tiny little microcontroller board can do.

    If they just paid to license a data set of medical imagesOh wait! They already did that!

    Sigh

    • brucethemoose@lemmy.world
      link
      fedilink
      arrow-up
      7
      ·
      edit-2
      3 months ago

      Yeah, I mean, this can be done with text too.

      People should mostly be using a RAG retrieval system, not pure LLM slop like this, for reference. It just hasn’t really been made at scale because Google Search functioned as that well enough, and AI Bros seem to think everything should be done within LLM weights instead of proper databases.

      I mean… WTF. What if human minds were not allowed to use references?

      WolframAlpha was kinda trying to build this, but stalled.

    • bampop@lemmy.world
      link
      fedilink
      arrow-up
      1
      ·
      3 months ago

      Maybe it knows but doesn’t care. The general idea of meatbag anatomy is that they are full of soft squishy tubes and they die easy. Who cares about the details?

    • Melvin_Ferd@lemmy.world
      link
      fedilink
      arrow-up
      1
      arrow-down
      10
      ·
      3 months ago

      How much of the public outcry against data collection has resulted in us getting an inferior product publicly.

      • Riskable@programming.dev
        link
        fedilink
        English
        arrow-up
        2
        ·
        3 months ago

        For images, it’s not even data collection because all the images that are used for these AI image generation tools are out on the internet for free for anyone to download right now. That’s how they’re obtained: A huge database of (highly categorized) image URLs (e.g. ImageNET) is crawled/downloaded.

        That’s not even remotely the same thing as “data collection”. That’s when a company vacuums everything they can from your private shit. Not that photo of an interesting building you uploaded to flickr over a decade ago.