• fidodo@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      arrow-down
      3
      ·
      9 months ago

      The llm is executing a function on a diffusion image model. The llm does not generate the image itself

      • kelvie@lemmy.ca
        link
        fedilink
        English
        arrow-up
        9
        arrow-down
        2
        ·
        9 months ago

        This doesn’t contradict what the OP said. ChatGPT is now an interface to both an LLM and a diffusion-based image generator.

      • tsonfeir@lemm.ee
        link
        fedilink
        English
        arrow-up
        3
        arrow-down
        1
        ·
        9 months ago

        You’re being pedantic—and confidently ignorant. The product is called “ChatGPT” and through that you can access multiple models. Like ChatGPT 3.5, or DALL•E.

      • CrayonRosary@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        9 months ago

        ChatGPT is just a front-end that maintains a session that gets fed to an LLM each time you add a reply, and now has access to image gen, too, so I was wrong.

    • h3rm17@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      3
      ·
      9 months ago

      Yeah, but the model that does the images is actually Dall-e, you are just using gpt’s interface to create them