KOSMOS-G: Generating Images in Context with Multimodal Large Language Models

Even_Adder@lemmy.dbzer0.com · edit-2 1 year ago

KOSMOS-G: Generating Images in Context with Multimodal Large Language Models

Even_Adder@lemmy.dbzer0.com · 1 year ago

You’re kinda right. I saw this video a little while ago. I’ve linked it at the relevant part.

PipedLinkBot@feddit.rocks · 1 year ago

Here is an alternative Piped link(s):

this video

Piped is a privacy-respecting open-source alternative frontend to YouTube.

I’m open-source; check me out at GitHub.

rewarp@slrpnk.net · 1 year ago

Thank you for sharing this talk! Literally sweating as the ramifications started to hit me while watching it. Probably the most profound video I have watched in many years.

Even_Adder@lemmy.dbzer0.com · edit-2 1 year ago

PBS did a short video on this too. You might have seen it already.

PipedLinkBot@feddit.rocks · 1 year ago

Here is an alternative Piped link(s):

PBS

Piped is a privacy-respecting open-source alternative frontend to YouTube.

I’m open-source; check me out at GitHub.