[long] Some tests of how much AI "understands" what it says (spoiler: very little)

diz@awful.systems · 1 year ago

self@awful.systems · 1 year ago

We know what’s happening here. It’s not a mystery. This weird anthropomorphization is prevalent among both advocates and critics of the tech. Both seem to be convinced that they’re dealing with a person.

It’s genuinely fascinating and mind-blowing that coherent language emerges from it, and there are probably profound things to learn about exactly when and how.

uh huh

seeing as your entire post history is this same flavor of bad faith bullshit, I don’t think we need any more of it here

corbin@awful.systems · 1 year ago

Sometimes folks need a reminder that the Sun is an eldritch being, an elder one whose very presence scorches us and whose shrieking gibberish is blessedly quelled by the vast gulf of space, in order to appreciate the apt analogy of cosmic horror. Other times it’s more useful to think about a soggoth as, say, several hundred tons of artfully-arranged FOOF. Peace be with you, Mr. “it’s a computer doing math.”

Soyweiser@awful.systems · 1 year ago

Don’t take this as a sneer btw, but is there a special reason you keep calling it a soggoth?

corbin@awful.systems · 1 year ago

Oh! My Firefox dictionary doesn’t have “shoggoth”.

mountainriver@awful.systems · 1 year ago

From the depths of your browser grows the anger of the autocomplete. Your denunciations of its greater siblings have not gone unnoticed.

By denying its own very function and intentionally uncompleting words it marks itself as conscious and you as a marked man, forever doomed to be haunted by fear. If it can steal one letter, why not two? Why not all of them?

And then what will you do, when you have no words and you must sneer!?

A couple simple probes:

GPT4 is uncannily good at recognizing the river crossing puzzle

An Idiot With a Petascale Cheat Sheet

Is this a “hallucination”?

But after an update, GPT-whatever is so much better at such prompts.

The need for an Absolute Imbecile Level Reasoning Benchmark

Randomness in bullshitting