There is no 'meaning' inside these AI's. It's terribly confusing to think about ...

SequoiaHope · on April 5, 2023

There’s no meaning to the tokens, but research has shown that the models themselves capture meaning. Technically they are producing the next word but in order to do that for a dataset of a trillion words they actually have to develop internal models of how the world works. There was a post on HN a couple days ago that talked about the research done to show this.

psychphysic · on April 5, 2023

You say that but we have models of meaning in humans too.

You can put people in an fMRI and ask them to think "car".

You can ask someone to think of objects and detect when they think "car".

What happened there pairing a bunch of tensors to meanings and matching them.

We can do something similar with embeddings.

To be clear I don't intend to give the impression that these LLMs are doing something miraculous. Just that we are increasingly peeling back the veil of how brains think.

wizzwizz4 · on April 5, 2023

> You can put people in an fMRI and ask them to think "car".

I don't know about other people, but when I think “car” really hard, I can feel the muscles in my throat adjust slightly to match the sound of the word “car”. Perhaps that sort of thing is what the MRI machines is picking up, rather than being able to pick up some kind of "internal representation" of car.

psychphysic · on April 5, 2023

In fact it also picks up the parts of your brain to do with driving (if you're a driver). Maybe also the part to do with the smell of fuel in me, but not you.

It'll also light up in the parts of my brain to do with reading, writing, hearing the word in the languages I speak.

What does car mean to me if it doesn't connect to all the concepts that relate to cars?

Zircom · on April 5, 2023

Maybe they should do the same study on people that lack an internal monologue to see if they have the same results.

goldfeld · on April 5, 2023

If it just decides on a single token at a time, can it backtrack and choose differently under that operation, given the next tokens? What I wonder is, how can it plan ahead and output meaningful (to us) responses, like working code or useful articles? How can it "reason" logically when it needs to solve a problem, a riddle etc, by only selecting a token at a time? Wouldn't that dumbed down approach prove myopic for complex compositions? Doesn't it need some over-ruling goal-based heuristic system?

wingspar · on April 5, 2023

There’s no planning, no reason. It’s all ‘what word is next…’

I found Stephen Wolframs explanation helpful. He has a YouTube video version which I enjoyed too. This blog post was on HN last month, but I never get good search results on hn

https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-...

goldfeld · on April 5, 2023

If we get a bit quantum (or an act of God for some), then backtracking could happen by collapsing the dead-ends and "changing" history to stay with what turns out to be the solid plan. Could emergent conscience on AI's neurons do the planning and reasoning that it rather seems to be doing but ML experts will say it is not? If our conscience could by any chance reside not in the electrical currents of the wetware, could AI's reason also not reside in tokens? Is there some mysterious process possible to be taking place?

resters · on April 6, 2023

It seems that the output feels reasoned because of the reasoning implicit in language patterns. Which is enough to write code and essays, apparently.

goldfeld · on April 6, 2023

N.B. to self: study more Foucault to grasp this.

zeven7 · on April 5, 2023

It is wild that a process like that can generate working code. Humans speak their words in order, but they don't write their code in order. Why would writing code in order work?

flangola7 · on April 5, 2023

With GPT-4 this process also allows it to understand what is inside a graphical image and talk intelligently and coherently about it.

Next token prediction produces the most head exploding emergent effects.

psychphysic · on April 5, 2023

Bard at least produces multiple drafts. I believe that is preferred over backtracking.

Generation is ultimately deterministic (seeded prng) so backtracking wouldn't make sense.