The LLMs I have tested have terrible world models and intuitions for how actions change the environment. They're also not great at discerning and pursuing the right goals. They're like an infinitely patient five-year-old with amazing vocabulary. [1]
I'm going to ignore all that and tell my developers working in complicated codebases that they have to use AI. I'm sure comprehending side effects in a world-building text adventure is completely different from understanding spaghetti code.
Great on small snippets of code, passable on larger pieces of code, great at finding vulnerabilities in large pieces of code, terrible at Zork. All in all, a jagged frontier that defies a simple sarcastic characterization.
You keep a running document called "state of the world". On every turn, you read this document in (as context), use it to help compute what happens, and, based on what happens, write an updated "state of the world" document. Tracking the important details keeps your LLM consistent from turn to turn.
If you're running an RPG, which I guess is where this is most obvious, you track the player and enemy positions, their health, their moods and perhaps top thoughts, and the state of important inanimate objects. If you break down the door, you update the door's state in the document. This is in contrast to just giving the LLM the previous turns and hoping it realizes later (just by statistical completion) that the door is broken down.
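Here's a minimal sketch of that loop in Python. The call_llm helper and the === STATE === separator convention are placeholders of my own invention, standing in for whatever model API and output format you actually use:

    def call_llm(prompt: str) -> str:
        # Placeholder for your real model call; returns a canned reply
        # here so the sketch runs end to end.
        return (
            "With a crash, the oak door gives way.\n"
            "=== STATE ===\n"
            "Player: standing in the doorway, unhurt, feeling bold.\n"
            "Door: broken down."
        )

    def play_turn(world_state: str, player_action: str) -> tuple[str, str]:
        """Narrate one turn and return (narration, updated world state)."""
        prompt = (
            "You are the game master of a text adventure.\n\n"
            "Current state of the world:\n"
            f"{world_state}\n\n"
            f"The player does: {player_action}\n\n"
            "Narrate what happens. Then, after a line containing only\n"
            "=== STATE ===, rewrite the full state-of-the-world document,\n"
            "updating every detail that changed (positions, health, moods,\n"
            "the state of objects such as a broken-down door)."
        )
        reply = call_llm(prompt)
        narration, _, new_state = reply.partition("=== STATE ===")
        # If the model forgot the separator, keep the old state rather
        # than silently losing it.
        return narration.strip(), new_state.strip() or world_state

    world_state = "Player: at the oak door, unhurt.\nDoor: locked, intact."
    narration, world_state = play_turn(world_state, "break down the door")
    print(narration)
    print(world_state)  # now records the door as broken down

Because the full rewritten document goes back in as context each turn, the fact that the door is broken down survives no matter how long the transcript grows.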
[1]: https://entropicthoughts.com/updated-llm-benchmark
(More descriptions are available in the earlier evaluations referenced from there.)