dwaltrip · 2026-06-09T19:10:20 1781032220

METR is an independent organization.

dwaltrip · 2026-06-08T17:07:12 1780938432

I'm so sick of people who peddle outrage for a living.

dwaltrip · 2026-06-03T21:29:48 1780522188

The Alaskan sovereign wealth fund is a much better metaphor.

akramachamarei · 2026-06-04T00:18:10 1780532290

I see the comparison: AI companies are mining value out of public corpora, somewhat like how an oil or mining company extracts resources from the earth. An important difference is that when minerals are extracted they are removed from common use, whereas training a model does not subtract from the commons (at least not directly, or substantially?).

dwaltrip · 2026-06-03T15:22:39 1780500159

You aren’t really engaging with the substance or heart of the post, and your reading feels a bit knee-jerky and bad-faith to me.

dwaltrip · 2026-06-03T02:42:07 1780454527

The total market size might be low this year or the next. But, for better or worse, humans will continue to push into the unknown.

Reusable rockets will change the economics of space travel beyond recognition. Jevons paradox will strike hard and fast. Starlink is the initial proof of this.

Maybe Starship will be the first to achieve the fabled dream of rapid reusability. Maybe not. Either way, it’s a tractable engineering problem at this point and the path has been made pretty clear.

I have no idea what the valuation of SpaceX should be. But, in general, I’d bet a lot on the launch industry growing enormously in the coming decades.

ndsipa_pomu · 2026-06-03T08:19:16 1780474756

The increasing amount of space debris will likely change the economics of getting satellites into space and keeping them there. The more junk there is, the more likely that it's going to hit something and create yet more debris.

dwaltrip · 2026-05-28T18:04:26 1779991466

Wait, doesn’t the blog post say the price is the same as 4.7?

> Claude Opus 4.8 is available everywhere today. Pricing for regular usage is unchanged from Opus 4.7: $5 per million input tokens and $25 per million output tokens. Pricing for fast mode is $10 per million input tokens and $50 per million output tokens.

Where do you see the 2x cost?

XCSme · 2026-05-28T18:10:34 1779991834

The total cost of running my benchmarks was 1.6x higher compared to Opus 4.7, mostly because of 2x output tokens:

https://i.snipboard.io/vrdwTa.jpg

dwaltrip · 2026-05-28T19:36:51 1779997011

ah ok, thanks for clarifying!

spprashant · 2026-05-28T18:18:29 1779992309

If it spends 2x the tokens to achieve the same result, that's effectively 2x the cost, in a manner of speaking.
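
To make the arithmetic concrete, here's a rough sketch using the regular-usage prices quoted above ($5 per million input tokens, $25 per million output tokens). The token counts are made up purely for illustration; they are not anyone's actual benchmark numbers. It just shows how doubling output tokens at an unchanged price can land at roughly 1.6x rather than a full 2x once input tokens are part of the bill.

```python
# Prices quoted in the thread for regular usage (USD per million tokens).
INPUT_PRICE = 5.0
OUTPUT_PRICE = 25.0


def run_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one run at the quoted per-million-token prices."""
    return (input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE) / 1_000_000


# Hypothetical token counts, chosen only to illustrate the effect.
INPUT_TOKENS = 2_000_000
OUTPUT_TOKENS = 600_000

baseline = run_cost(INPUT_TOKENS, OUTPUT_TOKENS)      # same prices, original output volume
doubled = run_cost(INPUT_TOKENS, 2 * OUTPUT_TOKENS)   # same prices, 2x output tokens
ratio = doubled / baseline

print(f"baseline ${baseline:.2f}, doubled ${doubled:.2f}, ratio {ratio:.2f}x")
```

The ratio only approaches 2x when output tokens dominate the bill; the more input-heavy the workload, the smaller the effective increase.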

dwaltrip · 2026-05-28T17:54:00 1779990840

“Grown” is a highly apt metaphor, IMO. It quite succinctly captures some of the most fundamental differences between building Claude and building an Ikea desk, for example.

dwaltrip · 2026-05-28T17:50:34 1779990634

If you are using Claude Code, just set effort to xhigh.

This one change will probably solve 80% of the problems you have noticed.

orwin · 2026-05-28T18:03:44 1779991424

This. XHigh and 'plan' mode are absolutely a must-have for complex tasks.

Still, the context window is sometimes too small for my usage.

jayGlow · 2026-05-28T20:15:18 1779999318

Agent teams can help with that. The main agent acts as an orchestrator and spawns sub-agents to do the actual tasks, which generally keeps the main context from overflowing.

whatevaa · 2026-05-28T20:08:27 1779998907

Isn't xhigh on Opus 4.7 very expensive on tokens?

dwaltrip · 2026-05-28T20:25:52 1779999952

I’ve never run into the limits on the $100 plan, and rarely even get close.

I normally have only one session going at once though.

joshstrange · 2026-05-28T21:08:52 1780002532

Same here and while I have multiple sessions going from time to time, my day isn't spent primarily developing software directly anymore (due to role, nothing about LLMs).

I've only hit the $100/mo limits 1-2 times, and it was always <1hr before reset (once it was <5min, the other time ~45min).

I'm even considering going back down to $20 and using extra usage for the times I need to "burst".

sumedh · 2026-05-29T10:49:43 1780051783

Yes, but Anthropic made a deal with SpaceX and increased usage limits by 50%, so you might not hit your limits.

dwaltrip · 2026-05-23T21:16:59 1779571019

Programming is just another form of tool building, no? So anyone who builds things that humans use to solve problems is a job eliminator.

akoboldfrying · 2026-05-24T22:25:59 1779661559

I agree with you that there's no obvious way to separate tool creation from job elimination. This argument holds going right back to the wheel.

I think the strongest argument that can be made against this is that the supply of human labour has varied by time and place, so that in times where labour was short, new tools were no doubt welcomed by all, whereas when labour was plentiful, new tools that eliminated some of that demand for labour were opposed by those whose livelihoods were threatened. But this is not very satisfying because the types of labour available at any time are highly contingent on the current culture and technology, i.e., highly path-dependent.

(And I think there are different kinds of labour, and that not everyone can do every kind, contra the usual capitalist assumption.)

dwaltrip · 2026-05-19T08:06:05 1779177965

It’s wild how each model version is obsessed with certain particular phrases.

*load-bearing* just started popping up like crazy with Opus 4.7.

Although Claude will never hold a candle to Codex’s jargon, at least in my experience.