Hacker Newsnew | past | comments | ask | show | jobs | submit | dwaltrip's commentslogin

METR is an independent organization.

I'm so sick of people who peddle outrage for a living.

The Alaskan sovereign wealth fund is a much better metaphor.

I see the comparison--like AI companies are mining value out of public corpora, somewhat like how an oil or mining company extracts resources from the earth. An important difference being that when minerals are extracted they are removed from common use, unlike when a model is trained, which does not subtract from the commons (at least not directly, or substantially?)

You aren’t really engaging with the substance or heart of the post, and your reading feels a bit knee-jerky and bad-faith to me.

The total market size might be low this year or the next. But, for better or worse, humans will continue to push into the unknown.

Reusable rockets will change the economics of space travel beyond recognition. Jevon’s paradox will strike hard and fast. Starlink is the initial proof of this.

Maybe Starship will be the first to achieve the fabled dream of rapid reusability. Maybe not. Either way, it’s a tractable engineering problem at this point and the path has been made pretty clear.

I have no idea what the valuation of SpaceX should be. But, in general, I’d bet a lot on the launch industry growing enormously in the coming decades.


The increasing amount of space debris will likely change the economics of getting satellites into space and keeping them there. The more junk there is, the more likely that it's going to hit something and create yet more debris.

Wait, doesn’t the blog post say the price is the same as 4.7?

> Claude Opus 4.8 is available everywhere today. Pricing for regular usage is unchanged from Opus 4.7: $5 per million input tokens and $25 per million output tokens. Pricing for fast mode is $10 per million input tokens and $50 per million output tokens.

Where do you see the 2x cost?


The total cost of running my benchmarks, was 1.6x higher compared to Opus 4.7, mostly because of 2x output tokens:

https://i.snipboard.io/vrdwTa.jpg


ah ok, thanks for clarifying!

If it spends 2x tokens to achieve the same result, that's effective 2x cost in a manner of speaking

“Grown” is a highly apt metaphor, IMO. It quite succinctly captures some of the most fundamental differences between building Claude and building an Ikea desk, for example.

If you are using Claude code, just set effort to xhigh.

This one change will probably solve 80% of the problems you have noticed.


This. XHigh and the 'plan' mode for complex tasks is absolutely a must have.

Still, the context window is sometimes too small for my usage.


agent teams can help with that, the main agent acts as an orchestrator and spawns sub agents to do the actual tasks it generally keeps the main context from overflowing.

Isn't xhigh on opus 4.7 very expensive on tokens?

I’ve never ran into the limits on the $100 plan, and rarely even get close.

I normally have only one session going at once though.


Same here and while I have multiple sessions going from time to time, my day isn't spent primarily developing software directly anymore (due to role, nothing about LLMs).

I only ever hit the $100/mo limits 1-2 times ever and it was always <1hr before reset (once it was <5min, the other was like ~45min).

I'm even considering going back down to $20 and using extra usage for the times I need to "burst".


Yes but Anthropic made a deal with SpaceX and increase usage limits by 50%, so you might not hit your limits.

Programming is just another form of tool building, no? So anyone who builds things that humans use to solve problems is a job eliminator.


I agree with you that there's no obvious way to separate tool creation from job elimination. This argument holds going right back to the wheel.

I think the strongest argument that can be made against this is that the supply of human labour has varied by time and place, so that in times where labour was short, new tools were no doubt welcomed by all, whereas when labour was plentiful, new tools that eliminated some of that demand for labour were opposed by those whose livelihoods were threatened. But this is not very satisfying because the types of labour available at any time are highly contingent on the current culture and technology, i.e., highly path-dependent.

(And I think there are different kinds of labour, and that not everyone can do every kind, contra the usual capitalist assumption.)


It’s wild how each model version is obsessed with certain particular phrases.

*load-bearing* just started popping up like crazy with opus 4.7.

Although Claude will never hold a candle to Codex’s jargon, at least in my experience.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: