More

Amekedl · 2026-06-24T07:57:34 1782287854

Looks good. Won't ever buy a font though.

Amekedl · 2026-06-22T08:21:03 1782116463

usage billing over the monthly plan when deepseek is over x25 cheaper?

3stacks · 2026-06-23T01:03:42 1782176622

Maybe I'll try out a month on the usage plan and see how it compares dollar for dollar. I think I'm squeezing about as much as I can out of Anthropic under the plan

Amekedl · 2026-06-06T22:40:50 1780785650

Very, very early already with GPT-3.

The simple fact that people will act on it and believe just because what they insinuated as a prompt and the answer being churned out on the screen looking somewhat readable.

That alone was going to seed so much discord and reinforce invalid messages, truly "oh shit".

Amekedl · 2026-05-25T05:24:59 1779686699

DeepSeek rules. I'm using it to do stuff that's not too big in scope, because I still need to remain in charge. Even for this, western competitors have no chance, least Anthropic and OpenAI, plus Gemini also has gotten too expensive besides flash (which is arguably just great, too).

With this, I am sticking to deepseek-v4-pro entirely.

Amekedl · 2026-05-24T01:23:10 1779585790

Dev tools. The debugger is something for example that Microsoft ostensibly keeps to their own products, and how they totally slaughtered omnisharp.

It killed my daily csharp vscode driver couple of years ago, only now catching back up somewhat, but still unusable for bigger solutions.

That move made me gravitate towards vscodium, and avoiding csharp where possible.

Microsoft's move only recently got more understandable to me, because Cursor and others basically stole vscode to establish their "empire".

jayd16 · 2026-05-24T05:51:20 1779601880

If you can use Jetbrains, Rider is on par with IntelliJ. From that perspective, both languages have a best in class debugger.

369548684892826 · 2026-05-24T07:50:24 1779609024

Rider is very good but this subthread is about the lack of open source dev tooling.

Amekedl · 2026-05-22T22:00:45 1779487245

Agreed, also amazing citations in the parent comment ^^

Amekedl · 2026-05-22T21:47:31 1779486451

I don't buy it. A lot of stuff this finds is also just simply wrong, benignly reported as true, despite upper/lower layers in the code burying the possibility of a vulnerability actually being exploited. It's a performance/security trade-off too, it always has been. Additional checks and other measures do in fact need to be performed for security purposes.

Great marketing as always, but the rose-tinted view many have seems vicariously misplaced.

solenoid0937 · 2026-05-22T21:54:43 1779486883

In the article they describe how all the vulns are actually exploitable end to end and >1000 have been independently verified as critical.

These aren't unreachable vulns.

Amekedl · 2026-05-22T22:03:12 1779487392

Where is the link to the advisories then? :/

skybrian · 2026-05-22T22:19:06 1779488346

As the article explains, they mostly haven't been disclosed, because they're not fixed. They're giving people 90 days, or 45 after a patch is made.

gck1 · 2026-05-22T23:10:24 1779491424

> haven't been disclosed, because they're not fixed.

That's convinient.

But wait, don't they have this amazing AI that can fix all the issues itself with a single /goal command? What's the holdup?

solenoid0937 · 2026-05-22T23:21:21 1779492081

You should really read the article, every question asked so far in this thread has been very clearly answered.

I miss the days when HN would RTFA.

dyauspitr · 2026-05-23T07:46:03 1779522363

He doesn’t want to read the article. He just wants to LLM bad.

TOMDM · 2026-05-23T02:46:04 1779504364

From the article

> As we noted above, the bottleneck in fixing bugs like these is the human capacity to triage, report, and design and deploy patches for them.

...

> To begin, we’ve released Claude Security in public beta for Claude Enterprise customers. It’s a tool that helps teams scan their codebases for vulnerabilities, and which can generate proposed fixes for them. In the three weeks since launch, Claude Opus 4.7 has been used to patch over 2,100 vulnerabilities. (This is faster than the open-source patching described above in large part because enterprises are fixing their own code, whereas open-source fixes usually require volunteer maintainers who work through coordinated disclosure.)

Your critique of the article would likely land much better if you engaged with it.

solenoid0937 · 2026-05-22T23:09:06 1779491346

> The software industry’s longstanding convention is to disclose new vulnerabilities 90 days after they’re discovered (or, if a patch is created before the 90 days is up, around 45 days after the patch becomes available). This allows time for end users to update their software before a vulnerability can be exploited by attackers. Our own Coordinated Vulnerability Disclosure policy takes this approach.

> However, this means that disclosed vulnerabilities are a lagging indicator of the accelerating frontier of AI models’ cyber capabilities: we’re not yet at the point where we can fully detail our partners’ findings with Mythos Preview without putting end users at risk. Instead, we provide illustrative examples of the model’s performance, along with aggregate statistics on our progress to date. Once patches for the vulnerabilities that Mythos Preview has discovered are widely deployed, we’ll provide much more detail about what we’ve learned.

darkamaul · 2026-05-22T22:07:30 1779487650

I guess you could look at https://red.anthropic.com/2026/cvd/ to see exactly what was discovered.

Amekedl · 2026-05-22T22:11:35 1779487895

Thank you. Looking at the WebDAV in nginx, this is exactly what I searched for, wanted to read, and confirmed my suspicions ^^ But this one takes the cake truly... https://red.anthropic.com/2026/cvd/findings/ANT-2026-CN7KX43...

rafgg · 2026-05-22T22:28:00 1779488880

Specially when this has been OAI/Anthropic's MO for years at this point.

Amekedl · 2026-05-19T07:19:41 1779175181

You are absolutely right! Kidding, but the analogy sits comfortably with me. I wonder though if this kind of behavior is potentially harmful, most likely less than drugs but nonetheless...

skiing_crawling · 2026-05-19T07:55:03 1779177303

triggered me with that first sentence

Amekedl · 2026-05-13T15:10:17 1778685017

The future of open washing

Amekedl · 2026-04-24T17:00:41 1777050041

I’d call it “open washing”, but it looks cool. Good luck with it

LarsenCC · 2026-04-24T18:42:20 1777056140

Curious why? You can just take this and run locally or deploy anywhere you'd like with any provider agent provider.