Hacker News
Eight Short Studies on Excuses (lesswrong.com)
1 point by microsoftedging 19 hours ago | past | discuss
How Go Players Disempower Themselves to AI (lesswrong.com)
2 points by cubefox 21 hours ago | past | discuss
Article explains a method to estimate parameter count of closed source models (lesswrong.com)
2 points by theanonymousone 1 day ago | past | 1 comment
Cyborg Evals (lesswrong.com)
2 points by frmsaul 2 days ago | past | 1 comment
Probes trace an emergent jailbreak in OLMo 2 to mislabeled training data (lesswrong.com)
2 points by aranguri 3 days ago | past | discuss
Opus 4.7 Part 3: Model Welfare (lesswrong.com)
11 points by omer_k 10 days ago | past | 8 comments
Claude Code sometimes hallucinates user messages (lesswrong.com)
2 points by cubefox 12 days ago | past | 2 comments
There are only four skills: design, technical, management and physical (lesswrong.com)
3 points by samuel246 13 days ago | past | discuss
Summarizing and Reviewing my earliest ML research paper, 7 years later (lesswrong.com)
2 points by joozio 13 days ago | past | discuss
Resources for starting and growing an AI safety org (lesswrong.com)
1 point by omer_k 13 days ago | past | discuss
Only Law Can Prevent Extinction (lesswrong.com)
3 points by namanyayg 14 days ago | past | 1 comment
LLMs will soon disrupt algorithmic media feeds (lesswrong.com)
3 points by linhns 14 days ago | past
Working hurts less than procrastinating, we fear the twinge of starting (2011) (lesswrong.com)
14 points by davikr 14 days ago | past | 2 comments
The AlphaFold moment for materials is not any time soon (lesswrong.com)
8 points by gmays 17 days ago | past
Morale (lesswrong.com)
2 points by jger15 17 days ago | past
You're gonna need a bigger benchmark, METR (lesswrong.com)
3 points by frmsaul 19 days ago | past
Hypotheses for Why Models Fail on Long Tasks (lesswrong.com)
1 point by joozio 19 days ago | past
Splitting Mounjaro pens for fun and profit (lesswrong.com)
2 points by henryaj 20 days ago | past
We're running out of benchmarks to upper bound AI capabilities (lesswrong.com)
15 points by gmays 22 days ago | past | 10 comments
AIs can now do easy-to-verify SWE tasks, I've shortened timelines (lesswrong.com)
3 points by gmays 22 days ago | past
The effects of caffeine consumption do not decay with a ~5 hour half-life (lesswrong.com)
101 points by swah 22 days ago | past | 105 comments
My Picture of the Present in AI (lesswrong.com)
1 point by speckx 22 days ago | past
Most people can't juggle one ball (lesswrong.com)
507 points by surprisetalk 23 days ago | past | 175 comments
"Alignment" and "Safety", Part One: What Is "AI Safety"? (lesswrong.com)
1 point by joozio 25 days ago | past
Paper Close Reading: "Why Language Models Hallucinate" (lesswrong.com)
2 points by joozio 26 days ago | past
Estimates of the expected utility gain of AI Safety Research (lesswrong.com)
1 point by joozio 26 days ago | past
What I like about MATS and Research Management (lesswrong.com)
2 points by joozio 27 days ago | past
Predicting When RL Training Breaks Chain-of-Thought Monitorability (lesswrong.com)
2 points by gmays 27 days ago | past
AI Safety at the Frontier: Paper Highlights of February and March 2026 (lesswrong.com)
2 points by joozio 28 days ago | past
How to emotionally grasp the risks of AI Safety (lesswrong.com)
3 points by joozio 28 days ago | past