Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
from
login
ArXivLean: How Well Can LLMs Formally Prove Research Math?
(
matharena.ai
)
3 points
by
OxfordCommand
9 days ago
|
past
|
discuss
BrokenArXiv: How often do LLMs claim to prove false theorems?
(
matharena.ai
)
3 points
by
robinhouston
49 days ago
|
past
MathArena: Evaluating LLMs on uncontaminated math questions
(
matharena.ai
)
2 points
by
GaggiX
77 days ago
|
past
New open source model achieves same score as GPT 5.2 High on AIME2026 I
(
matharena.ai
)
3 points
by
mh3467
81 days ago
|
past
|
2 comments
MathArena Apex: Unconquered Final-Answer Problems
(
matharena.ai
)
2 points
by
frozenseven
7 months ago
|
past
Evaluating publicly available LLMs on IMO 2025
(
matharena.ai
)
79 points
by
hardmaru
9 months ago
|
past
|
89 comments
Not Even Bronze: Evaluating LLMs on 2025 International Math Olympiad
(
matharena.ai
)
3 points
by
amichail
9 months ago
|
past
|
2 comments
IMO 2025 LLM results are in
(
matharena.ai
)
5 points
by
arberavdullahu
9 months ago
|
past
|
1 comment
Not Even Bronze? Evaluating LLMs on 2025 International Math Olympiad
(
matharena.ai
)
1 point
by
EvgeniyZh
9 months ago
|
past
|
1 comment
Gemini 2.5 gets 24.4% on MathArena USAMO beating previous top score of 4.7%
(
matharena.ai
)
54 points
by
alphabetting
on April 2, 2025
|
past
|
10 comments
OpenAI o3-mini scores 78% on yesterday's AIME 2025 math competition
(
matharena.ai
)
3 points
by
bmislav
on Feb 7, 2025
|
past
Consider applying for YC's Summer 2026 batch! Applications are open till May 4
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: