Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
Bullshit Benchmark Explorer
(
petergpt.github.io
)
9 points
by
smusamashah
65 days ago
|
hide
|
past
|
favorite
|
3 comments
fragebogen
65 days ago
|
next
[–]
Such a great project that could automate a lot vibes testing hopefully! A pity that the dataset only contains 55 questions. I'd like to see this number in the thousands.
smusamashah
65 days ago
|
prev
|
next
[–]
https://github.com/petergpt/bullshit-benchmark
drsalt
64 days ago
|
prev
[–]
this isn't really bullshit, it's just nonsense. bullshit can only be understood in proper context. i swear i'm not bullshitting you.
Consider applying for YC's Summer 2026 batch! Applications are open till May 4
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: