Hacker News
Bullshit Benchmark Explorer (petergpt.github.io)
9 points by smusamashah 65 days ago | 3 comments


Such a great project that could hopefully automate a lot of vibes testing! A pity that the dataset only contains 55 questions. I'd like to see this number in the thousands.



this isn't really bullshit, it's just nonsense. bullshit can only be understood in proper context. i swear i'm not bullshitting you.



