
Gemini 3.1 basically takes it home on that benchmark. Anyway, it's done.


Gemini is heavily benchmaxxed and sucks at agentic coding, so no surprise.



