Hacker News
Latest
Lean proved this program correct; then I found a bug
2026-04-14 @ 00:25:08Points: 145Comments: 79
Why it's impossible to measure England's coastline
2026-04-13 @ 23:44:10Points: 23Comments: 17
WiiFin – Jellyfin Client for Nintendo Wii
2026-04-13 @ 23:33:18Points: 97Comments: 36
N-Day-Bench – Can LLMs find real vulnerabilities in real codebases?
2026-04-13 @ 21:54:03Points: 46Comments: 11
Static vulnerability discovery benchmarks become outdated quickly. Cases leak into training data, and scores start measuring memorization. The monthly refresh keeps the test set ahead of contamination — or at least makes the contamination window honest.
Each case runs three agents: a Curator reads the advisory and builds an answer key, a Finder (the model under test) gets 24 shell steps to explore the code and write a structured report, and a Judge scores the blinded submission. The Finder never sees the patch. It starts from sink hints and must trace the bug through actual code.
Only repos with 10k+ stars qualify. A diversity pass prevents any single repo from dominating the set. Ambiguous advisories (merge commits, multi-repo references, unresolvable refs) are dropped.
Currently evaluating GPT-5.4, Claude Opus 4.6, Gemini 3.1 Pro, GLM-5.1, and Kimi K2.5. All traces are public.
Methodology: https://ndaybench.winfunc.com/methodology
Live Leaderboard: https://ndaybench.winfunc.com/leaderboard
Live Traces: https://ndaybench.winfunc.com/traces
Tokens – The New Dopamine Economy
2026-04-13 @ 21:48:00Points: 14Comments: 4
Stanford report highlights growing disconnect between AI insiders and everyone
2026-04-13 @ 21:25:38Points: 226Comments: 308
Mathematical Minimalism
2026-04-13 @ 20:45:39Points: 27Comments: 1
Show HN: Continual Learning with .md
2026-04-13 @ 20:42:19Points: 22Comments: 10
For retrieval, there is a semantic filesystem that makes it easy for LLMs to search using shell commands.
It is currently a scrappy v1, but it works better than anything I have tried.
Curious for any feedback!
GitHub Stacked PRs
2026-04-13 @ 20:36:49Points: 530Comments: 286
Ascending into the Realm of Japanese Charts
2026-04-13 @ 20:29:55Points: 46Comments: 1
Just Enough Chimera Linux
2026-04-13 @ 19:37:54Points: 50Comments: 12
(AMD) Build AI Agents That Run Locally
2026-04-13 @ 19:28:41Points: 111Comments: 25
Show HN: Ithihāsas – a character explorer for Hindu epics, built in a few hours
2026-04-13 @ 19:10:07Points: 128Comments: 31
I’ve always found it hard to explore the Mahābhārata and Rāmāyaṇa online. Most content is either long-form or scattered, and understanding a character like Karna or Bhishma usually means opening multiple tabs.
I built https://www.ithihasas.in/ to solve that. It is a simple character explorer that lets you navigate the epics through people and their relationships instead of reading everything linearly.
This was also an experiment with Claude CLI. I was able to put together the first version in a couple of hours. It helped a lot with generating structured content and speeding up development, but UX and data consistency still needed manual work.
Would love feedback on the UX and whether this way of exploring mythology works for you.