Hacker News

Latest

Ransomware Is Growing Three Times Faster Than the Spending Meant to Stop It

2026-04-14 @ 08:52:12Points: 21Comments: 16

An AI Vibe Coding Horror Story

2026-04-14 @ 08:35:45Points: 164Comments: 162

Backblaze has stopped backing up your data

2026-04-14 @ 08:30:27Points: 243Comments: 163

Introspective Diffusion Language Models

2026-04-14 @ 07:57:33Points: 73Comments: 19

The secrets of the Shinkansen

2026-04-14 @ 06:41:01Points: 100Comments: 90

Distributed DuckDB Instance

2026-04-14 @ 06:31:44Points: 81Comments: 17

MOS tech 6502 8-bit microprocessor in pure SQL powered by Postgres

2026-04-14 @ 05:46:49Points: 32Comments: 3

Multi-Agentic Software Development Is a Distributed Systems Problem

2026-04-14 @ 05:32:48Points: 47Comments: 16

TanStack Start Now Support React Server Components

2026-04-14 @ 05:28:51Points: 70Comments: 52

A new spam policy for “back button hijacking”

2026-04-14 @ 03:06:27Points: 423Comments: 251

DaVinci Resolve – Photo

2026-04-14 @ 02:25:15Points: 660Comments: 171

Lean proved this program correct; then I found a bug

2026-04-14 @ 00:25:08Points: 276Comments: 132

WiiFin – Jellyfin Client for Nintendo Wii

2026-04-13 @ 23:33:18Points: 176Comments: 76

N-Day-Bench – Can LLMs find real vulnerabilities in real codebases?

2026-04-13 @ 21:54:03Points: 78Comments: 24

Static vulnerability discovery benchmarks become outdated quickly. Cases leak into training data, and scores start measuring memorization. The monthly refresh keeps the test set ahead of contamination — or at least makes the contamination window honest.

Each case runs three agents: a Curator reads the advisory and builds an answer key, a Finder (the model under test) gets 24 shell steps to explore the code and write a structured report, and a Judge scores the blinded submission. The Finder never sees the patch. It starts from sink hints and must trace the bug through actual code.

Only repos with 10k+ stars qualify. A diversity pass prevents any single repo from dominating the set. Ambiguous advisories (merge commits, multi-repo references, unresolvable refs) are dropped.

Currently evaluating GPT-5.4, Claude Opus 4.6, Gemini 3.1 Pro, GLM-5.1, and Kimi K2.5. All traces are public.

Methodology: https://ndaybench.winfunc.com/methodology

Live Leaderboard: https://ndaybench.winfunc.com/leaderboard

Live Traces: https://ndaybench.winfunc.com/traces

GitHub Stacked PRs

2026-04-13 @ 20:36:49Points: 739Comments: 396

How to make Firefox builds 17% faster

2026-04-13 @ 18:50:08Points: 183Comments: 34

Someone bought 30 WordPress plugins and planted a backdoor in all of them

2026-04-13 @ 17:54:39Points: 988Comments: 277

Building a CLI for all of Cloudflare

2026-04-13 @ 15:44:02Points: 314Comments: 101

Nothing Ever Happens: Polymarket bot that always buys No on non-sports markets

2026-04-13 @ 15:31:06Points: 433Comments: 242

Make tmux pretty and usable (2024)

2026-04-13 @ 14:48:55Points: 391Comments: 242

US appeals court declares 158-year-old home distilling ban unconstitutional

2026-04-13 @ 13:37:33Points: 395Comments: 267

Android now stops you sharing your location in photos

2026-04-13 @ 11:48:15Points: 382Comments: 304

I just want simple S3

2026-04-11 @ 10:36:15Points: 193Comments: 103

Write less code, be more responsible

2026-04-11 @ 09:20:45Points: 104Comments: 63

Franklin's bad ads for Apple ][ clones and the beloved impersonator they depict

2026-04-10 @ 19:33:08Points: 26Comments: 6

Design and implementation of DuckDB internals

2026-04-10 @ 13:59:26Points: 134Comments: 9

Anastasia (1997) live action reference material

2026-04-10 @ 12:03:16Points: 33Comments: 8

A soft robot has no problem moving with no motor and no gears

2026-04-09 @ 22:33:41Points: 41Comments: 6

Rust Threads on the GPU

2026-04-09 @ 19:20:10Points: 85Comments: 24

Lumina – a statically typed web-native language for JavaScript and WASM

2026-04-09 @ 16:02:11Points: 19Comments: 6

Archives

2026

2025

2024

2023

2022