Hacker News

Latest

Boeing knew of flaw in part linked to UPS plane crash

2026-01-16 @ 04:11:17Points: 41Comments: 10

OpenBSD-current now runs as guest under Apple Hypervisor

2026-01-16 @ 03:10:36Points: 110Comments: 5

SETI Home Flags 100 Signals After Sorting 12B Others

2026-01-16 @ 02:45:15Points: 55Comments: 13

Show HN: Reversing YouTube’s “Most Replayed” Graph

2026-01-16 @ 02:06:11Points: 22Comments: 4

I recently noticed a recurring visual artifact in the "Most Replayed" heatmap on the YouTube player. The highest peaks were always surrounded by two dips. I got curious about why they were there, so I decided to reverse engineer the feature to find out.

This post documents the deep dive. It starts with a system design recreation, reverse engineering the rendering code, and ends with the mathematics.

This is also my first attempt at writing an interactive article. I would love to hear your thoughts on the investigation and the format.

Show HN: Gambit, an open-source agent harness for building reliable AI agents

2026-01-16 @ 00:13:25Points: 58Comments: 11

Wanted to show our open source agent harness called Gambit.

If you’re not familiar, agent harnesses are sort of like an operating system for an agent... they handle tool calling, planning, context window management, and don’t require as much developer orchestration.

Normally you might see an agent orchestration framework pipeline like:

compute -> compute -> compute -> LLM -> compute -> compute -> LLM

we invert this so with an agent harness, it’s more like:

LLM -> LLM -> LLM -> compute -> LLM -> LLM -> compute -> LLM

Essentially you describe each agent in either a self contained markdown file, or as a typescript program. Your root agent can bring in other agents as needed, and we create a typesafe way for you to define the interfaces between those agents. We call these decks.

Agents can call agents, and each agent can be designed with whatever model params make sense for your task.

Additionally, each step of the chain gets automatic evals, we call graders. A grader is another deck type… but it’s designed to evaluate and score conversations (or individual conversation turns).

We also have test agents you can define on a deck-by-deck basis, that are designed to mimic scenarios your agent would face and generate synthetic data for either humans or graders to grade.

Prior to Gambit, we had built an LLM based video editor, and we weren’t happy with the results, which is what brought us down this path of improving inference time LLM quality.

We know it’s missing some obvious parts, but we wanted to get this out there to see how it could help people or start conversations. We’re really happy with how it’s working with some of our early design partners, and we think it’s a way to implement a lot of interesting applications:

- Truly open source agents and assistants, where logic, code, and prompts can be easily shared with the community.

- Rubric based grading to guarantee you (for instance) don’t leak PII accidentally

- Spin up a usable bot in minutes and have Codex or Claude Code use our command line runner / graders to build a first version that is pretty good w/ very little human intervention.

We’ll be around if ya’ll have any questions or thoughts. Thanks for checking us out!

Walkthrough video: https://youtu.be/J_hQ2L_yy60

My Gripes with Prolog

2026-01-16 @ 00:11:28Points: 53Comments: 36

All 23-Bit Still Lifes Are Glider Constructible

2026-01-15 @ 23:59:06Points: 28Comments: 3

Why senior engineers let bad projects fail

2026-01-15 @ 22:33:58Points: 161Comments: 115

Aviator (YC S21) is hiring to build multiplayer AI coding platform

2026-01-15 @ 21:00:42Points: 1

Linux boxes via SSH: suspended when disconected

2026-01-15 @ 20:20:13Points: 144Comments: 94

Briar keeps Iran connected via Bluetooth and Wi-Fi when the internet goes dark

2026-01-15 @ 19:38:02Points: 203Comments: 89

Data is the only moat

2026-01-15 @ 18:54:32Points: 101Comments: 25

JuiceFS is a distributed POSIX file system built on top of Redis and S3

2026-01-15 @ 18:45:18Points: 130Comments: 70

CVEs affecting the Svelte ecosystem

2026-01-15 @ 17:51:24Points: 147Comments: 26

Supply Chain Vuln Compromised Core AWS GitHub Repos & Threatened the AWS Console

2026-01-15 @ 17:30:10Points: 104Comments: 23

Ask HN: How can we solve the loneliness epidemic?

2026-01-15 @ 16:49:13Points: 478Comments: 785

Countless voiceless people sit alone every day and have no one to talk to, people of all ages, who don't feel that they can join any local groups. So they sit on social media all day when they're not at work or school. How can we solve this?

Found: Medieval Cargo Ship – Largest Vessel of Its Kind Ever

2026-01-15 @ 15:09:16Points: 139Comments: 35

Apple is fighting for TSMC capacity as Nvidia takes center stage

2026-01-15 @ 15:02:42Points: 618Comments: 372

25 Years of Wikipedia

2026-01-15 @ 13:17:07Points: 472Comments: 382

Photos capture the breathtaking scale of China's wind and solar buildout

2026-01-15 @ 09:54:10Points: 592Comments: 446

Use of Bayesian methodology in clinical trials of drug and biological products [pdf]

2026-01-15 @ 07:36:15Points: 54Comments: 17

Pocket TTS: A high quality TTS that gives your CPU a voice

2026-01-15 @ 05:14:08Points: 274Comments: 52

Claude is good at assembling blocks, but still falls apart at creating them

2026-01-14 @ 16:26:11Points: 203Comments: 148

Inside The Internet Archive's Infrastructure

2026-01-14 @ 07:26:52Points: 294Comments: 65

First impressions of Claude Cowork

2026-01-14 @ 06:14:26Points: 164Comments: 90

Show HN: OpenWork – An open-source alternative to Claude Cowork

2026-01-14 @ 04:55:04Points: 158Comments: 30

i built openwork, an open-source, local-first system inspired by claude cowork.

it’s a native desktop app that runs on top of opencode (opencode.ai). it’s basically an alternative gui for opencode, which (at least until now) has been more focused on technical folks.

the original seed for openwork was simple: i have a home server, and i wanted my wife and i to be able to run privileged workflows. things like controlling home assistant, or deploying custom web apps (e.g. our customs recipe app recipes.benjaminshafii.com), legal torrents, without living in a terminal.

our initial setup was running the opencode web server directly and sharing credentials to it. that worked, but i found the web ui unreliable and very unfriendly for non-technical users.

the goal with openwork is to bring the kind of workflows i’m used to running in the cli into a gui, while keeping a very deep extensibility mindset. ideally this grows into something closer to an obsidian-style ecosystem, but for agentic work.

some core principles i had in mind:

- open by design: no black boxes, no hosted lock-in. everything runs locally or on your own servers. (models don’t run locally yet, but both opencode and openwork are built with that future in mind.) - hyper extensible: skills are installable modules via a skill/package manager, using the native opencode plugin ecosystem. - non-technical by default: plans, progress, permissions, and artifacts are surfaced in the ui, not buried in logs.

you can already try it: - there’s an unsigned dmg - or you can clone the repo, install deps, and if you already have opencode running it should work right away

it’s very alpha, lots of rough edges. i’d love feedback on what feels the roughest or most confusing.

happy to answer questions.

Go-legacy-winxp: Compile Golang 1.24 code for Windows XP

2026-01-12 @ 13:32:42Points: 89Comments: 35

The five orders of ignorance (2000)

2026-01-12 @ 13:00:21Points: 13Comments: 4

I Built a 1 Petabyte Server from Scratch [video]

2026-01-11 @ 03:55:53Points: 30Comments: 2

What a Programmer Does (1967) [pdf]

2026-01-11 @ 02:15:33Points: 35Comments: 5

Archives

2026

2025

2024

2023

2022