Hacker News

Latest

Fintech Engineering Handbook

2026-06-27 @ 10:28:53 | Points: 92 | Comments: 26

DeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]

2026-06-27 @ 09:18:52 | Points: 394 | Comments: 116

Beer CSS – Build material design in record time

2026-06-27 @ 09:06:11 | Points: 50 | Comments: 12

IBM MCGA Gate Array Reverse Engineering

2026-06-27 @ 05:17:08 | Points: 40 | Comments: 8

OpenTTD 16.0-Beta1

2026-06-27 @ 04:31:06 | Points: 188 | Comments: 33

WordStar: A Writer's Word Processor (1996)

2026-06-27 @ 03:30:01 | Points: 109 | Comments: 46

Anatomy of a Failed (Nation-State?) Attack

2026-06-27 @ 02:41:53 | Points: 77 | Comments: 12

Show HN: Hacker News on a train station-style flip board

2026-06-27 @ 00:43:39 | Points: 80 | Comments: 18

The idea here is that people (like you or me) can create something neat like this, and others can remix it, change it, and publish their own version. This flip board is that whole workflow in action, and it worked great. I wrote a blog post about it (the blog is dogfooding: it's just an app hosted on quickish that uses the built-in db lib).

For the HN version of this flip board, I use HN's Firebase API via the built-in quickish server functions, which take advantage of the fact that the front end can get realtime updates from cloud function db updates. Of course that's overkill, but I wanted to show something fun. You can remix and host your own version for free; all you need is a Google OAuth login.
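For readers who want to try the data side, here is a minimal sketch of pulling titles from the public Hacker News Firebase API and shaping them into fixed-width board rows. It is not the quickish implementation: it polls the REST endpoints rather than using realtime updates, and the helper names (`top_titles`, `flip_row`, `http_get_json`) and the 32-character row width are assumptions made for illustration.

```python
import json
from typing import Callable, List
from urllib.request import urlopen

# Public Hacker News API base URL (served from Firebase).
HN_API = "https://hacker-news.firebaseio.com/v0"


def http_get_json(url: str):
    """Fetch a URL and decode its JSON body (needs network access)."""
    with urlopen(url, timeout=10) as resp:
        return json.load(resp)


def top_titles(limit: int,
               get_json: Callable[[str], object] = http_get_json) -> List[str]:
    """Return the titles of the first `limit` top stories.

    `get_json` is injectable so the function can be exercised offline.
    """
    ids = get_json(f"{HN_API}/topstories.json")[:limit]
    titles = []
    for story_id in ids:
        item = get_json(f"{HN_API}/item/{story_id}.json")
        # Deleted items come back as null, so skip anything without a title.
        if item and "title" in item:
            titles.append(item["title"])
    return titles


def flip_row(text: str, width: int = 32) -> str:
    """Uppercase, truncate, and pad a title to one fixed-width board row."""
    return text.upper()[:width].ljust(width)
```

Calling `[flip_row(t) for t in top_titles(10)]` would give ten equal-width rows to hand to whatever renders the flip animation.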

The original flip board I built (Portland-based, current weather): https://popflame.quickish.space/flipboard-preview

Blog post that dives a tiny bit deeper: https://popflame.quickish.space/blog/hacker-news-on-a-split-...

Foreign funds help make housing unaffordable: research

2026-06-26 @ 23:36:30 | Points: 94 | Comments: 29

Om

2026-06-26 @ 23:33:46 | Points: 414 | Comments: 19

Om Malik has died - https://news.ycombinator.com/item?id=48678852 - June 2026 (161 comments)

U.S. allows Anthropic to release Mythos AI to ‘trusted’ US organizations

2026-06-26 @ 22:48:28 | Points: 471 | Comments: 589

Why does kinetic energy increase quadratically, not linearly, with speed? (2011)

2026-06-26 @ 22:43:29 | Points: 267 | Comments: 125

AI in mathematics is forcing big questions

2026-06-26 @ 22:36:51 | Points: 147 | Comments: 111

A C++ implementation of a fast hash map and hash set using hopscotch hashing

2026-06-26 @ 21:18:28 | Points: 98 | Comments: 16

The gap between open weights LLMs and closed source LLMs

2026-06-26 @ 21:14:47 | Points: 230 | Comments: 183

We can still stop California's 3D printer surveillance scheme

2026-06-26 @ 21:13:50 | Points: 420 | Comments: 149

U.S. government will decide who gets to use GPT-5.6

2026-06-26 @ 18:23:14 | Points: 1067 | Comments: 1123

What Is a Nomogram and Why Would It Interest Me?

2026-06-26 @ 17:24:25 | Points: 131 | Comments: 20

Previewing GPT‑5.6 Sol: a next-generation model

2026-06-26 @ 17:06:55 | Points: 1029 | Comments: 648

Show HN: Smart model routing directly in Claude, Codex and Cursor

2026-06-26 @ 16:40:11 | Points: 176 | Comments: 99

https://www.youtube.com/watch?v=isKhAyivtfM.

At Weave, we write most of our code with AI, and it's been getting more expensive. This came to a head when Opus 4.7 was released and, thanks to its tokenizer changes, our costs shot up. We knew we didn't need Opus for everything but we didn't want to lose out on the intelligence for the cases where you really need it. So we decided to build a model router to handle this for us.

The Weave Router acts as an Anthropic/OpenAI endpoint specifically for coding agents. It looks at every inference request and intelligently (more on that in a sec) decides which model to send it to, handling all the translations required along the way. So it can use faster/cheaper models (e.g. DeepSeek V4, GLM 5.2, Kimi K2.6) when possible, and frontier models (Opus 4.8 and GPT 5.5, plus Fable whenever it's back) when necessary.

How do we know which model to route to? We trained an RL model on tens of thousands (so far!) of agent traces. We reward the routing model when it selects an LLM that successfully completes the given task.
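To make the "reward on success" idea concrete, here is a toy sketch. It swaps the trained RL model for a much simpler epsilon-greedy bandit that tracks per-task success rates, so it illustrates the reward signal only and is not how the Weave Router works. The class name `BanditRouter`, the `task_kind` labels, and the model names in the usage note are all hypothetical.

```python
import random
from collections import defaultdict


class BanditRouter:
    """Epsilon-greedy stand-in for a learned routing policy."""

    def __init__(self, models, epsilon=0.1, seed=0):
        self.models = list(models)
        self.epsilon = epsilon            # probability of exploring
        self.rng = random.Random(seed)
        self.wins = defaultdict(int)      # (task_kind, model) -> successes
        self.tries = defaultdict(int)     # (task_kind, model) -> attempts

    def success_rate(self, task_kind, model):
        """Observed fraction of successful completions (0.0 if untried)."""
        n = self.tries[(task_kind, model)]
        return self.wins[(task_kind, model)] / n if n else 0.0

    def choose(self, task_kind):
        """Pick a model: explore at random sometimes, else take the best."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.models)
        return max(self.models,
                   key=lambda m: self.success_rate(task_kind, m))

    def record(self, task_kind, model, succeeded):
        """Reward of 1 when the chosen model completed the task, else 0."""
        self.tries[(task_kind, model)] += 1
        self.wins[(task_kind, model)] += int(succeeded)
```

In use, each finished agent trace would call `record(...)` with whether the task succeeded, and later `choose(...)` calls drift toward models that tend to succeed for that kind of task. A real router would also weigh cost and latency, which this sketch ignores.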

Here's an example: if you ask the router to plan a complex change, it will (probably) route that request to Opus 4.8. Subagents exploring the codebase to gather context will be routed to more suitable models (e.g. DeepSeek V4 Flash). Then, when you have the plan ready to implement, it will most likely be handed to a quicker model (e.g. GLM 5.2) to carry it out.
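The example above can be written down as a fixed lookup table. This is only an illustration of the outcome described, assuming requests arrive already labeled with a role; the actual router makes this decision with a learned model, and the role names and model identifier strings below are hypothetical.

```python
from dataclasses import dataclass

# Hypothetical role -> model mapping mirroring the example in the post.
ROUTES = {
    "plan": "opus-4.8",               # complex planning goes to a frontier model
    "explore": "deepseek-v4-flash",   # context-gathering subagents
    "implement": "glm-5.2",           # carrying out a ready-made plan
}

# Assumption: unknown roles fall back to the strongest model.
DEFAULT_MODEL = "opus-4.8"


@dataclass
class Request:
    role: str     # what the agent is doing, e.g. "plan"
    prompt: str   # the text sent to the model


def route(request: Request) -> str:
    """Return the model identifier for a request, by role."""
    return ROUTES.get(request.role, DEFAULT_MODEL)
```

A static table like this cannot adapt when a cheap model starts failing a class of tasks, which is the gap the learned policy described in the post is meant to fill.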

We've been using this internally for the last month or so. We've saved 40% on tokens vs. what we otherwise would have paid, with no noticeable differences in quality or velocity.

The router is source-available under Elastic License 2.0, so you can self-host it. Or if you prefer, you can also use our hosted version: weaverouter.com.

I'll be here to answer any questions you may have!

Ultrasound imaging of the brain

2026-06-26 @ 11:51:11 | Points: 291 | Comments: 114

Faster KNN search in Manticore: 2-pass HNSW, batched distances, and AVX-512

2026-06-26 @ 04:13:28 | Points: 23 | Comments: 1

Long Wave radio era set to end with switch-off

2026-06-25 @ 18:42:17 | Points: 62 | Comments: 73

The US Army Issued Ocarinas to Soldiers in World War II

2026-06-25 @ 07:26:06 | Points: 21 | Comments: 9

Linux on Older Hardware: The Complete Revival Guide

2026-06-25 @ 04:06:29 | Points: 87 | Comments: 37

Jest/Vitest interactive course (runs in the browser)

2026-06-24 @ 21:09:07 | Points: 14 | Comments: 7

Fusion Programming Language

2026-06-24 @ 11:27:07 | Points: 87 | Comments: 37

SCC Technical Assistance Program

2026-06-24 @ 08:46:04 | Points: 20 | Comments: 1

MicroVMs: Run isolated sandboxes with full lifecycle control

2026-06-23 @ 09:39:35 | Points: 342 | Comments: 188

Hellishly Slow Level 13 Deflate Compression

2026-06-22 @ 18:17:20 | Points: 70 | Comments: 20
