It's not fast and it's not pretty, but it works.
Hacker News
Latest
Medicare's new payment model is built for AI. Most of the tech world has no idea
2026-05-13 @ 21:24:14Points: 50Comments: 28
Meta won't let you block its AI account on Threads
2026-05-13 @ 20:22:21Points: 107Comments: 40
In-person examinations at Princeton will be proctored starting July 1
2026-05-13 @ 20:12:16Points: 240Comments: 342
Rars: a Rust RAR implementation, mostly written by LLMs
2026-05-13 @ 19:59:35Points: 79Comments: 66
Making the news available at no cost is a victory
2026-05-13 @ 19:14:47Points: 107Comments: 110
"Not Medically Necessary": Helping America's Health Insurers Deny Coverage
2026-05-13 @ 19:01:54Points: 173Comments: 161
MacBook Neo Deep Dive: Benchmarks, Wafer Economics, and the 8GB Gamble
2026-05-13 @ 18:30:51Points: 137Comments: 142
A sentimental tour of late 1990s and early 2000s hacking tools
2026-05-13 @ 18:25:09Points: 44Comments: 16
Launch HN: Ardent (YC P26) – Postgres sandboxes in seconds with zero migration
2026-05-13 @ 16:54:03Points: 64Comments: 26
In the last two years coding agents have gotten dramatically more capable at handling complex engineering tasks. But without access to a realistic sandbox at the DB layer for testing, they ship garbage that can take down production databases. I spent over a year building an AI Data Engineer that failed for this exact reason. Evan spent the last 12 years in data engineering and hit this wall building agents at his last company.
Ardent was built to make it possible for coding agents to get near instant access to production-like sandboxes so they can test their work. To do this we write a replication stream out of the target DB, scaling with kafka onto a read replica with copy on write enabled and autoscaling compute (we currently prefer neon as a primary branching engine due to their implementation of these properties).
Our replication stream uses logical replication + ddl triggers to enable usage on any hosted postgres DB since most platforms do not allow physical replication which is traditionally used for creating replicas.
This provides a few primary benefits:
1. Does not require a platform migration to a DB provider like neon, allowing strong separation of production and development concerns. 2. Minimal impact on the production database while allowing clones to spin up in <6s, even at TB scale with copy-on-write
Security matters a lot with cloning production so we run a proxy layer to generate custom postgres URLs and route all connections to allow more granular access control to clones, prevent credential leak, and follow a split plane architecture to allow full data residency on your cloud through BYOC.
We also support anonymization through the ability to register SQL that runs on branches before they are returned. This has been used for PII redaction and branch modification.
Our goal is to make every data infrastructure platform “cloneable” in one place so agents can fully test the impact of their changes on production like data environments without risk.
Here's a demo of it: https://youtu.be/5S1kwPtiRU0
We’d love to understand how you work with coding agents on the DB and if you try Ardent (it's free to get started) what worked, what broke and what’s missing.
S-100 Virtual Workbench
2026-05-13 @ 15:52:00Points: 98Comments: 20
Open Source Resistance: keep OSS alive on company time
2026-05-13 @ 15:13:37Points: 245Comments: 78
Setting up a free *.city.state.us locality domain (2025)
2026-05-13 @ 14:45:18Points: 483Comments: 154
The US is winning the AI race where it matters most: commercialization
2026-05-13 @ 13:53:53Points: 162Comments: 461
Leaving GitHub for Forgejo
2026-05-13 @ 12:54:00Points: 524Comments: 281
Substrate (YC S24) Is Hiring a Technical Success Manager
2026-05-13 @ 12:00:30Points: 1
I moved my digital stack to Europe
2026-05-13 @ 11:42:20Points: 882Comments: 536
The Emacsification of Software
2026-05-13 @ 07:06:14Points: 187Comments: 124
Twin brothers wipe 96 government databases minutes after being fired
2026-05-12 @ 22:28:07Points: 296Comments: 225
Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model
2026-05-12 @ 18:03:11Points: 638Comments: 182
We were always frustrated by the little effort made towards building agentic models that run on budget phones, so we conducted investigations that led to an observation: agentic experiences are built upon tool calling, and massive models are overkill for it. Tool calling is fundamentally retrieval-and-assembly (match query to tool name, extract argument values, emit JSON), not reasoning. Cross-attention is the right primitive for this, and FFN parameters are wasted at this scale.
Simple Attention Networks: the entire model is just attention and gating, no MLPs anywhere. Needle is an experimental run for single-shot function calling for consumer devices (phones, watches, glasses...).
Training: - Pretrained on 200B tokens across 16 TPU v6e (27 hours) - Post-trained on 2B tokens of synthesized function-calling data (45 minutes) - Dataset synthesized via Gemini with 15 tool categories (timers, messaging, navigation, smart home, etc.)
You can test it right now and finetune on your Mac/PC: https://github.com/cactus-compute/needle
The full writeup on the architecture is here: https://github.com/cactus-compute/needle/blob/main/docs/simp...
We found that the "no FFN" finding generalizes beyond function calling to any task where the model has access to external structured knowledge (RAG, tool use, retrieval-augmented generation). The model doesn't need to memorize facts in FFN weights if the facts are provided in the input. Experimental results to published.
While it beats FunctionGemma-270M, Qwen-0.6B, Granite-350M, LFM2.5-350M on single-shot function calling, those models have more scope/capacity and excel in conversational settings. We encourage you to test on your own tools via the playground and finetune accordingly.
This is part of our broader work on Cactus (https://github.com/cactus-compute/cactus), an inference engine built from scratch for mobile, wearables and custom hardware. We wrote about Cactus here previously: https://news.ycombinator.com/item?id=44524544
Everything is MIT licensed. Weights: https://huggingface.co/Cactus-Compute/needle GitHub: https://github.com/cactus-compute/needle