Hacker News
Latest
The Dating App Swipe Is Dying. What Comes Next May Be Worse
2026-05-14 @ 12:05:13Points: 18Comments: 50
Ryan Cohen hits back at eBay, says his takeover proposal should not be dismissed
2026-05-14 @ 11:26:58Points: 8Comments: 9
Myths about /dev/urandom
2026-05-14 @ 11:26:22Points: 39Comments: 24
They Said It Would Cost $54M. We Said "No Thanks."
2026-05-14 @ 11:05:05Points: 58Comments: 59
Show HN: Running the second public ODoH relay
2026-05-14 @ 10:44:50Points: 76Comments: 23
Classic 7 is a Windows 10 LTSC mod to look 1:1 to Windows 7
2026-05-14 @ 07:03:18Points: 122Comments: 121
Claude for Small Business
2026-05-14 @ 03:59:35Points: 406Comments: 372
A Claude Code and Codex Skill for Deliberate Skill Development
2026-05-14 @ 03:13:31Points: 137Comments: 24
Show HN: Nibble
2026-05-14 @ 01:46:45Points: 76Comments: 21
Cisco workforce reductions
2026-05-14 @ 01:38:05Points: 241Comments: 260
Avoiding and reducing microplastic false positives from dry glove contact
2026-05-14 @ 01:13:02Points: 80Comments: 32
Scorched Earth 2000 – Web
2026-05-14 @ 00:41:29Points: 320Comments: 132
MacBook Neo Deep Dive: Benchmarks, Wafer Economics, and the 8GB Gamble
2026-05-13 @ 18:30:51Points: 278Comments: 332
Launch HN: Ardent (YC P26) – Postgres sandboxes in seconds with zero migration
2026-05-13 @ 16:54:03Points: 94Comments: 39
In the last two years coding agents have gotten dramatically more capable at handling complex engineering tasks. But without access to a realistic sandbox at the DB layer for testing, they ship garbage that can take down production databases. I spent over a year building an AI Data Engineer that failed for this exact reason. Evan spent the last 12 years in data engineering and hit this wall building agents at his last company.
Ardent was built to make it possible for coding agents to get near instant access to production-like sandboxes so they can test their work. To do this we write a replication stream out of the target DB, scaling with kafka onto a read replica with copy on write enabled and autoscaling compute (we currently prefer neon as a primary branching engine due to their implementation of these properties).
Our replication stream uses logical replication + ddl triggers to enable usage on any hosted postgres DB since most platforms do not allow physical replication which is traditionally used for creating replicas.
This provides a few primary benefits:
1. Does not require a platform migration to a DB provider like neon, allowing strong separation of production and development concerns. 2. Minimal impact on the production database while allowing clones to spin up in <6s, even at TB scale with copy-on-write
Security matters a lot with cloning production so we run a proxy layer to generate custom postgres URLs and route all connections to allow more granular access control to clones, prevent credential leak, and follow a split plane architecture to allow full data residency on your cloud through BYOC.
We also support anonymization through the ability to register SQL that runs on branches before they are returned. This has been used for PII redaction and branch modification.
Our goal is to make every data infrastructure platform “cloneable” in one place so agents can fully test the impact of their changes on production like data environments without risk.
Here's a demo of it: https://youtu.be/5S1kwPtiRU0
We’d love to understand how you work with coding agents on the DB and if you try Ardent (it's free to get started) what worked, what broke and what’s missing.
Setting up a free *.city.state.us locality domain (2025)
2026-05-13 @ 14:45:18Points: 589Comments: 188
The US is winning the AI race where it matters most: commercialization
2026-05-13 @ 13:53:53Points: 223Comments: 625
The Emacsification of Software
2026-05-13 @ 07:06:14Points: 354Comments: 220
Twin brothers wipe 96 government databases minutes after being fired
2026-05-12 @ 22:28:07Points: 466Comments: 394
Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model
2026-05-12 @ 18:03:11Points: 697Comments: 198
We were always frustrated by the little effort made towards building agentic models that run on budget phones, so we conducted investigations that led to an observation: agentic experiences are built upon tool calling, and massive models are overkill for it. Tool calling is fundamentally retrieval-and-assembly (match query to tool name, extract argument values, emit JSON), not reasoning. Cross-attention is the right primitive for this, and FFN parameters are wasted at this scale.
Simple Attention Networks: the entire model is just attention and gating, no MLPs anywhere. Needle is an experimental run for single-shot function calling for consumer devices (phones, watches, glasses...).
Training: - Pretrained on 200B tokens across 16 TPU v6e (27 hours) - Post-trained on 2B tokens of synthesized function-calling data (45 minutes) - Dataset synthesized via Gemini with 15 tool categories (timers, messaging, navigation, smart home, etc.)
You can test it right now and finetune on your Mac/PC: https://github.com/cactus-compute/needle
The full writeup on the architecture is here: https://github.com/cactus-compute/needle/blob/main/docs/simp...
We found that the "no FFN" finding generalizes beyond function calling to any task where the model has access to external structured knowledge (RAG, tool use, retrieval-augmented generation). The model doesn't need to memorize facts in FFN weights if the facts are provided in the input. Experimental results to published.
While it beats FunctionGemma-270M, Qwen-0.6B, Granite-350M, LFM2.5-350M on single-shot function calling, those models have more scope/capacity and excel in conversational settings. We encourage you to test on your own tools via the playground and finetune accordingly.
This is part of our broader work on Cactus (https://github.com/cactus-compute/cactus), an inference engine built from scratch for mobile, wearables and custom hardware. We wrote about Cactus here previously: https://news.ycombinator.com/item?id=44524544
Everything is MIT licensed. Weights: https://huggingface.co/Cactus-Compute/needle GitHub: https://github.com/cactus-compute/needle