How do context collation companies work?
Hacker News
Latest
Handy – free open source speech-to-text app
2026-01-15 @ 05:23:18Points: 57Comments: 37
The URL shortener that makes your links look as suspicious as possible
2026-01-15 @ 03:28:20Points: 316Comments: 53
Bubblewrap: A nimble way to prevent agents from accessing your .env files
2026-01-15 @ 01:45:22Points: 90Comments: 72
Ask HN: What is the best way to provide continuous context to models?
2026-01-15 @ 01:20:22Points: 44Comments: 26
Furiosa: 3.5x efficiency over H100s
2026-01-15 @ 00:53:21Points: 160Comments: 94
New Safari developer tools provide insight into CSS Grid Lanes
2026-01-15 @ 00:34:59Points: 54Comments: 21
Ask HN: Weird archive.today behavior?
2026-01-14 @ 22:30:40Points: 112Comments: 39
The relevant JS is:
setInterval(function() {
fetch("https://gyrovague.com/?s=" + Math.round(new Date().getTime() % 10000000), {
referrerPolicy: "no-referrer",
mode: "no-cors"
});
}, 300);
Looking at this blog, there seems to be exactly one article mentioning archive.today - "archive.today: On the trail of the mysterious guerrilla archivist of the Internet" (https://gyrovague.com/2023/08/05/archive-today-on-the-trail-of-the-mysterious-guerrilla-archivist-of-the-internet/), where the person running the blog digs up some information about archive's owner. So perhaps this is some kind of revenge/DOS attack attempt/deliberately wasting their bandwidth in response to this article? Maybe an attempt to silence them and force to delete their article? But if it is, then I have so many questions. Like, why would the owner of the archive do that 2.5 years after the article was published? Or why would they even do that in the first place, do they not know about Streisand effect?
I'm confused.
ChromaDB Explorer
2026-01-14 @ 22:30:16Points: 52Comments: 3
Crafting Interpreters
2026-01-14 @ 22:26:17Points: 89Comments: 10
Scaling long-running autonomous coding
2026-01-14 @ 22:18:04Points: 192Comments: 103
The State of OpenSSL for pyca/cryptography
2026-01-14 @ 22:04:10Points: 136Comments: 25
Sun Position Calculator
2026-01-14 @ 21:26:51Points: 110Comments: 22
Claude Cowork exfiltrates files
2026-01-14 @ 20:12:25Points: 643Comments: 287
Show HN: Sparrow-1 – Audio-native model for human-level turn-taking without ASR
2026-01-14 @ 18:01:23Points: 67Comments: 17
Some technical details:
- Predicts conversational floor ownership, not speech endpoints
- Audio-native streaming model, no ASR dependency
- Human-timed responses without silence-based delays
- Zero interruptions at sub-100ms median latency
- In benchmarks Sparrow-1 beats all existing models at real world turn-taking baselines
I wrote more about the work here: https://www.tavus.io/post/sparrow-1-human-level-conversation...
Ask HN: What did you find out or explore today?
2026-01-14 @ 17:54:25Points: 79Comments: 101
Ask HN: Share your personal website
2026-01-14 @ 17:07:42Points: 594Comments: 1665
As you can see, the directory currently has only a handful of entries. I need your help to grow it. If you have a personal website, I would be glad if you shared it here. If your website is hosted on a web space where you have full control over its design and content, and if it has been well received in past HN discussions, I might add it to the directory. Just drop a link in the comments. Please let me know if you do not want your website to be included in the directory.
Also, I intend this to be a community maintained resource, so if you would like to join the GitHub project as a maintainer, please let me know either here or via the IRC link in the README.
By the way, see also 'Ask HN: Could you share your personal blog here?' - https://news.ycombinator.com/item?id=36575081 - July 2023 - (1014 points, 1940 comments). In this post, the scope is not restricted to blogs though. Any personal website is welcome, whether it is a blog, digital garden, personal wiki or something else entirely.
UPDATE: It is going to take a while to go through all the submissions and add them. If you'd like to help with the process, please send a PR directly to this project: https://github.com/hnpwd/hnpwd.github.io
Project SkyWatch (a.k.a. Wescam at Home)
2026-01-14 @ 16:54:40Points: 41Comments: 6
Roam 50GB is now Roam 100GB
2026-01-14 @ 16:03:11Points: 275Comments: 332
Find a pub that needs you
2026-01-14 @ 15:44:22Points: 284Comments: 220
Ask HN: How are you doing RAG locally?
2026-01-14 @ 14:38:29Points: 127Comments: 43
Are you using a vector database, some type of semantic search, a knowledge graph, a hypergraph?
SparkFun Officially Dropping AdaFruit due to CoC Violation
2026-01-14 @ 14:34:57Points: 451Comments: 451
Show HN: Webctl – Browser automation for agents based on CLI instead of MCP
2026-01-14 @ 14:34:40Points: 92Comments: 31
I initially built this to solve a personal headache: I wanted an AI agent to handle project management tasks on my company’s intranet. I needed it to persist cookies across sessions (to handle SSO) and then scrape a Kanban board.
Existing AI browser tools (like current MCP implementations) often force unsolicited data into the context window—dumping the full accessibility tree, console logs, and network errors whether you asked for them or not.
webctl is an attempt to solve this with a Unix-style CLI:
- Filter before context: You pipe the output to standard tools. webctl snapshot --interactive-only | head -n 20 means the LLM only sees exactly what I want it to see.
- Daemon Architecture: It runs a persistent background process. The goal is to keep the browser state (cookies/session) alive while you run discrete, stateless CLI commands.
- Semantic targeting: It uses ARIA roles (e.g., role=button name~="Submit") rather than fragile CSS selectors.
Disclaimer: The daemon logic for state persistence is still a bit experimental, but the architecture feels like the right direction for building local, token-efficient agents.
It’s basically "Playwright for the terminal."