Hacker News
Latest
Orthodox C++
2026-06-13 @ 13:58:46Points: 36Comments: 14
US bans differential privacy in Census data
2026-06-13 @ 13:54:56Points: 211Comments: 74
Treating pancreatic tumours may have revealed cancer's master switch
2026-06-13 @ 13:34:00Points: 63Comments: 11
Introduction to the experience of rendering Arabic typography&its technical debt
2026-06-13 @ 12:40:16Points: 55Comments: 7
An Interview with Intel's Kira Boyko: Xeon 6's Product Director
2026-06-13 @ 12:20:05Points: 35Comments: 1
Sam Bankman-Fried loses bid to appeal against fraud conviction in FTX case
2026-06-13 @ 12:17:27Points: 55Comments: 32
Show HN: 2 Weeks of Hallucinate – The Photo Gallery
2026-06-13 @ 12:11:35Points: 54Comments: 15
AI OSS tool repo goes archived over night after raising $7.3M Seed
2026-06-13 @ 12:10:47Points: 157Comments: 103
Arch Linux Now Believes Malware Incident Under Control: More Than 1,500 Packages
2026-06-13 @ 11:55:30Points: 174Comments: 90
Every Frame Perfect
2026-06-13 @ 11:40:20Points: 117Comments: 19
RTX 5080 and RTX 3090 Setup: 80 Tok/s on Qwen 3.6 27B Q8
2026-06-13 @ 09:55:32Points: 61Comments: 17
Show HN: Paca – Lightweight Jira alternative for human-AI collaboration
2026-06-13 @ 09:44:25Points: 83Comments: 29
A low-carbon computing platform from your retired phones
2026-06-13 @ 09:38:32Points: 160Comments: 77
Leaving Mozilla
2026-06-13 @ 05:57:14Points: 392Comments: 228
Shepherd's Dog: A Game by the Most Dangerous AI Model
2026-06-13 @ 05:44:46Points: 138Comments: 109
There is a shadow hanging over this Fable thing
2026-06-13 @ 05:16:41Points: 413Comments: 386
Open source AI must win
2026-06-13 @ 02:14:24Points: 1328Comments: 414
Statement on US government directive to suspend access to Fable 5 and Mythos 5
2026-06-13 @ 00:51:30Points: 2868Comments: 2093
Show HN: Putt.day a daily mini golf game
2026-06-12 @ 22:56:06Points: 251Comments: 98
Twenty One Zero-Days in FFmpeg
2026-06-12 @ 22:13:29Points: 265Comments: 173
Electric motors with no rare earths
2026-06-12 @ 22:08:03Points: 625Comments: 178
Swift at Apple: Migrating the TrueType hinting interpreter
2026-06-12 @ 19:54:27Points: 226Comments: 109
How to setup a local coding agent on macOS
2026-06-12 @ 17:34:55Points: 435Comments: 110
CRISPR tech selectively shreds cancer cells, including "undruggable" cancers
2026-06-12 @ 15:15:24Points: 925Comments: 203
Malware developers added nuclear and biological weapons text to to their spyware
2026-06-11 @ 20:24:18Points: 431Comments: 228
H.R. 6028 would fundamentally change the U.S. Copyright Office
2026-06-11 @ 00:00:42Points: 257Comments: 102
Show HN: I am building a map of people who lived in the Roman Empire
2026-06-10 @ 19:28:49Points: 59Comments: 14
There are databases where Classicists have done this manually for specific regions, Trismegistos https://www.trismegistos.org/ and Latin Inscriptions of the Roman Empire (LIRE) https://pure.au.dk/portal/en/publications/latin-inscriptions... are two major efforts I found. But there doesn't seem to be a project that did what I set out to do, although I have read in some places that it was believed to be possible.
I am not a classicist or a web developer, but I have Claude and Gemini and I can sort of read basic Latin - so I set to work. I used LIRE and another database as ground truth and built a pipeline to extract and process the inscriptions to recover the names. The process I developed uses a high end LLM like Sonnet or Gemini Pro to supervise the extraction and tuning process on a regional basis until the obvious error rate is reasonable. For this, so far, reasonable to me means less than 1-2% in the smaller initial samples of 100-500 and no observed systemic issues. The different regions often need different prompts, so this basically became an exercise in letting the higher level AI tune the prompt for the lower level AI. The extraction when measured against LIRE produces an F1 score between 0.64 and 0.87, but take this with a grain of salt.
Once I had done a few regions, I wanted to see the work, so I threw together a pretty crude website but as I am not a web developer, it was crude in how it accessed its data. It does look cool and I also added summarization, and machine translation to each entry. I wanted to eventually get feedback from an actual team of classicists and make the website work better, so I am rewriting it as we speak but it is broadly functional now with a few extra bugs but substantially improved performance compared to the old one. All entries link back to the proper sources, and the old web app linked to several additional sources where the data was present, but I haven't gotten that working again just yet on the new one. (The old web interface is still available at https://roman-names.com, but I will warn you it is clunky and not mobile friendly at all)
Key findings so far:
AI supervised AI extraction saved me time. I was manually tuning things for a while and then the runbook became an idea that I feed my instructions in and let the big AI go with sparse oversight from me.
The extraction improved significantly (by about 10 F1 points) when I fed the model the raw text including the markers, vs a cleaned up version of the text.
I just thought it was a cool little project and wanted to share. If you happen to work in any adjacent space and there is something I could do better etc let me know.