Show HN: OSS AI agent that indexes and searches the Epstein files (trynia.ai)
68 points by jellyotsiro 6 hours ago | 18 comments
Hi HN,

I built an open-source AI agent that has already indexed and can search the entire Epstein files, roughly 100M words of publicly released documents.

The goal was simple: make a large, messy corpus of PDFs and text files immediately searchable in a precise way, without relying on keyword search or bloated prompts.

What it does:

- The full dataset is already indexed
- You can ask natural language questions
- Answers are grounded and include direct references to source documents
- Supports both exact text lookup and semantic search
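To make the last bullet concrete, here is a minimal sketch of the difference between exact text lookup and semantic search. It is not the project's implementation: the `DOCS` corpus is a hypothetical placeholder, and `embed` is a toy bag-of-words stand-in for a real embedding model, used only so the example runs with the standard library.

```python
import math
import re
from collections import Counter

# Hypothetical placeholder corpus; a real index would hold the released documents.
DOCS = {
    "doc-001": "The committee approved the harbor budget in March.",
    "doc-002": "Budget talks for the harbor stalled after the vote.",
    "doc-003": "A recipe for lemon cake with fresh berries.",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def exact_lookup(phrase):
    """Return ids of documents that contain the phrase verbatim (case-insensitive)."""
    needle = phrase.lower()
    return [doc_id for doc_id, text in DOCS.items() if needle in text.lower()]


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(tokenize(text))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def semantic_search(query, top_k=2):
    """Rank documents by similarity to the query and return (doc_id, score) pairs."""
    q = embed(query)
    scored = [(cosine(q, embed(text)), doc_id) for doc_id, text in DOCS.items()]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(doc_id, round(score, 3)) for score, doc_id in scored[:top_k] if score > 0]


print(exact_lookup("harbor budget"))
print(semantic_search("harbor budget vote"))
```

Exact lookup only matches documents containing the literal phrase, while the similarity ranking also surfaces documents that share terms in a different order. Returning document ids alongside scores is what lets an answer cite its sources.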

Discussion around these files is often fragmented. This makes it possible to explore the primary sources directly and verify claims without manually digging through thousands of pages.

Happy to answer questions or go into technical details.

Code: https://github.com/nozomio-labs/nia-epstein-ai





I keep thinking that the lack of children’s faces in the blacked out rectangles make the files much less shocking. I wonder if AI could put back fake images to make clearer to people how sick all this is.

Those are going to be some spicy hallucinations.

Is it able to handle a much larger dataset? Only a tiny fraction of the data has been released, from what it looks like.

And what did you learn?

Trump famously told New York Magazine in 2002: "I've known Jeff for 15 years. Terrific guy. He's a lot of fun to be with. It is even said that he likes beautiful women as much as I do, and many of them are on the younger side."

Trump and Epstein were social acquaintances in Palm Beach and New York circles during the 1990s and early 2000s. They socialized together at Mar-a-Lago and other venues.


Interesting. It is my impression that almost everyone globally already knew this. What else did you learn?

I'll take about an hour in the evening to dive deeper. I was never familiar with the Epstein stuff until I built the agent to simplify things for me.

This is one of the most widely quoted phrases by Trump on the topic of Epstein.

In 2024, Trump used Epstein's former private jet for campaign appearances

Does this work with vector embeddings?

it uses semantic search so yes

This is a good idea. One thing I never understand about these kinds of projects though: why are the standard questions provided to the user as prompts never cached?

Outputs are usually generated with random sampling, so the same prompt may get different outputs.
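For the fixed suggested prompts, caching is still possible if you accept pinning one sampled answer per prompt. A minimal sketch, assuming a hypothetical `generate` callable that maps a prompt to an answer; `sampled_generate` below is only a stand-in that mimics varying output and is not any real model API.

```python
import hashlib
import random


def normalize(prompt):
    """Collapse case and whitespace so trivially different prompts share a key."""
    return " ".join(prompt.lower().split())


class CachedAnswers:
    """Stores the first answer generated for each normalized prompt."""

    def __init__(self, generate):
        self._generate = generate  # hypothetical callable: prompt -> answer text
        self._store = {}
        self.misses = 0

    def ask(self, prompt):
        key = hashlib.sha256(normalize(prompt).encode("utf-8")).hexdigest()
        if key not in self._store:
            # Cache miss: call the generator once and pin that sampled output.
            self.misses += 1
            self._store[key] = self._generate(prompt)
        return self._store[key]


def sampled_generate(prompt):
    """Stand-in for a sampling LLM call: the output varies between calls."""
    return f"answer-{random.randint(0, 10**9)} to {prompt!r}"


cache = CachedAnswers(sampled_generate)
first = cache.ask("What is in the index?")
second = cache.ask("  what is IN the index? ")
print(first == second, cache.misses)
```

The trade-off is that every user sees the same pinned answer for a suggested prompt, so the cache would need to be cleared whenever the index or the model changes.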

Oh, forgot about it, thanks. It's just a fun project I built in a couple of hours, so I didn't really sweat it haha.

This agent is really interesting! Learning a lot. Thanks!

> can search the entire Epstein files

It's worth noting that only about 1% of the files have been released, according to the DOJ.

Of the released files, many have redactions.


If the Lake Michigan thing is just in the first 1%, then whatever's in the other 99% is going to be absolutely disgusting.

Sorry, all publicly available files*


