Skip to main content

r/DataHoarder

members
online

Instagram deleted massive archive of journalist, Saleh who was murdered in Gaza almost immediately after he was confirmed dead. Any instagram hoarders? Instagram deleted massive archive of journalist, Saleh who was murdered in Gaza almost immediately after he was confirmed dead. Any instagram hoarders?
Backup

Epstein Files - For Real Epstein Files - For Real
Scripts/Software

A few hours ago there was a post about processing the Epstein files into something more readable, collated and what not. Seemed to be a cash grab.

I have now processed 20% of the files, in 4 hours, and uploaded to GitHub, including transcriptions, a statically built and searchable site, the code that processes them (using a self hosted installation of llama 4 maverick VLM on a very big server. I’ll push the latest updates every now and then as more documents are transcribed and then I’ll try and get some dedupe.

It processes and tries to restore documents into a full document from the mixed pages - some have errored, but will capture them and come back to fix.

I haven’t included the original files - save space on GitHub - but all json transcriptions are readily available.

If anyone wants to have a play, poke around or optimise - feel free

Total cost, $0. Total hosting cost, $0.

Not here to make a buck, just hoping to collate and sort through all these files in an efficient way for everyone.

https://epstein-docs.github.io

https://github.com/epstein-docs/epstein-docs.github.io

magnet:?xt=urn:btih:5158ebcbbfffe6b4c8ce6bd58879ada33c86edae&dn=epstein-docs.github.io&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce


The everything app, for work. Get everyone working in a single platform designed to manage any type of work.

Trusted by 3 million+ teams, try ClickUp for free today!

The everything app, for work. Get everyone working in a single platform designed to manage any type of work.



Morty Proxy This is a proxified and sanitized view of the page, visit original site.