265 points · DuffJohnson · 15 hours ago
pdfa.org
anigbrowl
ted_bunny
yonatan8070
waynenilsen
Hopefully someone is independently archiving all the documents.
My understanding is that some are being removed.
embedding-shape
JKCalhoun
The OCR is so bad, of course, that decoding the Base64 seems futile without a lot of effort.
Example: https://www.justice.gov/epstein/files/DataSet%2011/EFTA02609...
(More mentioned here: https://old.reddit.com/r/Epstein/comments/1qu9az2/theres_unr...)
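One reason OCR'd Base64 is so hard to recover: the characters OCR tends to confuse (l/1, O/0) are all legal Base64 symbols, so a misread still decodes cleanly and just yields wrong bytes. A minimal sketch of that effect follows. The confusion pairs, the helper names, and the stand-in payload are assumptions for illustration, not anything taken from the released files.

```python
import base64

# Character pairs an OCR engine plausibly confuses (an assumed, illustrative list).
# Every character here is a legal Base64 symbol, so the swaps pass validation.
OCR_CONFUSIONS = str.maketrans("l1O0", "1l0O")


def simulate_ocr_misread(encoded: str) -> str:
    """Swap look-alike characters to mimic OCR mistakes (hypothetical helper)."""
    return encoded.translate(OCR_CONFUSIONS)


def count_corrupted_bytes(original: bytes, ocr_text: str) -> int:
    """Strictly decode the OCR'd Base64 and count bytes that differ from the original."""
    # validate=True raises binascii.Error on illegal symbols or bad padding,
    # but it cannot notice a legal symbol that is simply the wrong one.
    decoded = base64.b64decode(ocr_text, validate=True)
    return sum(a != b for a, b in zip(original, decoded))


if __name__ == "__main__":
    payload = bytes(range(256))  # stand-in for an attachment; not real data
    clean = base64.b64encode(payload).decode("ascii")
    misread = simulate_ocr_misread(clean)
    print("characters changed:", sum(a != b for a, b in zip(clean, misread)))
    print("bytes corrupted:", count_corrupted_bytes(payload, misread))
```

Because nothing fails loudly, you only find out by checking the decoded bytes against something you know about the expected file (a magic number, a checksum, a parser that accepts it).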
originalvichy
Beijinger
Who paid him?
Who did get paid?
_def
nkozyra
Maybe I'm underestimating the full extent of the issue, but isn't this a very lightweight problem to solve? Is converting the images to lower-DPI formats/versions really any easier than just stripping the metadata? Surely the DOJ and similar justice agencies have been aware of this and doing it for decades at this point, right?
bugeats
shevy-java
Some of the gathered data is shown here, right? Probably not all.
Now ... that's static information though. That's not really an analysis, most definitely not an independent (open ended) analysis. And it will only show a very incomplete part of the full picture.
This is why I think the "release the files" movement, as good as it is, seems incomplete. I'd rather know a lot more about how they operate their networks and get away with it despite the involvement of underage women. How about the secret services of other countries? Should that not also be highly important? So why is there not a larger investigation as well as independent analysis? Those .pdf files alone cannot tell the whole picture. They may be just the tip of the iceberg, and it evidently involves other countries too, with Prince Andrew being the most famous example here (i.e., the UK, but we already saw that other countries have similar issues, where people suddenly had to step away from politics when it was found out they had visited the party locations of Mr. Epstein).
corygarms
tibbon
(But seriously, great work here!)
mmooss
Is the scope at least limited somehow? Generally I favor transparency, but of course probably the most important parts are withheld.
meidan_y
NoToP
There are also other documents that appear to simulate a scanned document but completely lack the “real-world noise” expected with physical paper-based workflows. The much crisper images appear almost perfect without random artifacts or background noise, and with the exact same amount of image skew across multiple pages. Thanks to the borders around each page of text, page skew can easily be measured, such as with VOL00007\IMAGES\0001\EFTA00009229.pdf. It is highly likely these PDFs were created by rendering original content (from a digital document) to an image (e.g., via print to image or save to image functionality) and then applying image processing such as skew, downscaling, and color reduction.
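The skew comparison described above can be sketched roughly as follows. This assumes you have already extracted (x, y) pixel coordinates along the top border line of each page (that extraction step is not shown), and the function names and the 0.01 degree tolerance are hypothetical choices, not the method used in the original analysis.

```python
import math


def estimate_skew_degrees(edge_points):
    """Fit a least-squares line through (x, y) points sampled along a
    nominally horizontal border and return its angle in degrees."""
    n = len(edge_points)
    if n < 2:
        raise ValueError("need at least two points to fit a line")
    mean_x = sum(x for x, _ in edge_points) / n
    mean_y = sum(y for _, y in edge_points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in edge_points)
    if sxx == 0:
        raise ValueError("points must span more than one x position")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in edge_points)
    return math.degrees(math.atan(sxy / sxx))


def skew_is_uniform(page_skews, tolerance_deg=0.01):
    """True when every page's skew falls within tolerance_deg of the others.
    The tolerance is an assumed value; tune it for your scan resolution."""
    return max(page_skews) - min(page_skews) <= tolerance_deg


if __name__ == "__main__":
    # Synthetic border tilted by exactly 0.5 degrees, sampled every 100 px.
    slope = math.tan(math.radians(0.5))
    border = [(x, x * slope) for x in range(0, 1001, 100)]
    print(round(estimate_skew_degrees(border), 4))
```

Physically scanned pages would normally show slightly different angles from sheet to sheet, so a set of pages where `skew_is_uniform` holds at a tight tolerance is consistent with a skew applied in software.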