Remix.run Logo
NoMoreNicksLeft 19 hours ago

Could have sworn they did this years ago. I even have the first 80 years or whatever on DVD in the closet.

throwup238 15 hours ago | parent | next [-]

Normally when laymen say "digitized" they mean one of two things: scanned images in a PDF or fully transcribed (and possible formatted) text extracted from the scan. The Complete New Yorker you're thinking of was mostly the former, with a bit of indexing (table of contents pointing to the PDFs if I remember correctly).

This latest digitization project does the latter, transcribing the text into their existing content management system and as far as I can tell, preserving much of the formatting. This comes with full text search, allows cross linking between articles, and all that good stuff.

I suspect that since they include an LLM summary and started this digitization project in early 2024, this was enabled by LLMs.

smelendez 18 hours ago | parent | prev | next [-]

If I’m reading this correctly, they now have all their historic articles loaded into their CMS. I think they previously just had a system where you could page (and maybe search?) through scans of old issues, which is also cool but not as versatile.

ghaff 19 hours ago | parent | prev [-]

When a lot of content was being put out on CD/DVD, a number of publications did but they are not straightforwardly accessible these days because they're usually on an old version of Windows. (Yes, if you want to make a project of it, you can probably get into them but has never been worth it for me.)

haunter 18 hours ago | parent | next [-]

Usually Windows/Wine is the much better case than the old Mac apps (32bit, PPC etc) in the age of Apple Silcon

https://old.reddit.com/r/thenewyorker/comments/1jlhrve/instr...

Breaking the DJVU DRM would be the perfect solution though

qingcharles 16 hours ago | parent [-]

It has been broken. I actually have the set on my desk ready to rip, I just couldn't find my USB DVD drive.

Here's a link to the guy that broke it:

https://github.com/reconSuave/PlayboyPDF/

mekael 16 hours ago | parent | prev | next [-]

Surprisingly, this has been a project I’ve been tinkering with for years. There is an easy way to get the raw png/jpeg files out, but it does require a windows box. Im planning on working on it more over the long holiday.

zorked 18 hours ago | parent | prev | next [-]

I think the disc release GP is talking about had files in DjVu format.

Tomte 16 hours ago | parent [-]

Encrypted DjVu, and the viewer doesn‘t run on modern Windows.

medler 10 hours ago | parent [-]

It runs great on windows 11. The install took a long time but I didn’t have to do anything special to make it work

Tomte 5 hours ago | parent [-]

Maybe we have different editions? I never got mine to work.

fsckboy 18 hours ago | parent | prev | next [-]

doesn't wine have old versions of mswindows pretty much nailed?

kopirgan 18 hours ago | parent | prev [-]

I have the MAD archives bought in 90s on CDs but can't use..

haunter 18 hours ago | parent | next [-]

The issues on the Absolutely MAD DVD (1952-2005) are just plain PDF files, no DRM, they work perfectly

https://files.catbox.moe/x4np6u.png

kopirgan 5 minutes ago | parent | next [-]

No mine were pre dvd era. In CD. Older. They had a surprisingly good UI with its own funny stuff. Your install that and insert the disk 1-7 based on which issue you select. Even scold you for installing wrong disk & comments about 'you can insert a CD of Yanni if you prefer screeching' or something like that

ghaff 17 hours ago | parent | prev [-]

The CDs I have seem to be proprietary for Windows from the late 90s. But I also have PDFs through 2005 on my computer which I must have "acquired" at some point.

kopirgan 3 minutes ago | parent | next [-]

Yes the file names are something unknown. It has a software to access. They did a damn good job.

haunter 16 hours ago | parent | prev [-]

The browser app might be some outdated Windows application, that's the case with the MAD DVD too, but you can find the actual issue files in some folders

ghaff 18 hours ago | parent | prev [-]

I have MAD archives somewhere. I thought they were in some standard format but maybe not.

A lot of the gen 1 or so CD content isn't easily accessible although a more industrious person could probably get to it in some manner.

kopirgan 3 minutes ago | parent [-]

I have the CD backed up as ISO files which I can mount. Since these days laptops don't have CD players.

Need to try on latest windows 11 I gave up earlier. For a while had a windows 2000 virtual machine that worked.