- cross-posted to:
- datahoarder
- technology
After its website was crippled for nearly a month by a cyberattack, the Internet Archive announced on Monday that it had restored one of its most valuable services—the Save Page Now feature that allows users to add copies of webpages to the organization’s digital library.
In a social media post, the Internet Archive said web pages that users had attempted to save since October 9 are beginning to be archived now, although it did not provide an estimate for when the process would be completed. So, if you were worried that all of that election coverage was in danger of disappearing, the Archive says it’s handling the backlog. And if you stopped archiving because it was down, get back to work.
The organization had been operating its collection in read-only mode since October 21 as it steadily worked to restore services.
Founded in 1996, the Internet Archive is a nonprofit based in San Francisco that provides access to historic web pages, digitized books, and a variety of other media that it has uploaded through its partnerships with hundreds of physical libraries and other partners.
Its unparalleled collection currently contains 835 billion web pages, 44 million books and texts, 15 million audio recordings, 10.6 million videos, 4.8 million images, and 1 million software programs.
A friendly reminder to everyone to check out ArchiveBox if you’re looking for a self-hosted archiving solution. I’ve been using it for a while now and it works great; it can be a little rough around the edges at times, but I think it’s a wonderful tool. It’s allowed me to continue saving pages during the Internet Archive’s outage.
I like ArchiveBox, but in my experience it kept running into issues saving pages, and it stopped functioning after working the first few times. I really wish there were a more streamlined application out there that did a similar thing.
I’ve been looking at Linkwarden’s page archiving solution, but it crashes whenever I try importing any large number of links, so that’s a bust too.
What sort of content are you archiving? I was trying to back up Wikipedia at some point and it was just a nightmare.