

I wonder if an authorised remote user (ie an affiliated researcher) can easily instruct ArchiveBox to store a URL and later retrieve it. Also, ideally a random user should be able to retrieve the archived web page or file (eg a PDF, CSV etc). The idea is that authorised researchers can get URLs archived, and then any user reading our reports can click on a citation and get our archived source if the original is not available any more. I’ll need to run it and see, but it looks promising.
Keeping the archive alive for years later, possibly after funding dries up, is another challenge but there are public repositories that may be suitable for that.



One advantage and disadvantage of having webrecorder host our archived pages is that the archive may survive longer than, or not as long as our project.
I have been using singlefile for years. It’s great but not for seamlessly making cached web pages available to the general public reading our reports and finding that cited links are now dead. And it doesn’t support URLs point to PDF, CSV files. A public-facing repository of singlefile files with an index for ToC might do it though. Simplicity is good for future-proofing an archive.
Something like archive.org and archive.is would be ideal, but we have no control over its future and practices.