Recommendations for authors to enable web archiving
This page presents recommendations for creating web pages that can be efficiently archived and accessed over time. It is impossible to preserve the web without the collaboration of its authors.
Last updated on August 10, 2015.
Web archives work much like web search engines such as Google. The recommendations presented here therefore help authors create content that both types of systems can process correctly.
The following recommendations are classified as either fundamental or advisable. They were written to help any author. Please contact us if you have any questions.
Web site organization
- Maintain the same address for the same content over time
- Robots Exclusion Protocol authorizing the harvest of important content
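As a rough illustration of the Robots Exclusion Protocol recommendation, the Python sketch below uses the standard library's urllib.robotparser to check that a robots.txt file leaves important content open to harvesting. The rules, the user agent name example-archive-bot, and the example.org addresses are assumptions made for illustration; they are not values prescribed by any particular archive.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block only an administrative area and
# allow everything else to be harvested.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/
Allow: /
"""


def can_harvest(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given rules let user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    agent = "example-archive-bot"  # assumed crawler name
    for path in ("/news/2015/story.html", "/admin/login"):
        url = "https://example.org" + path
        print(path, can_harvest(ROBOTS_TXT, agent, url))
```

Running a check like this against the list of pages an author considers important is one way to catch an overly broad Disallow rule before a crawler does.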
Web page content
- Links published using HTML
- Texts published using textual formats
- Media type and character set encoding correctly identified
- Metadata describing the content
- Format specification compliance
- Date of publication explicitly published
- Adequate formats for preservation
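Several of the page-level recommendations above can be checked mechanically. The following Python sketch, which relies only on the standard library, parses a Content-Type header for its media type and character set, then scans a page for a charset declaration, descriptive metadata, a publication date, and links written as plain HTML anchors. The sample page, the header value, and the use of a meta element named "date" to carry the publication date are assumptions chosen for illustration; authors may publish these details in other ways.

```python
from email.message import Message
from html.parser import HTMLParser

# Hypothetical page used only for illustration.
SAMPLE_PAGE = """<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>Example article</title>
<meta name="description" content="A short summary of the article.">
<meta name="date" content="2015-08-10">
</head>
<body>
<p>Published on August 10, 2015.</p>
<a href="/archive/2015/">2015 archive</a>
</body>
</html>
"""


class PageCheck(HTMLParser):
    """Collects the charset declaration, named metadata, and HTML links."""

    def __init__(self):
        super().__init__()
        self.charset = None
        self.meta = {}
        self.links = []

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        if tag == "meta":
            if attributes.get("charset"):
                self.charset = attributes["charset"].lower()
            name = attributes.get("name")
            content = attributes.get("content")
            if name and content is not None:
                self.meta[name.lower()] = content
        elif tag == "a" and attributes.get("href"):
            # Links in plain <a href> elements can be followed by crawlers.
            self.links.append(attributes["href"])


def parse_content_type(header_value):
    """Split a Content-Type header into (media type, charset)."""
    message = Message()
    message["Content-Type"] = header_value
    return message.get_content_type(), message.get_content_charset()


def check_page(html_text):
    """Parse html_text and return the populated PageCheck instance."""
    checker = PageCheck()
    checker.feed(html_text)
    checker.close()
    return checker


if __name__ == "__main__":
    print(parse_content_type("text/html; charset=UTF-8"))
    page = check_page(SAMPLE_PAGE)
    print(page.charset, page.meta, page.links)
```

A page that yields no charset, no metadata, or no links in such a check is not necessarily unarchivable, but it is a hint that the corresponding recommendation deserves a second look.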
Related resources
- Designing Preservable Websites, Library of Congress.
- Guidelines for Preservable Websites, Columbia University Libraries.
- Five Tips for Designing Preservable Websites, Smithsonian Institution Archives.
- ArchiveReady.com: website archivability evaluation tool.
- Archivability, Stanford University Libraries.
- Web Archiving Guidance, The National Archives and Central Office of Information (UK).
- Managing Web Resources for Persistent Access, National Library of Australia.
- Creating Preservation-Ready Web Resources, J. A. Smith, M. L. Nelson.