Starting a Web Archives collection 

Starting a Web Archives collection 

Local Web Archiving Policies

  • Collection Planning 
    • Collection Scope……….……………………………………….……1
    • Collection Planning……….……………………………………….…1
    • Storage Allocation……………………………………………….……1
    • Collection Development Policy…………………………………..….2
    • Collection Organization………………………………………….…..3 
    • Collection Naming Conventions………………………………….. 3-4 
  • Policies
    • Intellectual Property/Copyright………………………………..…….4
    • About the robots.txt File……………………………………….…….4 
    • Storage and Contingency Planning…………………………..…… 4
    • Password- Protect Content……………………………….…………4

Creating your Collection

  • Term Definitions…………………..………………….…………………..…..1
  • Group Seeds………………………………….……………….………..……1
  • Web Crawl set up………………………………….……………………… 2-3
  • Seed settings…………………………….………………………………….3-7
  • test crawls…………………………………………………………….…….7-8 
  • Web Crawler Settings…………………………..………………….……. 8-10 
  • Collection guidance by content type……………………………………10-14
  • Challenges……..…………………………………………….……………14-17

Managing your Collection

  • Quality Assurance……………………………………………….…………1-2 
  • Descriptive Metadata……………………………………………..………. 2-9

 

For additional training please email: webarchives@library.illinois.edu 

The documentation reflects Archive-It’s documentation and will be updated when Archive-It updates. If you want to see the most up to date information please refer to the Archive-It documentation.