Starting a Web Archives Collection
- Collection Planning
- Collection Scope
- Collection Planning
- Storage Allocation
- Collection Development Policy
- Collection Organization
- Collection Naming Conventions
- Policies
- Intellectual Property/Copyright
- About the robots.txt File
- Storage and Contingency Planning
- Password-Protected Content
- Term Definitions
- Group Seeds
- Web Crawl Setup
- Seed Settings
- Test Crawls
- Web Crawler Settings
- Collection Guidance by Content Type
- Challenges
- Quality Assurance
- Descriptive Metadata
For additional training, please email: webarchives@library.illinois.edu
This documentation follows Archive-It's own documentation and will be revised whenever Archive-It updates it. For the most up-to-date information, please refer to the Archive-It documentation.