This afternoon, Craig, Rebecca, and I sat down for a webinar from ArchiveIt about archiving social media sites. The advanced training session covered the reasons for archiving social media (“a tweet is a record”) and then explored how to add specific seed URLs to one’s ArchiveIt web archive to capture the content created on these sites.
Some takeaways from the webinar include:
- Always run a test crawl to see how many documents the crawl will capture and how much space it will use
- Review test crawl results when adding new seed URLs
- Be specific with social media site URLs
Additional information about how to set scope and document limits when adding social media seed URLs to ArchiveIt can be found on the “Archiving Social Networking Sites with ArchiveIt” page on the ArchiveIt wiki. We’ll be organizing and adding new social media seeds to the ZSR ArchiveIt account!