Today Erik and Audra attended a webex session from the Internet Archive on new features in ArchiveIT 4.0. They had me from the first few minutes when they announced that this year had been named the ‘year of metadata’ at the Internet Archive!
They focused on new features including metadata searching, crawl date limiting, and improved video crawling and streaming.
They also have enhanced their reporting features, specifically introducing a URL report that shows exactly what URLs got archived during a given crawl. They also introduced a number of automatic metadata harvesting features during the seed assignment process and some new features to scope-it that helps you set constraints on specific hosts.
One interesting metadata feature they introduced was the ability to export metadata records for archived items to both MARC and MODS. I thought this was an interesting concept as a way to leverage archived content in local indexes or webservices. They also introduced a third party tool called ProxyToggle, a Firefox plug-in that helps do quality control testing on archived content.