Difference between revisions of "Improvements/warc"

From Archivematica
Jump to navigation Jump to search
(Created page with "== Synopsis == Improvements to Archivematica's handling of WARC files could go in a number of directions, most of which involve better extraction of technical and provenance ...")
(No difference)

Revision as of 16:57, 7 September 2016

Synopsis

Improvements to Archivematica's handling of WARC files could go in a number of directions, most of which involve better extraction of technical and provenance metadata to Archivematica's METS file, which would improve the understanding and preservation of the WARC files overtime.

User story

Status

Some code is in a development branch of Archivematica (https://github.com/artefactual/archivematica/tree/dev/issue-8634-warc-mets) which will read certain elements of the WARC header. This lays the groundwork for parsing this descriptive information to the METS file. This code is based on an Archivematica branch that introduces external agents to the METS file, which lays the ground work for describing the software agent that created the WARC (e.g. ArchiveIt, wget, chrome extension, etc)

Analysis