Format policies
Archivematica maintains the original format of all ingested files to support migration and emulation preservation strategies.
Archivematica's primary preservation strategy is to normalize files to preservation and access formats upon ingest. The choice of access formats is based on the ubiquity of viewers for the file format.
Archivematica's preservation formats are all open standards. Additionally, the choice of preservation format is based on community best practices, availability of open-source normalization tools, and an analysis of the significant properties for each media type.
Media type | Preservation format(s) | Access format(s) | Normalization tool | Comments |
---|---|---|---|---|
Video files | Motion JPEG2000/MXF or MPEG-2/MXF | OGG, FLV | FFmpeg | Motion JPEG2000 is the emerging preferred standard for video files, but it is hard to find a Linux tool that converts to that codec. MPEG-2, however, is an accepted standard that is in use at a number of institutions. |
Audio files | LPCM/WAVE | OGG, MP3 | FFmpeg | We may also wish to consider FLAC as a preservation format for audio files. It is less well-known in the archival community but is a fully lossless, openly-specified, non-proprietary audio format. |
Raster images (except raw camera files) | TIFF, JPEG2000 or PNG | PNG | ImageMagick | Since TIFF, JPEG2000 and PNG are all good formats for preservation, we could leave any files in those formats as they are (as long as they are uncompressed or losslessly compressed). Files in other formats, such as JPEG, GIF and BMP, could be normalized to one of the preservation formats. |
Raw camera files | DNG | TIFF or PNG | DigiKam DNG Converter | |
Vector images | SVG | | | |
Word processing files | Open Document Format; PDF/A | PDF or PDF/A | OpenOffice Writer | PDF/A normalization of MS Word files is somewhat problematic because best results are achieved from within the native application - i.e. MS Office running in MS Windows. Archivematica does not support either Windows or MS Office since these are proprietary software packages. |
Spreadsheets | Open Document Format | | OpenOffice Calc | |
Presentation files | Open Document Format; PDF/A | PDF or PDF/A | Xena or OpenOffice Impress | Xena may be preferable, since it appears to produce a more accurate representation of the original. |
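
The table above can also be read as a lookup from media type to a normalization plan. The following is a minimal Python sketch of that idea, not Archivematica's actual implementation: the `FormatPolicy` class, the `POLICIES` keys and the `raster_needs_normalization` helper are hypothetical names introduced for illustration. It assumes that OpenOffice Calc is the normalization tool for spreadsheets (with no access format listed), and it encodes the raster image comment as a simple rule.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class FormatPolicy:
    preservation: Tuple[str, ...]  # preservation format(s) from the table
    access: Tuple[str, ...]        # access format(s); empty where none is listed
    tool: Optional[str]            # normalization tool; None where none is listed


# Rows transcribed from the table; the dictionary keys are hypothetical identifiers.
POLICIES = {
    "video": FormatPolicy(("Motion JPEG2000/MXF", "MPEG-2/MXF"), ("OGG", "FLV"), "FFmpeg"),
    "audio": FormatPolicy(("LPCM/WAVE",), ("OGG", "MP3"), "FFmpeg"),
    "raster_image": FormatPolicy(("TIFF", "JPEG2000", "PNG"), ("PNG",), "ImageMagick"),
    "raw_camera": FormatPolicy(("DNG",), ("TIFF", "PNG"), "DigiKam DNG Converter"),
    "vector_image": FormatPolicy(("SVG",), (), None),
    "word_processing": FormatPolicy(
        ("Open Document Format", "PDF/A"), ("PDF", "PDF/A"), "OpenOffice Writer"
    ),
    # Assumption: OpenOffice Calc is the tool, and no access format is listed.
    "spreadsheet": FormatPolicy(("Open Document Format",), (), "OpenOffice Calc"),
    "presentation": FormatPolicy(
        ("Open Document Format", "PDF/A"), ("PDF", "PDF/A"), "Xena or OpenOffice Impress"
    ),
}


def raster_needs_normalization(current_format: str, lossless: bool = True) -> bool:
    """Apply the raster image comment: keep files that are already in a
    preservation format and are uncompressed or losslessly compressed;
    normalize everything else (for example JPEG, GIF or BMP)."""
    preservation = {f.upper() for f in POLICIES["raster_image"].preservation}
    return current_format.upper() not in preservation or not lossless
```

For example, `raster_needs_normalization("JPEG")` is true, while `raster_needs_normalization("tiff")` is false because a losslessly stored TIFF is already in a preservation format. Keeping the policy as plain data means a row can be revised, for instance to add FLAC for audio, without touching any logic.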