Tests: Normalize

From Archivematica
Revision as of 14:14, 1 February 2011 by Glenn

Archivematica normalizes certain ingested file formats into preservation-friendly formats.


General

  • Normalization is triggered automatically when the user approves Appraise SIP in the dashboard.
  • The original files are not deleted.
  • When normalized files have the same file extensions as the original, the original files are not overwritten.
  • If the SIP contains files for which there are no default normalization paths, processing of the SIP does not stop.
  • A text file named normalizationLog.txt is added to the logs directory in the AIP.
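
Two of the expectations above (originals are kept, and the log file is present) lend themselves to an automated check. The following is a minimal sketch, not Archivematica's own test code. The function name check_general_expectations, the assumption that the AIP is available as an unpacked directory, and the logs/normalizationLog.txt location relative to the AIP root are all illustrative assumptions.

```python
from pathlib import Path


def check_general_expectations(aip_root, original_paths):
    """Return a list of failure messages; an empty list means every check passed.

    aip_root: path to an unpacked AIP directory (assumed layout, see lead-in).
    original_paths: paths of the original files, relative to aip_root.
    """
    aip_root = Path(aip_root)
    failures = []

    # Each original file must still be present after normalization.
    for rel in original_paths:
        if not (aip_root / rel).is_file():
            failures.append(f"original missing: {rel}")

    # Assumed location of the log: <AIP root>/logs/normalizationLog.txt
    if not (aip_root / "logs" / "normalizationLog.txt").is_file():
        failures.append("normalizationLog.txt not found in logs directory")

    return failures
```

This only confirms that the files exist. Verifying that an original was not overwritten by a normalized file with the same extension would additionally require comparing checksums taken before and after normalization.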

Audio files

  • .ac3 files are normalized to uncompressed WAVE
  • .aif/.aiff files are normalized to uncompressed WAVE
  • .mp3 files are normalized to uncompressed WAVE
  • .wav files are normalized to uncompressed WAVE
  • .wma files are normalized to uncompressed WAVE

Office documents

  • .doc files are normalized to ODT
  • .docx files are left in their original format
  • .ppt files are normalized to ODP
  • .pptx files are left in their original format
  • .rtf files are normalized to ODT
  • .xls files are normalized to ODS
  • .xlsx files are left in their original format
  • .wpd files are normalized to ODT
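
For a test script, the office document expectations above can be captured as a lookup table. This is a sketch under stated assumptions: the names OFFICE_EXPECTED and expected_office_format are hypothetical, None is used here to mean "left in original format", and treating extensions as case-insensitive is an assumption that the list above does not confirm.

```python
import os

# Expected outcomes copied from the list above.
# None means the file is expected to be left in its original format.
OFFICE_EXPECTED = {
    ".doc": "ODT",
    ".docx": None,
    ".ppt": "ODP",
    ".pptx": None,
    ".rtf": "ODT",
    ".xls": "ODS",
    ".xlsx": None,
    ".wpd": "ODT",
}


def expected_office_format(filename):
    """Return the expected target format for an office document, or None."""
    # Lowercasing is an assumption: the source list only shows lowercase extensions.
    ext = os.path.splitext(filename)[1].lower()
    if ext not in OFFICE_EXPECTED:
        raise KeyError(f"no office-document expectation listed for {ext!r}")
    return OFFICE_EXPECTED[ext]
```

A tester could call expected_office_format("report.doc") and compare the result with the format of the file actually produced in the AIP.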

Portable Document files

  • .pdf files are normalized to PDF/A (note that the file extension remains .pdf)


Plain text

  • .txt files are left in their original format

Raster images

  • .bmp files are normalized to uncompressed TIFF 6.0
  • .gif files are normalized to uncompressed TIFF 6.0
  • .jpg files are normalized to uncompressed TIFF 6.0
  • .jp2 files are normalized to losslessly compressed JPEG 2000 (not yet implemented)
  • .png files are left in their original format
  • .psd files are normalized to uncompressed TIFF 6.0
  • .tif files are normalized to uncompressed TIFF 6.0
  • .tga files are normalized to uncompressed TIFF 6.0

Vector images

  • .ai files are normalized to SVG 1.1 (not yet implemented)
    • multi-page .ai files are normalized to multiple single-page SVG 1.1 files
  • .eps files are normalized to SVG 1.1
  • .svg files are normalized to SVG 1.1
    • .svg files that are tagged as SVG 1.0 but are entirely conformant to the SVG 1.1 standard are not normalized to SVG 1.1

Video files

  • .avi files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .flv files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .mov files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .mpg files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .m2v files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .mp4 files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .swf files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .wmv files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
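
All eight video extensions above share the same expected outcome, so a test script could generate its cases from a single expectation rather than repeating it. The sketch below is illustrative only: VideoExpectation, VIDEO_EXTENSIONS, EXPECTED and video_test_cases are hypothetical names, and the code describes what to check, not how Archivematica performs the conversion.

```python
from typing import NamedTuple


class VideoExpectation(NamedTuple):
    video: str    # expected format of the video streams
    audio: str    # expected format of the audio streams
    wrapper: str  # expected container format


# Extensions and target formats copied from the list above.
VIDEO_EXTENSIONS = (".avi", ".flv", ".mov", ".mpg", ".m2v", ".mp4", ".swf", ".wmv")
EXPECTED = VideoExpectation(video="MPEG-2", audio="uncompressed WAVE", wrapper="MXF")


def video_test_cases():
    """Return one (extension, expectation) pair per listed video extension."""
    return [(ext, EXPECTED) for ext in VIDEO_EXTENSIONS]
```

Each pair can then drive one test: ingest a sample file with that extension and compare the streams and wrapper of the normalized output against the expectation.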