Tests: Normalize

From Archivematica
Jump to navigation Jump to search

Main Page > Development > Development documentation > Testing > Tests: Normalize

Archivematica normalizes certain ingested file formats into preservation-friendly formats.


General[edit]

  • Normalization is triggered automatically when the user approves Appraise SIP in the dashboard.
  • The original files are not deleted.
  • When normalized files have the same file extensions as the original, the original files are not overwritten.
  • If the SIP contains files for which there are no default normalization paths, processing of the SIP does not stop.
  • If any files in the SIP fail normalization, the Normalization micro-service shows the error in the dashboard.
  • A copy of the SIP is placed in the failedNormalization directory.
  • The user can approve the SIP by clicking Approve in the dashboard.
  • The user can reject the SIP by clicking Reject. Rejected SIPs are moved to the /rejected directory.
  • A text file named normalizationLog.txt is added to the logs directory in the AIP.

Audio files[edit]

  • .ac3 files are normalized to uncompressed WAVE
  • .aif/.aiff files are normalized to uncompressed WAVE
  • .mp3 files are normalized to uncompressed WAVE
  • .wav files are normalized to uncompressed WAVE
  • .wma files are normalized to uncompressed WAVE

Office documents[edit]

  • .doc files are normalized to ODT
  • .docx files are left in their original format
  • .ppt files are normalized to ODP
  • .pptx files are left in their original format
  • .rtf files are normalized to ODT
  • .xls files are normalized to ODS
  • .xlsx files are left in their original format
  • .wpd files are normalized to ODT

Portable Document files[edit]

  • .pdf is normalized to pdf/a (note that the file extension remains .pdf)


Plain text[edit]

.txt files are left in their original format

Raster images[edit]

  • .bmp files are normalized to uncompressed TIFF 6.0
  • .gif files are normalized to uncompressed TIFF 6.0
  • .jpg files are normalized to uncompressed TIFF 6.0
  • .jp2 files are normalized to losslessly compressed JPEG 2000
  • .png files are not normalized
  • .psd files are normalized to uncompressed TIFF 6.0
  • .tif files are normalized to uncompressed TIFF 6.0
  • .tga files are normalized to uncompressed TIFF 6.0

Vector images[edit]

  • .ai files are normalized to SVG 1.1
    • multi-page .ai files are normalized to multiple single-page SVG 1.1 files
  • .eps files are normalized to SVG 1.1
  • .svg files are normalized to SVG 1.1
    • .svg files that tagged as SVG 1.0 but which are entirely conformant to the SVG 1.1 standard are not normalized to SVG 1.1

Video files[edit]

  • .avi files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .flv files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .mov files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .mpg files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .m2v files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .mp4 files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .swf files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .wmv files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF