Tests: Normalize

From Archivematica
Jump to navigation Jump to search
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

Main Page > Development > Development documentation > Testing > Tests: Normalize

Archivematica normalizes certain ingested file formats into preservation-friendly formats.


General

  • Normalization is triggered automatically when the user approves Appraise SIP in the dashboard.
  • The original files are not deleted.
  • When normalized files have the same file extensions as the original, the original files are not overwritten.
  • If the SIP contains files for which there are no default normalization paths, processing of the SIP does not stop.
  • If any files in the SIP fail normalization, the Normalization micro-service shows the error in the dashboard.
  • A copy of the SIP is placed in the failedNormalization directory.
  • The user can approve the SIP by clicking Approve in the dashboard.
  • The user can reject the SIP by clicking Reject. Rejected SIPs are moved to the /rejected directory.
  • A text file named normalizationLog.txt is added to the logs directory in the AIP.

Audio files

  • .ac3 files are normalized to uncompressed WAVE
  • .aif/.aiff files are normalized to uncompressed WAVE
  • .mp3 files are normalized to uncompressed WAVE
  • .wav files are normalized to uncompressed WAVE
  • .wma files are normalized to uncompressed WAVE

Office documents

  • .doc files are normalized to ODT
  • .docx files are left in their original format
  • .ppt files are normalized to ODP
  • .pptx files are left in their original format
  • .rtf files are normalized to ODT
  • .xls files are normalized to ODS
  • .xlsx files are left in their original format
  • .wpd files are normalized to ODT

Portable Document files

  • .pdf is normalized to pdf/a (note that the file extension remains .pdf)


Plain text

.txt files are left in their original format

Raster images

  • .bmp files are normalized to uncompressed TIFF 6.0
  • .gif files are normalized to uncompressed TIFF 6.0
  • .jpg files are normalized to uncompressed TIFF 6.0
  • .jp2 files are normalized to losslessly compressed JPEG 2000
  • .png files are not normalized
  • .psd files are normalized to uncompressed TIFF 6.0
  • .tif files are normalized to uncompressed TIFF 6.0
  • .tga files are normalized to uncompressed TIFF 6.0

Vector images

  • .ai files are normalized to SVG 1.1
    • multi-page .ai files are normalized to multiple single-page SVG 1.1 files
  • .eps files are normalized to SVG 1.1
  • .svg files are normalized to SVG 1.1
    • .svg files that tagged as SVG 1.0 but which are entirely conformant to the SVG 1.1 standard are not normalized to SVG 1.1

Video files

  • .avi files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .flv files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .mov files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .mpg files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .m2v files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .mp4 files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .swf files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
  • .wmv files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF