Difference between revisions of "Tests: Normalize"
Revision as of 16:54, 9 December 2010
Archivematica normalizes certain ingested file formats into preservation-friendly formats.
General
- Normalization is triggered automatically when the user approves "Appraise SIP" in the dashboard.
- The original files are not deleted.
- When normalized files have the same file extensions as the original, the original files are not overwritten.
- If the SIP contains files for which there are no default normalization paths, processing of the SIP does not stop.
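The general rules above can be sketched as a small planning function. This is an illustration only, not Archivematica's code: the `RULES` table, the `plan_normalization` function, the `normalized` output directory, and the `-1` suffix scheme are all hypothetical. It shows originals being kept, same-extension outputs being written elsewhere rather than over the original, and files without a rule being skipped without stopping the run.

```python
from pathlib import PurePosixPath

# Hypothetical rule table: source extension -> extension of the normalized file.
# Only a handful of the formats listed on this page are included.
RULES = {".mp3": ".wav", ".wav": ".wav", ".bmp": ".tif", ".tif": ".tif", ".pdf": ".pdf"}


def plan_normalization(paths, out_dir="normalized"):
    """Return (planned, skipped) for a list of file paths.

    planned maps each original path to the path of its normalized copy;
    skipped lists the files that have no rule. Nothing is deleted or
    overwritten, because targets never reuse an existing path.
    """
    planned, skipped = {}, []
    taken = set(paths)  # original paths must never be used as targets
    for path in paths:
        p = PurePosixPath(path)
        target_ext = RULES.get(p.suffix.lower())
        if target_ext is None:
            skipped.append(path)  # no default rule: record it and carry on
            continue
        target = str(PurePosixPath(out_dir) / (p.stem + target_ext))
        n = 1
        while target in taken:  # avoid clobbering another file or target
            target = str(PurePosixPath(out_dir) / f"{p.stem}-{n}{target_ext}")
            n += 1
        taken.add(target)
        planned[path] = target
    return planned, skipped
```

For example, `plan_normalization(["a.wav", "c.xyz"])` plans a copy of `a.wav` under `normalized/` and reports `c.xyz` as skipped, leaving both originals in place.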
Audio files
- .ac3 files are normalized to uncompressed WAVE
- .aif/.aiff files are normalized to uncompressed WAVE
- .mp3 files are normalized to uncompressed WAVE
- .wav files are normalized to uncompressed WAVE
- .wma files are normalized to uncompressed WAVE
Office documents
- .doc
- .docx
- .ppt
- .pptx
- .rtf
- .xls
- .xlsx
- .wpd
Portable Document files
- .pdf files are normalized to PDF/A (note that the file extension remains .pdf)
Plain text
- .txt files are left in their original format
Raster images
- .bmp files are normalized to uncompressed TIFF 6.0
- .gif files are normalized to uncompressed TIFF 6.0
- .jpg files are normalized to uncompressed TIFF 6.0
- .jp2 files are normalized to losslessly compressed JPEG 2000 (not yet implemented)
- .png files are not normalized
- .psd files are normalized to uncompressed TIFF 6.0
- .tif files are normalized to uncompressed TIFF 6.0
- .tga files are normalized to uncompressed TIFF 6.0
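One way to keep a checklist like the raster list above testable is to transcribe it into a table and derive the test cases from it. The sketch below is an assumption about how such a table might look; `RASTER_EXPECTATIONS` and `cases_to_test` are hypothetical names, and the entries simply restate the list above.

```python
TIFF = "uncompressed TIFF 6.0"

# Hypothetical expectation table transcribed from the raster image list.
# Each value is (expected target format, implemented yet?).
# A target of None means the file is expected to be left as it is.
RASTER_EXPECTATIONS = {
    ".bmp": (TIFF, True),
    ".gif": (TIFF, True),
    ".jpg": (TIFF, True),
    ".jp2": ("losslessly compressed JPEG 2000", False),
    ".png": (None, True),
    ".psd": (TIFF, True),
    ".tif": (TIFF, True),
    ".tga": (TIFF, True),
}


def cases_to_test():
    """Return (extension, expected target) pairs for implemented rules only."""
    return sorted(
        (ext, target)
        for ext, (target, implemented) in RASTER_EXPECTATIONS.items()
        if implemented
    )
```

Rules marked as not yet implemented stay in the table but are left out of the generated cases, so they can be switched on later by changing a single flag.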
Spreadsheets
.*xls files are normalized to
Vector images
- .ai files are normalized to SVG 1.1 (not yet implemented)
- multi-page .ai files are normalized to multiple single-page SVG 1.1 files
- .eps files are normalized to SVG 1.1
- .svg files are normalized to SVG 1.1
- .svg files that are tagged as SVG 1.0 but are entirely conformant to the SVG 1.1 standard are not normalized to SVG 1.1
Video files
- .avi files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
- .flv files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
- .mov files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
- .mpg files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
- .m2v files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
- .mp4 files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
- .swf files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
- .wmv files: video streams are normalized to MPEG-2; audio streams are normalized to uncompressed WAVE; wrapper is normalized to MXF
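Every video rule above has the same three-part target, so a test can compare all three parts at once. The following is a minimal sketch under stated assumptions: `VideoTarget`, `EXPECTED`, and `mismatches` are hypothetical names, and the observed values would have to come from whatever tool inspects the normalized file, which is not shown here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VideoTarget:
    video_codec: str
    audio_codec: str
    wrapper: str


# The target shared by every video format in the list above.
EXPECTED = VideoTarget("MPEG-2", "uncompressed WAVE", "MXF")

VIDEO_EXTENSIONS = (".avi", ".flv", ".mov", ".mpg", ".m2v", ".mp4", ".swf", ".wmv")


def mismatches(observed):
    """Return the names of the fields where observed differs from EXPECTED."""
    return [
        name
        for name in ("video_codec", "audio_codec", "wrapper")
        if getattr(observed, name) != getattr(EXPECTED, name)
    ]
```

An empty list from `mismatches` means the normalized file matches the rule. A non-empty list names which of the video stream, audio stream, or wrapper needs attention.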