Normalizing based on FITS output

From Archivematica
Revision as of 17:28, 15 July 2011 by Evelyn

This table shows the DROID, FileUtility and ExifTool output, as reported by FITS, for the file extensions for which Archivematica has preservation and access plans.

Media type Extension DROID identification fileUtility output exifTool output Notes
Audio AC3 (not identified) format: ATSC A/52 aka AC-3 aka Dolby Digital stream; mimetype: application/octet-stream Unknown file type
Audio AIF Name: Audio Interchange File Format; PUID: x-fmt/135; MimeType: audio/x-aiff format: IFF data, AIFF audio; mimetype: audio/x-aiff FileType: AIFF; MIMEType: audio/aiff
Audio MP3 Name: MPEG 1/2 Audio Layer 3; PUID: fmt/134; MimeType: audio/mpeg format: audio file with ID3 version 2.3.0, contains: MPEG ADTS, Layer III; mimetype: audio/mpeg FileType: MP3; MIMEType: audio/mpeg
Audio WAV Name: Waveform Audio; PUID: fmt/6; MimeType: audio/x-wav format: RIFF (little-endian) data, WAVE audio, Microsoft PCM; mimetype: audio/x-wav FileType: WAV; MIMEType: audio/x-wav
Audio WMA Name: Advanced Systems Format; PUID: fmt/131; MimeType: application/vnd.ms-asf format: Microsoft ASF; mimetype: video/x-ms-asf FileType: WMA; MIMEType: audio/x-ms-wma DROID and FileUtility misidentify WMA as video
Audio summary DROID output can be used for most but not all audio files (AC3 and WMA are not reliably identified)
Email PST
Office Open XML DOCX Name: Microsoft Office Open XML; PUID: fmt/189; MimeType: (none) format: Zip archive data; mimetype: application/zip FileType: ZIP; MIMEType: application/zip
Office Open XML PPTX Name: Microsoft Office Open XML; PUID: fmt/189; MimeType: (none) format: Zip archive data; mimetype: application/zip FileType: ZIP; MIMEType: application/zip
Office Open XML XLSX Name: Microsoft Office Open XML; PUID: fmt/189; MimeType: (none) format: Zip archive data; mimetype: application/zip FileType: ZIP; MIMEType: application/zip
Office Open XML summary FITS can't distinguish between word processing, spreadsheet and presentation files; must use file extensions
Portable Document Format PDF Name: Acrobat PDF 1.4 - Portable Document Format; PUID: fmt/18; MimeType: application/pdf format: PDF; mimetype: application/pdf FileType: PDF; MIMEType: application/pdf
Portable Document Format/Archival PDF
  • Name: Acrobat PDF 1.4 - Portable Document Format; PUID: fmt/18; MimeType: application/pdf
  • Name: Acrobat PDF/A - Portable Document Format; PUID: fmt/95; MimeType: application/pdf
format: PDF document; mimetype: application/pdf FileType: PDF; MIMEType: application/pdf
Portable Document Format summary Base normalization on DROID output, which is very useful for distinguishing between PDF and PDF/A
Presentation PPT Name: Microsoft Powerpoint Presentation; PUID: fmt/126; MimeType: application/vnd.ms-powerpoint format: Microsoft Office Document; mimetype: application/octet-stream FileType: PPT; MIMEType: application/vnd.ms-powerpoint Exiftool identifies as FileType FPX, MIMEType image/vnd.fpx if file extension is missing
Presentation ODP
Raster image BMP name: Windows Bitmap; PUID: fmt/116; MimeType: image/bmp format: PC Bitmap, Windows 3.x format; mimetype: image/x-ms-bmp FileType: BMP; MIMEType: image/bmp
Raster image GIF name: Graphics Interchange Format; PUID: fmt/4; MimeType: image/gif format: GIF image data, version 89a; mimetype: image/gif FileType: GIF; MIMEType: image/gif
Raster image JPG name: JPEG File Interchange Format; PUID: fmt/43; MimeType: image/jpeg format: JPEG image data, JFIF standard 1.01; mimetype: image/jpeg FileType: JPEG; MIMEType: image/jpeg
Raster image JP2 name: JPEG2000; PUID: x-fmt/392; MimeType: image/jp2 format: JPEG 2000 image data; mimetype: application/octet-stream FileType: JP2; MIMEType: image/jp2
Raster image PCT name: Macintosh PICT Image; PUID: x-fmt/80; MimeType: (none) format: data; mimetype: application/octet-stream FileType: PICT; MIMEType: image/pict DROID doesn't recognize format if file extension is missing
Raster image PNG Name: Portable Network Graphics; PUID: fmt/11; MimeType: image/png format: PNG image; mimetype: image/png FileType: PNG; MIMEType: image/png
Raster image PSD Name: Adobe Photoshop; PUID: x-fmt/92; MimeType: (none) format: Adobe Photoshop Image; mimetype: image/vnd.adobe.photoshop FileType: PSD; MIMEType: application/photoshop
Raster image TIF Name: Tagged Image File Format; PUID: fmt/7, fmt/8, fmt/9, fmt/10; MimeType: image/tiff format: TIFF image data, big-endian image; mimetype: image/tiff FileType: TIFF; MIMEType: image/tiff
Raster image TGA Name: Truevision Graphics Adapter; PUID: x-fmt/367; MimeType: (none) format: Targa image data; mimetype: application/octet-stream Error: Unknown file type DROID doesn't recognize format if file extension is missing
Raster image summary DROID output seems reliable for raster images
Raw camera image 3FR
Raw camera image ARW Name: Tagged Image File Format; PUID fmt/7, fmt/8, fmt/9, fmt/10; MimeType: image/tiff format: TIFF image data, little-endian; mimetype: image/tiff FileType: ARW; MIMEType: image/x-raw Exiftool identifies as FileType TIFF MIMEType TIFF if file extension is missing
Raw camera image CR2 Name: Tagged Image File Format; PUID fmt/7, fmt/8, fmt/9, fmt/10; MimeType: image/tiff format: TIFF image data, little-endian; mimetype: image/tiff FileType: CR2; MIMEType: image/x-raw
Raw camera image CRW (not identified) format: data; mimetype: application/octet-stream FileType: CRW; MIMEType: image/x-raw
Raw camera image DCR Name: Tagged Image File Format; PUID fmt/7, fmt/8, fmt/9, fmt/10; MimeType: image/tiff format: TIFF image data, big-endian; mimetype: image/tiff FileType: DCR; MIMEType: image/x-raw Exiftool identifies as FileType TIFF MIMEType TIFF if file extension is missing
Raw camera image DNG Name: Exchangeable Image File Format (Uncompressed); PUID x-fmt/387; MimeType: image/tiff (no output) FileType: DNG; MIMEType: image/x-raw
Raw camera image ERF Name: Tagged Image File Format; PUID fmt/7, fmt/8, fmt/9, fmt/10; MimeType: image/tiff format: TIFF image data, big-endian; mimetype: image/tiff FileType: ERF; MIMEType: image/x-raw Exiftool identifies as FileType TIFF MIMEType TIFF if file extension is missing
Raw camera image KDC Name: Tagged Image File Format; PUID fmt/7, fmt/8, fmt/9, fmt/10; MimeType: image/tiff format: TIFF image data, big-endian; mimetype: image/tiff FileType: KDC; MIMEType: image/x-raw Exiftool identifies as FileType TIFF MIMEType TIFF if file extension is missing
Raw camera image MRW (not identified) format: Minolta Dimage camera raw image data; mimetype: application/octet-stream FileType: MRW; MIMEType: image/x-raw
Raw camera image NEF Name: Tagged Image File Format; PUID fmt/7, fmt/8, fmt/9, fmt/10; MimeType: image/tiff format: TIFF image data, big-endian; mimetype: image/tiff FileType: NEF; MIMEType: image/x-raw Exiftool identifies as FileType TIFF MIMEType TIFF if file extension is missing
Raw camera image ORF (not identified) format: Olympus ORF raw image data, little-endian; mimetype: image/x-olympus-orf FileType: ORF; MIMEType: image/x-raw Exiftool identifies as FileType TIFF MIMEType TIFF if file extension is missing
Raw camera image PEF Name: Tagged Image File Format; PUID fmt/7, fmt/8, fmt/9, fmt/10; MimeType: image/tiff format: TIFF image data, big-endian; mimetype: image/tiff FileType: PEF; MIMEType: image/x-raw Exiftool identifies as FileType TIFF MIMEType TIFF if file extension is missing
Raw camera image RAF (not identified) format: data; mimetype: application/octet-stream FileType: RAF; MIMEType: image/x-raw
Raw camera image RAW
Raw camera image X3F (not identified) format: Foveon X3F raw image data, version 2.1; mimetype: image/x-x3f FileType: X3F; MIMEType: image/x-raw
Raw camera image summary Must use file extensions for raw camera images
Spreadsheet XLS Name: OLE2 Compound Document Format; PUID: fmt/111; MimeType: (none) format: Microsoft Office Document; mimetype: application/octet-stream FileType: XLS; MIMEType: application/vnd.ms-excel Exiftool identifies as FileType: FPX; MimeType: image/vnd.fpx if file extension is missing
Vector image AI PDF PDF mimetype: application/PDF
Vector image EPS (not identified) PostScript document text conforming DSC level 3.1, type EPS, Level 2 mimetype: application/postscript
Video AVI Name: Audio/Video Interleaved Format; PUID: fmt/5; MimeType: video/x-msvideo format: RIFF (little-endian) data, AVI; mimetype: video/x-msvideo FileType: AVI; MIMEType: video/avi
Video FLV Name: Macromedia FLV; PUID: x-fmt/382; MimeType: video/x-flv format: Macromedia Flash Video; mimetype: video/x-flv FileType: FLV; MIMEType: video/x-flv
Video M2V Name: PUID: MimeType: format: mimetype: FileType: MIMEType:
Video MOV Name: Quicktime; PUID: x-fmt/384; MimeType: video/quicktime format: Apple QuickTime movie; mimetype: video/quicktime FileType: MOV; MIMEType: video/quicktime
Video MPG Name: MPEG-1 Video Format; PUID: x-fmt/385; MimeType: video/mpeg format: MPEG sequence, v1, system multiplex; mimetype: application/octet-stream FileType: MPEG; MIMEType: video/mpeg
Video MP4 Name: MPEG-4 Media File; PUID: fmt/199 format: ISO Media, MPEG v4 system, version 2; mimetype: video/mp4 FileType: MP4; MIMEType: video/mp4
Video SWF Name: Macromedia Flash; PUID: fmt/107; MimeType: format: Macromedia Flash data; mimetype: application/x-shockwave-flash FileType: SWF; MIMEType: application/x-shockwave-flash
Video WMV Name: Advanced Systems Format; PUID: fmt/131; MimeType: application/vnd.ms-asf format: Microsoft ASF; mimetype: video/x-ms-asf FileType: WMV; MIMEType: video/x-ms-wmv DROID and FileUtility misidentify WMA as WMV/video
Video summary DROID output appears to be reliable for video, except for WMV
Word processing DOC
Word processing ODT Name: ZIP format; PUID: x-fmt/263; MimeType: application/zip format: OpenDocument Text; mimetype: application/vnd.oasis.opendocument.text FileType: ZIP; MIMEType: application/zip Use fileUtility output or extension
Word processing RTF Name: Rich Text Format; PUID: fmt/50, fmt/51; MimeType: application/rtf, text/rtf format: Plain text; mimetype: text/rtf Error: Unknown file type If file extension is missing, DROID PUIDs are fmt/45, fmt/46, fmt/47, fmt/48, fmt/49
Word processing WPD
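
The summary rows in the table above could be turned into a lookup that decides which identification source to trust for a given file. The following Python sketch is only an illustration of that idea, not Archivematica code: the function and constant names are hypothetical, and falling back to the file extension for media types that have no summary row (email, presentation, spreadsheet, vector image, word processing) is an assumption rather than something the table states.

```python
# Hypothetical sketch: choose an identification source per the table's summary rows.
DROID = "droid"
FILE_UTILITY = "file_utility"
EXTENSION = "extension"

# Defaults taken from the "summary" rows of the table.
DEFAULT_BY_MEDIA_TYPE = {
    "audio": DROID,                      # DROID usable for most audio files
    "office open xml": EXTENSION,        # DOCX/PPTX/XLSX all report fmt/189
    "portable document format": DROID,   # DROID separates PDF from PDF/A
    "raster image": DROID,               # DROID output seems reliable
    "raw camera image": EXTENSION,       # most raw formats look like TIFF
    "video": DROID,                      # reliable except for WMV
}

# Per-extension exceptions noted in the table.
EXCEPTIONS = {
    "ac3": EXTENSION,     # not identified by DROID
    "wma": EXTENSION,     # DROID and FileUtility report ASF/video
    "wmv": EXTENSION,     # same ASF identification as WMA
    "odt": FILE_UTILITY,  # DROID and ExifTool only see a ZIP archive
}

PDFA_PUID = "fmt/95"  # PUID listed in the table for PDF/A


def identification_source(media_type, extension):
    """Return which source to trust for a file of this media type and extension."""
    ext = extension.lower().lstrip(".")
    if ext in EXCEPTIONS:
        return EXCEPTIONS[ext]
    # Assumption: media types without a summary row fall back to the extension.
    return DEFAULT_BY_MEDIA_TYPE.get(media_type.lower(), EXTENSION)


def is_pdfa(droid_puids):
    """Return True if DROID reported the PDF/A PUID among its identifications."""
    return PDFA_PUID in droid_puids
```

For example, `identification_source("Audio", "MP3")` selects DROID, while `identification_source("Audio", ".wma")` falls back to the file extension because both DROID and FileUtility report WMA files as ASF.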