Test File Results

From Archivematica
Jump to navigation Jump to search

Main Page > Projects > Vancouver Digital Archives > System Testing > Test File Results


DROID, JHOVE, NLNZ Metadata Extractor[edit]

The purpose of these three tools is to validate formats and extract technical metadata. For more information see Technology/Tools Evaluation > DROID, JHOVE, NLNZ Metadata Extractor


File Description DROID NLNZ JHOVE Comments
0239.mpg MPEG-2 video file
2001-04-LegendsPBP_512kb.mp4 MPEG-4 video file
A08917.TIF TIFF image file
  • DROID suggested 4 possible versions (3.0 to 6.0)
accelerando.com Website consisting of html, xhtml, css, jpg, gif and png files
  • DROID correctly identified all but css file which it identified tentatively as either cascading style sheet or Stats+ Data File; identified versions of all other files
  • NLNZ did not identify css and identified XHTML file as HTML; identified all other formats; identified versions of html and gif files, not of jpg files
  • JHOVE identified css as ASCII and did not identify png; identified all other formats and versions
artefactual.com.zip Zip archive containing html, css, jpg, gif and png files
  • DROID identified as ZIP format but did not process individual zipped files
  • NLNZ identified as mimetype "application/zip" but did not process individual zipped files, even when the zip archive was processed as a single complex object
Basic search.odt OpenOffice.org Writer file with inserted png file.
  • DROID incorrectly identified as zip file and warned of "possible file extension mismatch"; did not process inserted png
  • NLNZ correctly identified format and versions but did not process inserted png
ct000654.jp2 JPEG2000 image file
  • DROID correctly identified but identification was "tentative"
DadClip_64kb.mp3 MPEG-1 Audio Layer-3 file
  • DROID correctly identified but identification was "tentative"
DemoANSI.txt US_ASCII text file
  • DROID suggested 9 possible formats: Tab-Delimited Text File, Macintosh Text File, MS-DOS Text File, Unicode Text File, Fixed Width Values Text File, Plain Text File, MS-DOS Text File with line breaks, and IBM DisplayWrite Document
  • NLNZ identified as plain text but did not identify encoding
DemoUTF-8.txt Unicode text file
  • DROID suggested 9 possible formats: Tab-Delimited Text File, Macintosh Text File, MS-DOS Text File, Unicode Text File, Fixed Width Values Text File, Plain Text File, MS-DOS Text File with line breaks, and IBM DisplayWrite Document
  • NLNZ identified as plain text but did not identify encoding
Free_to_use_10_sec.aif AIF audio file
  • DROID correctly identified but identification was "tentative"
Holdings.png PNG image file
  • NLNZ identified mimetype as image/png but format field was empty
Ica-atom-technical-architecture-2008-06.gif GIF image file
Ica-atom-technical-architecture-2008-06 (copy).jpg Corrupted GIF image file (file extension changed to jpg)
  • DROID correctly identified and warned of possible file extension mismatch
inkscape_wallpaper___blue_

by_ryanlerch.svg

SVG image file
  • NLNZ identified as xml file (SVG is an xml extension)
  • JHOVE identified as xml file (SVG is an xml extension)
inkscape_wallpaper___blue_

by_ryanlerch (copy).svg

Corrupted SVG image file (some xml tags removed)
  • DROID identified as xml and warned of possible file extension mismatch
  • NLNZ identified as xml
  • JHOVE identified as ASCII
Lake Chelan.JPG JPEG image file
m2a39986.mov QuickTime video
manual_excerpt.pdf PDF file
Members_Master2009.xls Excel spreadsheet
  • DROID guessed version as either 8 (97-2000) or 8X (2000-2003)
  • NLNZ stated version as 4.1+
Presentation.ppt MS PowerPoint presentation with inserted jpeg file
  • DROID did not process inserted jpeg file
  • NLNZ identified version as 6.0+. Did not process inserted jpeg file
Supported standards.doc MS Word Document with inserted png file
  • DROID identified as OLE2 Compound Document Format (fmt/111). This format has multiple subtypes, including MS Word 2000-2003. DROID also warned "possible file extension mismatch". Did not process inserted png file.
  • NLNZ identified format as Microsoft Word but in version field stated "Microsoft Office Word" instead of giving version number. Did not process inserted png file.
Supported standards.rtf Rich Text Format
  • DROID identified version as either 1.5 or 1.6
  • JHOVE identified as ASCII (RTF files are typically encoded in ASCII)
TechHouse52139.wav WAVE audio file
  • DROID correctly identified as waveform audio but did not identify subtype (pcmwaveformat)
  • NLNZ correctly identified as waveform audio but did not identify subtype (pcmwaveformat)