Difference between revisions of "Dataset preservation"

From Archivematica
Jump to navigation Jump to search
Line 1: Line 1:
==Dataset file types in UBC's Abacus==
+
==METS and DDI==
 
 
*.dbf: Database file.
 
**The DBF file type is primarily associated with 'Database'. Started as dBASE II then proceeded through versions III, III+, and IV. Most of these files can usually be opened in Excel or Access. Often now called xBASE. The database index has the extension .NDX. (http://filext.com/file-extension/dbf)
 
**The .dbf extension can refer to a number of versions of the Database format. See PRONOM: http://www.nationalarchives.gov.uk/PRONOM/Format/proFormatSearch.aspx?status=listReport.
 
**[https://sites.google.com/a/datanetworkservice.nl/mixed/ MIXED] (Migration to Intermediate XML for Electronic Data) can convert Dbase III and IV files to SIARD format, a software-independent XML format for long-term preservation.
 
*.por: IBM SPSS Portable Data File.
 
**Usually used to transfer data between a survey polling program and the statistical analysis program. (http://filext.com/file-extension/POR)
 
**Portable format that can be read by other versions of SPSS Statistics and versions on other operating systems. Variable names are limited to eight bytes and are automatically converted to unique eight-byte names if necessary. In most cases, saving data in portable format is no longer necessary, since SPSS Statistics data files should be platform/operating system independent. You cannot save data files in portable file in Unicode mode. (http://publib.boulder.ibm.com/infocenter/spssstat/v20r0m0/index.jsp?topic=%2Fcom.ibm.spss.statistics.help%2Fsavedatatypes.htm)
 
**PRONOM has no entry for .por.
 
*.sav: IBM SPSS Statistics format.
 
**PRONOM has no entry for .sav.
 
*ascii (no file extension)
 
**Although the files may be ascii text, they are often intended to be read by e.g. statistical analysis programs and are not meaningful when opened in a standard text editor. For example:
 
 
 
[[File:datag.png|680px|thumb|center|]]
 
 
 
 
 
 
 
  
 +
*DDI is Data Documentation Initiative, a metadata specification for the social and behavioral sciences; see http://www.ddialliance.org/.
 +
*DDI is considered descriptive metadata (mdSec) in METS. From http://www.loc.gov/standards/mets/METSOverview.v2.html: "Valid values for the MDTYPE element [in mdSec] include MARC, MODS, EAD...DDI (Data Documentation Initiative), FGDC (Federal Geographic Data Committee Metadata Standard [FGDC-STD-001-1998]."
  
  
 
[[Category:Development documentation]]
 
[[Category:Development documentation]]

Revision as of 18:42, 2 January 2013

METS and DDI

  • DDI is Data Documentation Initiative, a metadata specification for the social and behavioral sciences; see http://www.ddialliance.org/.
  • DDI is considered descriptive metadata (mdSec) in METS. From http://www.loc.gov/standards/mets/METSOverview.v2.html: "Valid values for the MDTYPE element [in mdSec] include MARC, MODS, EAD...DDI (Data Documentation Initiative), FGDC (Federal Geographic Data Committee Metadata Standard [FGDC-STD-001-1998]."