Difference between revisions of "Large datasets"

From Archivematica
Jump to navigation Jump to search
Line 5: Line 5:
 
**The dataset could be broken into a parent AIP which acts as an Archival Information Collection, consisting entirely of a METS structMap listing all its child AIPs; each child AIP would have a link back to the parent AIP in its own structMap.
 
**The dataset could be broken into a parent AIP which acts as an Archival Information Collection, consisting entirely of a METS structMap listing all its child AIPs; each child AIP would have a link back to the parent AIP in its own structMap.
 
*The large single file could be broken into multiple segments, each in its own AIP. Video files could be delivered to end users in these segments, the way large video files are delivered on Youtube, for example.
 
*The large single file could be broken into multiple segments, each in its own AIP. Video files could be delivered to end users in these segments, the way large video files are delivered on Youtube, for example.
*Other types of large files might have to be merged back into one for delivery to a user.
+
**Other types of large files might have to be merged back into one for delivery to a user.
  
  
 
[[Category:Development documentation]]
 
[[Category:Development documentation]]

Revision as of 15:10, 8 February 2013

Main Page > Development > Development documentation > Large datasets

What happens when a body of materials to be ingested consists of thousands of files (eg a large social science research dataset), or when one file is extremely large (eg an HD video file)?

  • The large number of files could be broken up and distributed across multiple AIPs, with relationships between them expressed in the METS structMaps.
    • The dataset could be broken into a parent AIP which acts as an Archival Information Collection, consisting entirely of a METS structMap listing all its child AIPs; each child AIP would have a link back to the parent AIP in its own structMap.
  • The large single file could be broken into multiple segments, each in its own AIP. Video files could be delivered to end users in these segments, the way large video files are delivered on Youtube, for example.
    • Other types of large files might have to be merged back into one for delivery to a user.