Difference between revisions of "Large datasets"
Jump to navigation
Jump to search
(Created page with "- For instance, in the case of big data which Chuck just mentioned, an AIP bag should be open until its size reaches xxMB and then this bag closes and another bag opens up to ...") |
|||
Line 1: | Line 1: | ||
− | + | What happens when a body of materials to be ingested consists of thousands of files (eg a large social science research dataset), or when one file is extremely large (eg an HD video file)? | |
− | + | *The large number of files could be broken up and distributed across multiple AIPs, with relationships between them expressed in the METS structMaps. | |
+ | **The dataset could be broken into a parent AIP which acts as an Archival Information Collection, consisting entirely of a METS structMap listing all its child AIPs; each child AIP would have a link back to the parent AIP in its own structMap. | ||
+ | *The large single file could be broken into multiple segments, each in its own AIP. Video files could be delivered to end users in these segments, the way large video files are delivered on Youtube, for example. | ||
+ | *Other types of large files might have to be merged back into one for delivery to a user. | ||
+ | |||
+ | |||
[[Category:Development documentation]] | [[Category:Development documentation]] |
Revision as of 14:07, 8 February 2013
What happens when a body of materials to be ingested consists of thousands of files (eg a large social science research dataset), or when one file is extremely large (eg an HD video file)?
- The large number of files could be broken up and distributed across multiple AIPs, with relationships between them expressed in the METS structMaps.
- The dataset could be broken into a parent AIP which acts as an Archival Information Collection, consisting entirely of a METS structMap listing all its child AIPs; each child AIP would have a link back to the parent AIP in its own structMap.
- The large single file could be broken into multiple segments, each in its own AIP. Video files could be delivered to end users in these segments, the way large video files are delivered on Youtube, for example.
- Other types of large files might have to be merged back into one for delivery to a user.