Difference between revisions of "AIP structure"

From Archivematica
Jump to navigation Jump to search
Line 61: Line 61:
 
</div>
 
</div>
  
*Objects: /data/objects contains original objects, normalized objects, /metadata and /submissionDocumentation. If there were any lower level directories within the SIP, that directory structure is maintained. (See '''Figure 5''' ) /metadata contains /transfers, which contains
+
*Objects: /data/objects contains original objects, normalized objects, /metadata and /submissionDocumentation. If there were any lower level directories within the SIP, that directory structure is maintained. (See '''Figure 5''' ) **/metadata contains /transfers, which contains
/submisionDocumentation contains  
+
**/submisionDocumentation contains  
  
 
[[Image:DataObjects-10.png|600px|thumb|'''figure 5'''  Objects folder content in Data]]
 
[[Image:DataObjects-10.png|600px|thumb|'''figure 5'''  Objects folder content in Data]]

Revision as of 14:15, 1 May 2013

Main Page > Development > Development documentation > AIP structure

This page documents the structure of the AIP produced by Archivematica 0.10-beta.

Name

The AIP name is composed of the following:

  1. Either the name of the original transfer if no new name has been assigned to the SIP upon formation or the name of the SIP or SIPs created from the transfer and
  2. a UUID assigned during SIP formation

example: Pictures_of_my_cat-aebbfc44-9f2e-4351-bcfb-bb80d4914112

"Pictures_of_my_cat" is the name assigned by the user and "aebbfc44-9f2e-4351-bcfb-bb80d4914112" is the UUID generated during SIP formation.

Directory Structure

Figure 1 AIP directory - top level
  • The AIP is zipped in the AIPsStore. The AIP directories are broken down into UUID quad directories* for efficient storage and retrieval. (*UUID quad directories: Some file systems limit the number of items allowed in a directory, Archivematica uses a directory tree structure to store AIPs. The tree is based on the AIP UUIDs. The UUID is broken down into manageable 4 character pieces, or "UUID quads", each quad representing a directory. The first four characters (UUID quad) of the AIP UUID will compose a sub directory of the AIP storage. The second UUID quad will be the name of a sub directory of the first, and so on and so forth, until the last four characters (last UUID Quad) create the leaf of the AIP store directory tree, and the AIP with that UUID resides in that directory.)(figure 1)

BagIt documentation

  • The AIP is packaged in accordance with the Library of Congress Bagit specification (PDF, 84KB) In figure 2, the BagIt files are bag-info.txt, bagit.txt, manifest-sha512.txt and tagmanifest-md5.txt.
Figure 2
  • The following describes the contents of the AIP once extracted.

Data

Figure 3 AIP data directory

The data directory consists of the METS file for the AIP and three folders: logs, objects. and thumbnails. (See figure 3)


  • Logs: /data/logs contains the /transfers directory, normalization log, malware scan log, and the extraction log (from unpackaging packages) generated during SIP creation. (See figure 4)
    • The /transfers directory contains
Figure 4 Logs folder content in Data
  • Objects: /data/objects contains original objects, normalized objects, /metadata and /submissionDocumentation. If there were any lower level directories within the SIP, that directory structure is maintained. (See Figure 5 ) **/metadata contains /transfers, which contains
    • /submisionDocumentation contains
figure 5 Objects folder content in Data
  • Thumbnails: /data/thumbnails contains any thumbnails generated for use in the access system.