Ingest (0.2)
Main Page > Documentation > Release 0.2 Documentation > Ingest (0.2)
AD1 Receive SIP[edit]
Archivematica UML Activity diagram AD1 Receive SIP
Expected Procedures
Case 1:Internal Producer
- [1.1] --> [1.3] --> [1.4] --> [1.6] --> [1.8] --> [end]
- Assumption - Submission agreements with internal producers will mandate that some type of checksum for validating the integrity of the SIP must be included in the SIP; therefore, this case should never have to invoke step 1.5 - Generate Checksum
Case 2: External Producer (may or may not include a checksum as part of the SIP)
- [1.1] --> [1.2] --> [1.4] --> [1.6] --> [1.8] --> [end] (SIP includes checksum); or,
- [1.1] --> [1.2] --> [1.5] --> [1.6] --> [1.8] --> [end] (SIP does not include checksum)
Exceptions
- (for both cases) Submitted SIP includes integrity checksums and fails integrity check at [1.6]
Step | Implementation | Notes |
---|---|---|
1.1 Receive notification of intent to submit SIP from Producer (UC-1.1) |
Receive e-mail notification from internal producer that scheduled records are ready for transfer; receive signed donor form or other agreement from external producer. Create an accession documentation folder. (E.g., /home/demo/accessions/2009_0001/). Save notification and any related records to this folder.
|
|
1.2 [External producer] Assign offline storage location for SIP (UC-1.1) |
Select a secure physical location in which to store the media (i.e. disks, hard drive or other).
|
|
1.3 [Internal producer] Assign network storage location for SIP |
Assign a network location where internal producers can transfer submissions
|
There seems to be a problem with the diagram, because this step needs to be carried out for external producers too. |
1.4 Receive SIP from Producer (UC-1.1) |
Internal producer copies the SIP to /home/demo/ingest/2009_0001/; for external producer, archivist copies the SIP to that folder.
|
|
1.5 Generate checksum |
|
The checksum reports need to be copied to the accessions folder because the SIP will later be broken up into AIPs and we need to save copies of the original reports (I guess). |
1.6 Check integrity of transferred SIP (UC-1.2) |
|
|
1.7 Request resubmission of SIP |
|
|
1.8 Send confirmation of receipt to Producer (UC-1.2) |
|
AD2 Audit SIP[edit]
Archivematica UML Activity diagram AD2 Audit SIP
Expected Procedure
- [2.1] --> [2.2] --> [2.4] --> [2.5] --> [2.8] --> [end]
Exceptions
- malware detected at step 2.3
- SIP not compliant at step 2.5
Step | Implementation | Notes |
---|---|---|
General procedures | ||
2.1 Copy SIP to quarantine |
|
|
2.2 Check SIP for malware |
[There is not currently any malware checking software included in the Archivimatica 0.2 release.]
|
Documentation should include:
|
2.3 Remove malware |
Attempt to remove malware from the SIP
Go to: 2.4 - Audit SIP for compliance |
Case: malware not removed
|
2.4 Audit SIP for compliance (UC-1.3; UC-4.6) | Manually verify that the SIP conforms to the archives' data formatting and documentation standards and meets the specifications of the Submission Agreement. Do this by skimming the filenames and extensions to make sure that what was supposed to be in the SIP according to the Submission Agreement is actually there.
Create audit documentation
Go to: 2.5 - Assess SIP deficiencies |
|
2.5 - Assess SIP deficiencies |
Based on the results of the audit performed in step 2.4 - Audit SIP for compliance, determine if the deficiencies identified, if any, warrant rejection of the SIP, or if the SIP can be accepted despite identified deficiencies. If the majority of objects conform to standards and Submission Agreement, the SIP may be considered acceptable. Note that the non-conforming objects can be deleted after appraisal (see AD3 step 3.9). Document the decision to accept or reject the SIP If SIP can be accepted for ingest,
If SIP can not be accepted for ingest, go to: 2.7 - Notify Producer of SIP rejection |
|
2.6 Notify Producer of SIP rejection |
|
|
2.7 Evaluate appeals |
Evaluate the Producer's appeal of the SIP rejection
|
|
2.8 Notify producer of SIP acceptance |
Send Producer notification that the SIP has been accepted for ingest Document Go to: 3.1 - Extract content information from SIP |
|
2.9 Destroy SIP copies | Delete SIP from /home/demo/ingest and /home/demo/quarantine/.
End ingest |
AD3 Accept SIP for Ingest[edit]
Archivematica UML Activity diagram AD3 Accept SIP for Ingest
Expected Procedure
[3.1] --> [3.2] --> [3.3] --> [3.4] --> [3.5] --> [3.6] --> [3.9] --> [3.10]
Exceptions
- Producer notification required following [3.6]
- Appraisal desision appealed following [3.7]
Step | Implementation | Notes |
---|---|---|
General procedures | ||
3.1 Unpack SIP (UC-1.3) |
Create folders for the SIP contents
Unpack the SIP
Go to: 3.2 - Modify/Provide additional PDI |
Is this still needed?: Use md5sum to generate a new checksum report for the extracted files. |
3.2 Modify / provide additional PDI (UC-1.3) |
Go to: 3.3 - Identify Formats |
|
3.3 Identify format | Use DROID and NLNZ Metadata extractor to identify objects in the SIP. Save the reports to /home/demo/accessions/2009_0001/.
Go to: 3.4 - Validate formats |
|
3.4 Validate format | Use JHOVE to validate the objects in the SIP. Save the report to /home/demo/accessions/2009_0001/.
Go to: 3.5 - Extract metadata |
|
3.5 Extract metadata |
Extract preservation metadata from content objects in the SIP Go to: - 3.6 Audit submission and select for preservation |
In archivimatica, the metadata extraction activity takes place at the same time as format identification, as the NLNZ Metadata extractor tool also performs format identification.
|
3.6 Audit submission and select for preservation (UC-4.6) |
Based on the results of steps 3.2, 3.3, and 3.4, apply Archives policies and determine which (if any) content objects in the SIP should not be included in the AIP Document which content objects will not be included and why.
|
Possible reasons for exclusion:
|
3.7 Notify Producer about appraisal decision(s) | Provide the Producer with copies of the appraisal report or other documentation as required by the submisssion agreement identifying SIP components to be destroyed.
If the Producer appeals the appraisal decision, go to: 3.8 - Evaluate appeals Else, go to: 3.9 - Destroy unselected SIP components |
|
3.8 Evaluate appeals |
Receive Producer's appeals Evaluate appeals Based on the evaluation, make any necessary amendments to the appraisal decision Notify the Producer of the outcome of the appeal process Go to: 3.9 - Destroy unselected SIP components |
|
3.9 - Destroy unselected SIP components |
Destroy all SIP content objects identified for destruction in the appraisal decision (or amended appraisal decision as appropriate). Document destruction |
|
3.10 Accept selected SIP components for ingest |
AD4 Generate AIP[edit]
Archivematica UML Activity diagram AD4 Generate AIP
Step | Implementation | Notes |
---|---|---|
4.1 Create AIP containers | In /home/demo/ingestprocessing/ create new folders entitled 2009_0001_01, 2009_0001_02 etc. |
|
4.2 Add Content Information to AIP | Copy the relevant contents of /home/demo/ingestprocessing/2009_0001/ to the folders created in step 4.1. In addition:
|
|
4.3 Transform Content Information | *In each AIP, create a folder entitled 2009_0001_XX_normalized. Using Xena, normalize the contents of each AIP (excluding the the checksum reports). Ensure that the destination folder for each normalization is this folder.
|
The Xena log file gets saved to the accession folder rather than to individual AIPs because Xena creates one log file for the entire process of normalizing multiple AIPs. |
4.4 Add Transformed Content Information to AIP | If the destination folder is correctly set, Xena automatically saves the normalized content to the AIP. | |
4.5 Add PDI to AIP | Create plain text reports containing provenance and other PDI elements (including arrangement information) and save them to the AIPs. | We haven't added all the other PDI (for example, the DROID, NLNZ and JHOVE reports) to the AIPs because we created the reports for the entire SIP, not for the AIPs. This is part of the problem of processing the SIP through validation software etc. before dividing it into AIPs. |
4.6 Generate Descriptive Information (UC-1.4) |
|
The current version of Qubit (1.0.8-dev) does not recognize certain formats and does not allow their selection in the upload screen. Hopefully this will be corrected in archivematica 0.3. |
AD5 Transfer AIP to Archival Storage[edit]
Archivematica UML Activity diagram AD5 Transfer AIP to Archival Storage
Step | Implementation | Notes |
---|---|---|
5.1 Request storage of AIP (UC-1.5) | ||
5.2 Transfer AIP to Archival storage (UC-1.5) | Copy AIPs to /home/ingest/archivalstorage/. Use md5sum to ensure that the copying was done without errors. | |
5.3 Confirm receipt and storage of AIP (UC-1.5) | ||
5.4 Add AIP storage location to descriptive information (UC-1.6) | In the physical storage area in Qubit, add the storage location. | |
5.5 Add Descriptive Information to Data Management (UC-1.6) | This was done in AD4, step 4.6. In OAIS, generating descriptive information and adding them to data management are two different steps; however, in Archivematica this is done in one step in Qubit, which is used to upload images to a web interface, generate derivatives for searching and browsing and record descriptive information. | |
5.6 Confirm update of Data Management | ||
5.7 Destroy SIP and AIP copies | Destroy copies in /home/demo/ingest/ and /home/demo/ingestprocessing/. |