Difference between revisions of "Metadata elements"

From Archivematica
 
(38 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 
[[Main Page]] > [[Development]] > [[:Category:Development documentation|Development documentation]] > Metadata elements
 
<div style="padding: 10px 10px; border: 1px solid black; background-color: #F79086;">This page is no longer being maintained and may contain inaccurate information. Please see the [https://www.archivematica.org/docs/latest/ Archivematica documentation] for up-to-date information.</div>
  
 
This page identifies a minimum set of metadata elements designed to ensure authenticity and interoperability of preserved objects and to facilitate their retrieval.
 
  
*[[METS]]
*[[PREMIS]]
*[[PREMIS metadata: original files]]
*[[PREMIS metadata: normalized files]]
*[[PREMIS metadata: events]]
*[[PREMIS metadata: rights - 0.10]]
*[[PREMIS/METS for scalability]]
*[[RDF/OWL]]

==Design process==

This process involves:

# Using the InterPARES Chain of Preservation (CoP) model and the CoP/PREMIS crosswalk to identify required elements for objects preserved by Archivematica
# Analyzing existing metadata in the Archivematica AIP log files and METS.xml file in order to map them to METS and PREMIS elements (see [[Existing elements]])
# Comparing the results of steps 1 and 2 to determine what gaps exist in Archivematica
# Filling in the gaps, e.g. by modifying the workflow to produce and/or capture missing elements
# Structuring the required elements into the [http://wiki.fcla.edu:8000/TIPR/21 Repository eXchange Package (RXP) specification]
# Determining what metadata belongs in the DIP(s)
 
 
 
 
 
<br>
 
 
 
==Proposed PREMIS metadata for original file==
 
 
 
This table is a template for metadata elements for the original file. Please note the following:
 
*The ''significantProperties'' semantic unit would be repeated as needed to capture all the significant property data produced by FITS
 
*The ''format'' semantic unit would be repeated as needed if FITS identified several possible formats for the file
 
*For most files, the ''relationship'' semantic unit would be used twice: once to link the preservation copy and once to link the access copy
 
*The ''eventOutcomeDetail'' semantic unit would be repeated as needed to capture detailed information generated by an event
 
*This table includes one event as an example (normalization); a real PREMIS file would contain information about numerous events (see [[Metadata_elements#Event_metadata|Event metadata]], below)
 
*This table includes two agent entities: an organization (City of Vancouver Archives) and a software program (Archivematica). The organization is the agent for manual events such as reviewing the SIP, while Archivematica is the agent for automated events such as normalization. Further agents may be included (such as individuals or workstations), but the two agents specified in this table are the minimum.
 
 
 
<br>
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:10%"|'''PREMIS entity'''
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|object
 
|objectIdentifier
 
|objectIdentifierType
 
|UUID
 
|mandatory unit and component
 
|-
 
|object
 
|objectIdentifier
 
|objectIdentifierValue
 
|0db50321-6d7b-4291-89ec-a8b0adc1ff96
 
|mandatory unit and component
 
|-
 
|object
 
|objectCategory
 
|none
 
|file
 
|mandatory unit and component
 
|-
 
|object
 
|objectCharacteristics
 
|compositionLevel
 
|0
 
|mandatory unit and component
 
|-
 
|object
 
|objectCharacteristics/fixity
 
|messageDigestAlgorithm
 
|MD5
 
|
 
|-
 
|object
 
|objectCharacteristics/fixity
 
|messageDigest
 
|e479688508922354bdab09bca60d8d0e
 
|
 
|-
 
|object
 
|objectCharacteristics/fixity
 
|messageDigestOriginator
 
|City of Vancouver Archives
 
|
 
|-
 
|object
 
|objectCharacteristics
 
|size
 
|787510
 
|
 
|-
 
|object
 
|objectCharacteristics/format/formatDesignation
 
|formatName
 
|Windows Bitmap
 
|format is a mandatory unit; must use either formatDesignation or formatRegistry
 
|-
 
|object
 
|objectCharacteristics/format/formatDesignation
 
|formatVersion
 
|3.0
 
|format is a mandatory unit; must use either formatDesignation or formatRegistry
 
|-
 
|object
 
|objectCharacteristics/format/formatRegistry
 
|formatRegistryName
 
|PRONOM
 
|format is a mandatory unit; must use either formatDesignation or formatRegistry
 
|-
 
|object
 
|objectCharacteristics/format/formatRegistry
 
|formatRegistryKey
 
|fmt/116
 
|format is a mandatory unit; must use either formatDesignation or formatRegistry
 
|-
 
|object
 
|objectCharacteristics
 
|objectCharacteristicsExtension
 
|
 
<fits xsi:schemaLocation="http://hul.harvard.edu/ois/xml/ns/fits/fits_output http://hul.harvard.edu/ois/xml/xsd/fits/fits_output.xsd" version="0.3.2" timestamp="8/10/10 7:28 PM"> + selected FITS output
 
|objectCharacteristicsExtension is used for additional object characteristics not covered by PREMIS, for instance format specific metadata that is defined externally.
 
|-
 
|object
 
|originalName
 
|none
 
|/SAE Project files/newsletters/20100223/cover image.bmp
 
|
 
|-
 
|object
 
|relationship
 
|relationshipType
 
|derivation
 
|
 
|-
 
|object
 
|relationship
 
|relationshipSubType
 
|is source of
 
|
 
|-
 
|object
 
|relationship/relatedObjectIdentification
 
|relatedObjectIdentifierType
 
|UUID
 
|mandatory unit and component if there is a related object
 
|-
 
|object
 
|relationship/relatedObjectIdentification
 
|relatedObjectIdentifierValue
 
|270bd067-0483-4c5f-bdec-f2cbd6e651aa
 
|mandatory unit and component if there is a related object
 
|-
 
|object
 
|relationship/relatedEventIdentification
 
|relatedEventIdentifierType
 
|Archivematica ID
 
|"For derivative relationships between objects relatedEventIdentification must be recorded."
 
|-
 
|object
 
|relationship/relatedEventIdentification
 
|relatedEventIdentifierValue
 
|[alphanumeric code]
 
|"For derivative relationships between objects relatedEventIdentification must be recorded."
 
|-
 
|event
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|mandatory unit and component
 
|-
 
|event
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|mandatory unit and component
 
|-
 
|event
 
|eventType
 
|none
 
|normalization
 
|mandatory unit and component
 
|-
 
|event
 
|eventDateTime
 
|none
 
|2009-12-01T09:09:00-02:00
 
|mandatory unit and component
 
|-
 
|event
 
|eventDetail
 
|none
 
|program="ImageMagick"; version="6.6.4.0"; command="%convertPath% %fileFullName% +compress %preservationFileDirectory%%fileTitle%.%preservationFormat%"
 
|This element can be used to record information about software used and eliminates the need to have agent entities for software programs
 
|-
 
|event
 
|eventOutcomeInformation
 
|eventOutcome
 
|{Normalized; Not normalized}
 
|
 
|-
 
|event
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|
 
*Normalization failed
 
*Already in preservation format. No need to normalize
 
|
 
|-
 
|event
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|preservation system
 
|used to link an agent to an event; not mandatory but recommended
 
|-
 
|event
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|used to link an agent to an event; not mandatory but recommended
 
|-
 
|agent
 
|agentIdentifier
 
|agentIdentifierType
 
|repository code
 
|mandatory unit and component
 
|-
 
|agent
 
|agentIdentifier
 
|agentIdentifierValue
 
|CVA
 
|mandatory unit and component
 
|-
 
|agent
 
|agentName
 
|none
 
|City of Vancouver Archives
 
|
 
|-
 
|agent
 
|agentType
 
|none
 
|organization
 
|
 
|-
 
|agent
 
|agentIdentifier
 
|agentIdentifierType
 
|preservation system
 
|mandatory unit and component
 
|-
 
|agent
 
|agentIdentifier
 
|agentIdentifierValue
 
|Archivematica-0.6
 
|mandatory unit and component
 
|-
 
|agent
 
|agentName
 
|none
 
|Archivematica
 
|
 
|-
 
|agent
 
|agentType
 
|none
 
|software
 
|
 
|-
 
|}
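As a rough illustration of how the object rows in the table above nest, the following Python sketch builds a simplified object element with the standard library's <code>xml.etree.ElementTree</code>. The helper names (<code>add_child</code>, <code>build_object</code>) are hypothetical, and the PREMIS namespace and any schema-mandated element ordering are deliberately omitted, so the output should not be read as schema-valid PREMIS.

```python
import xml.etree.ElementTree as ET


def add_child(parent, tag, text=None):
    """Append a child element (hypothetical helper) and return it."""
    child = ET.SubElement(parent, tag)
    if text is not None:
        child.text = text
    return child


def build_object(uuid, md5, size, format_name, format_version,
                 registry_key, original_name):
    """Build a simplified, un-namespaced object element from table values."""
    obj = ET.Element("object")

    ident = add_child(obj, "objectIdentifier")
    add_child(ident, "objectIdentifierType", "UUID")
    add_child(ident, "objectIdentifierValue", uuid)

    add_child(obj, "objectCategory", "file")

    chars = add_child(obj, "objectCharacteristics")
    add_child(chars, "compositionLevel", "0")

    fixity = add_child(chars, "fixity")
    add_child(fixity, "messageDigestAlgorithm", "MD5")
    add_child(fixity, "messageDigest", md5)

    add_child(chars, "size", str(size))

    fmt = add_child(chars, "format")
    designation = add_child(fmt, "formatDesignation")
    add_child(designation, "formatName", format_name)
    add_child(designation, "formatVersion", format_version)
    registry = add_child(fmt, "formatRegistry")
    add_child(registry, "formatRegistryName", "PRONOM")
    add_child(registry, "formatRegistryKey", registry_key)

    add_child(obj, "originalName", original_name)
    return obj


# Sample values copied from the table above.
original = build_object(
    uuid="0db50321-6d7b-4291-89ec-a8b0adc1ff96",
    md5="e479688508922354bdab09bca60d8d0e",
    size=787510,
    format_name="Windows Bitmap",
    format_version="3.0",
    registry_key="fmt/116",
    original_name="/SAE Project files/newsletters/20100223/cover image.bmp",
)
print(ET.tostring(original, encoding="unicode"))
```

Repeatable units such as ''format'' or ''significantProperties'' could be handled by calling the relevant part of the builder once per value.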
 
 
 
<br>
 
 
 
==Proposed PREMIS metadata for normalized file (preservation copy)==
 
Unlike the table above, this table shows all the metadata elements that should appear for a normalized file. The two events recorded are creation and checksum generation.
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:10%"|'''PREMIS entity'''
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|object
 
|objectIdentifier
 
|objectIdentifierType
 
|UUID
 
|mandatory unit and component
 
|-
 
|object
 
|objectIdentifier
 
|objectIdentifierValue
 
|270bd067-0483-4c5f-bdec-f2cbd6e651aa
 
|mandatory unit and component
 
|-
 
|object
 
|objectCategory
 
|none
 
|file
 
|mandatory unit and component
 
|-
 
|object
 
|objectCharacteristics
 
|compositionLevel
 
|0
 
|mandatory unit and component
 
|-
 
|object
 
|objectCharacteristics/fixity
 
|messageDigestAlgorithm
 
|MD5
 
|
 
|-
 
|object
 
|objectCharacteristics/fixity
 
|messageDigest
 
|e479688508922354bdab09bca60d8d0e
 
|
 
|-
 
|object
 
|objectCharacteristics/fixity
 
|messageDigestOriginator
 
|City of Vancouver Archives
 
|
 
|-
 
|object
 
|objectCharacteristics/format/formatDesignation
 
|formatName
 
|Tagged Image File Format
 
|format is a mandatory unit; must use either formatDesignation or formatRegistry
 
|-
 
|object
 
|objectCharacteristics/format/formatDesignation
 
|formatVersion
 
|6.0
 
|format is a mandatory unit; must use either formatDesignation or formatRegistry
 
|-
 
|object
 
|objectCharacteristics/format/formatRegistry
 
|formatRegistryName
 
|PRONOM
 
|format is a mandatory unit; must use either formatDesignation or formatRegistry
 
|-
 
|object
 
|objectCharacteristics/format/formatRegistry
 
|formatRegistryKey
 
|fmt/10
 
|format is a mandatory unit; must use either formatDesignation or formatRegistry
 
|-
 
|object
 
|relationship
 
|relationshipType
 
|derivation
 
|
 
|-
 
|object
 
|relationship
 
|relationshipSubType
 
|has source
 
|
 
|-
 
|object
 
|relationship/relatedObjectIdentification
 
|relatedObjectIdentifierType
 
|UUID
 
|
 
|-
 
|object
 
|relationship/relatedObjectIdentification
 
|relatedObjectIdentifierValue
 
|0db50321-6d7b-4291-89ec-a8b0adc1ff96
 
|
 
|-
 
|object
 
|relationship/relatedEventIdentification
 
|relatedEventIdentifierType
 
|Archivematica ID
 
|"For derivative relationships between objects relatedEventIdentification must be recorded."
 
|-
 
|object
 
|relationship/relatedEventIdentification
 
|relatedEventIdentifierValue
 
|[alphanumeric code]
 
|"For derivative relationships between objects relatedEventIdentification must be recorded."
 
|-
 
|event
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|mandatory unit and component
 
|-
 
|event
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|mandatory unit and component
 
|-
 
|event
 
|eventType
 
|none
 
|creation
 
|mandatory unit and component
 
|-
 
|event
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:44-03:00
 
|mandatory unit and component
 
|-
 
|event
 
|eventDetail
 
|none
 
|program="ImageMagick"; version="6.6.4.0"; command="%convertPath% %fileFullName% +compress %preservationFileDirectory%%fileTitle%.%preservationFormat%"
 
|
 
|-
 
|event
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|preservation system
 
|used to link an agent to an event; not mandatory but recommended
 
|-
 
|event
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|used to link an agent to an event; not mandatory but recommended
 
|-
 
|event
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|mandatory unit and component
 
|-
 
|event
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|mandatory unit and component
 
|-
 
|event
 
|eventType
 
|none
 
|message digest calculation
 
|mandatory unit and component
 
|-
 
|event
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|mandatory unit and component
 
|-
 
|event
 
|eventDetail
 
|none
 
|program="md5deep"; version="3.6"
 
|
 
|-
 
|event
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|preservation system
 
|used to link an agent to an event; not mandatory but recommended
 
|-
 
|event
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|used to link an agent to an event; not mandatory but recommended
 
|-
 
|agent
 
|agentIdentifier
 
|agentIdentifierType
 
|preservation system
 
|mandatory unit and component
 
|-
 
|agent
 
|agentIdentifier
 
|agentIdentifierValue
 
|Archivematica-0.6
 
|mandatory unit and component
 
|-
 
|agent
 
|agentName
 
|none
 
|Archivematica
 
|
 
|-
 
|agent
 
|agentType
 
|none
 
|software
 
|
 
|-
 
|}
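The original file and its preservation copy point at each other through mirrored ''relationship'' units ("is source of" on the original, "has source" on the normalized file). A minimal Python sketch of that pairing follows; the function name <code>derivation_pair</code> and the event identifier <code>event-0001</code> are hypothetical placeholders, not values produced by Archivematica.

```python
def derivation_pair(source_uuid, derivative_uuid, event_id):
    """Return the two mirrored relationship records for a derivation.

    The first record belongs on the source object, the second on the
    derivative. Both reference the same (normalization) event.
    """
    def relationship(sub_type, related_uuid):
        return {
            "relationshipType": "derivation",
            "relationshipSubType": sub_type,
            "relatedObjectIdentifierType": "UUID",
            "relatedObjectIdentifierValue": related_uuid,
            "relatedEventIdentifierType": "Archivematica ID",
            "relatedEventIdentifierValue": event_id,
        }

    return (
        relationship("is source of", derivative_uuid),
        relationship("has source", source_uuid),
    )


# UUIDs are the sample values from the two tables above;
# "event-0001" is a made-up event identifier.
ORIGINAL_UUID = "0db50321-6d7b-4291-89ec-a8b0adc1ff96"
NORMALIZED_UUID = "270bd067-0483-4c5f-bdec-f2cbd6e651aa"

original_rel, normalized_rel = derivation_pair(
    ORIGINAL_UUID, NORMALIZED_UUID, "event-0001"
)
print(original_rel["relationshipSubType"], "->",
      original_rel["relatedObjectIdentifierValue"])
print(normalized_rel["relationshipSubType"], "->",
      normalized_rel["relatedObjectIdentifierValue"])
```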
 
 
 
<br>
 
 
 
==Event metadata==
 
 
 
===Receive SIP (SIP gets placed in 1-receiveSIP)===
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|ingestion
 
|
 
|-
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|
 
|-
 
|eventDetail
 
|none
 
|
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository code
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|CVA
 
|
 
|-
 
|}
 
 
 
<br>
 
 
 
===Check checksums===
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|fixity check
 
|
 
|-
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|
 
|-
 
|eventDetail
 
|none
 
|program="md5deep"; version="3.6"
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|{pass; fail}
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository system
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|
 
|-
 
|}
 
 
 
<br>
 
 
 
===Generate checksums===
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|message digest calculation
 
|
 
|-
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|
 
|-
 
|eventDetail
 
|none
 
|program="md5deep"; version="3.6"
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|e479688508922354bdab09bca60d8d0e
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository system
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|
 
|-
 
|}
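The two checksum events above (message digest calculation and fixity check) can be sketched in Python as follows. This uses the standard library's <code>hashlib</code> as a stand-in for md5deep, operates on in-memory bytes rather than files, and the function names are hypothetical; it only illustrates how the digest and the pass/fail outcome could populate the event fields.

```python
import hashlib


def md5_hex(data: bytes) -> str:
    """Return the MD5 digest of the data as lowercase hex."""
    return hashlib.md5(data).hexdigest()


def digest_calculation_event(data: bytes) -> dict:
    """Record the computed digest in eventOutcomeDetailNote."""
    return {
        "eventType": "message digest calculation",
        "eventOutcomeDetailNote": md5_hex(data),
    }


def fixity_check_event(data: bytes, expected_digest: str) -> dict:
    """Compare a fresh digest with the stored one and record pass/fail."""
    matches = md5_hex(data) == expected_digest.lower()
    return {
        "eventType": "fixity check",
        "eventOutcome": "pass" if matches else "fail",
    }


# Illustrative content only; a real workflow would read the file from disk.
content = b""
generated = digest_calculation_event(content)
checked = fixity_check_event(content, generated["eventOutcomeDetailNote"])
tampered = fixity_check_event(b"changed", generated["eventOutcomeDetailNote"])
print(generated["eventOutcomeDetailNote"], checked["eventOutcome"],
      tampered["eventOutcome"])
```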
 
 
 
<br>
 
 
 
===Review SIP===
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|SIP review
 
|
 
|-
 
|eventDateTime
 
|none
 
|[date - may not be automatically generated]
 
|
 
|-
 
|eventDetail
 
|none
 
|[free text field - could include information about the Submission Information Agreement against which the SIP was checked, etc.]
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|{pass; conditional pass}
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|
 
*some files missing
 
*appraisal required
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository code
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|CVA
 
|
 
|-
 
|}
 
 
 
<br>
 
 
 
===Place SIP in quarantine===
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|start quarantine
 
|
 
|-
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|
 
|-
 
|eventDetail
 
|none
 
|
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository code
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|CVA
 
|
 
|-
 
|}
 
 
 
<br>
 
 
 
===Remove SIP from quarantine===
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|end quarantine
 
|
 
|-
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|
 
|-
 
|eventDetail
 
|none
 
|
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository system
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|
 
|-
 
|}
 
 
 
<br>
 
 
 
===Unpack zipped files===
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|unpack
 
|
 
|-
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|
 
|-
 
|eventDetail
 
|none
 
|program="easyextract"; version="0.1.0"
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|unpacked Newsletter.zip
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository system
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|
 
|-
 
|}
 
 
 
<br>
 
 
 
===Assign UUIDs===
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|create UUID
 
|
 
|-
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|
 
|-
 
|eventDetail
 
|none
 
|program="UUID"; version="1.6.2"
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|270bd067-0483-4c5f-bdec-f2cbd6e651aa
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository system
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|
 
|-
 
|}
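A minimal sketch of the UUID assignment event, assuming Python's standard <code>uuid</code> module in place of the UUID program named in the table. The function name <code>create_uuid_event</code> is hypothetical, and the timestamp is generated in UTC purely for illustration.

```python
import uuid
from datetime import datetime, timezone


def create_uuid_event() -> dict:
    """Generate a random (version 4) UUID and describe the event."""
    new_id = uuid.uuid4()
    return {
        "eventType": "create UUID",
        # ISO 8601 timestamp with an explicit UTC offset.
        "eventDateTime": datetime.now(timezone.utc).isoformat(timespec="seconds"),
        # The assigned UUID is recorded as the outcome detail.
        "eventOutcomeDetailNote": str(new_id),
    }


event = create_uuid_event()
print(event["eventDateTime"], event["eventOutcomeDetailNote"])
```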
 
 
 
<br>
 
 
 
===Remove prohibited characters===
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|filename cleanup
 
|
 
|-
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|
 
|-
 
|eventDetail
 
|none
 
|program="detox"; version="1.2.0-1"
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|Original name="cover image.bmp"; cleaned up name="cover_image.bmp"
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository system
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|
 
|-
 
|}
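The ''eventOutcomeDetailNote'' for filename cleanup records both the original and the cleaned-up name. The Python sketch below shows one way such a note could be assembled. The replacement rule (anything outside letters, digits, dot, underscore and hyphen becomes an underscore) is an assumption made for this example and is not a description of how detox actually works; the function names are hypothetical.

```python
import re

# Assumed rule for illustration: keep ASCII letters, digits, ".", "_" and "-".
_DISALLOWED = re.compile(r"[^A-Za-z0-9._-]")


def clean_name(name: str) -> str:
    """Replace each disallowed character with an underscore."""
    return _DISALLOWED.sub("_", name)


def cleanup_note(original: str) -> str:
    """Format the outcome note in the style used in the table above."""
    return f'Original name="{original}"; cleaned up name="{clean_name(original)}"'


note = cleanup_note("cover image.bmp")
print(note)
```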
 
 
 
<br>
 
 
 
===Scan for viruses===
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|virus check
 
|
 
|-
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|
 
|-
 
|eventDetail
 
|none
 
|program="ClamAV"; version="0.95.2"
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|pass
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository system
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|
 
|-
 
|}
 
 
 
<br>
 
 
 
===Identify format===
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|format identification
 
|
 
|-
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|
 
|-
 
|eventDetail
 
|none
 
|program="File Information Tool Set"; version="0.2.6"
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|{positive; tentative; unidentified}
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|fmt/116
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository system
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|
 
|-
 
|}
 
 
 
<br>
 
 
 
===Validate format===
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|validation
 
|
 
|-
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|
 
|-
 
|eventDetail
 
|none
 
|program="File Information Tool Set"; version="0.2.6"
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|{pass; partial pass; fail}
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|format="Windows Bitmap"; version="3.0"; result="Well-formed and valid"
 
|
 
*It is important to include the format and version against which JHOVE is validating the file; otherwise it can misidentify or fail to identify the format and give a false positive for validation (for example, when it identifies the format as "bytestream" and then declares the file "Well-formed and valid").
 
*This semantic unit can be repeated if there is a specific error message relating to failed validation.
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository system
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|
 
|-
 
|}
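The false-positive risk described in the notes above can be guarded against by comparing the format the validator actually used with the format that was expected. The Python sketch below is one hedged way to do this. The function name is hypothetical, and the mapping of a "well-formed but not valid" status to "partial pass" is an assumption, since the table does not define when each outcome applies.

```python
def validation_outcome(expected_format, expected_version,
                       reported_format, reported_version, status):
    """Derive eventOutcome and a detail note from a validator's report.

    A result only counts if the validator checked the file against the
    expected format and version; otherwise the outcome is "fail".
    """
    note = (f'format="{reported_format}"; version="{reported_version}"; '
            f'result="{status}"')
    same_format = (reported_format.lower() == expected_format.lower()
                   and reported_version == expected_version)
    normalized = status.lower()

    if not same_format:
        outcome = "fail"          # e.g. validated as "bytestream"
    elif normalized == "well-formed and valid":
        outcome = "pass"
    elif normalized.startswith("well-formed"):
        outcome = "partial pass"  # assumed mapping, see lead-in
    else:
        outcome = "fail"
    return {"eventOutcome": outcome, "eventOutcomeDetailNote": note}


good = validation_outcome("Windows Bitmap", "3.0",
                          "Windows Bitmap", "3.0", "Well-formed and valid")
false_positive = validation_outcome("Windows Bitmap", "3.0",
                                    "bytestream", "", "Well-formed and valid")
print(good["eventOutcome"], false_positive["eventOutcome"])
```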
 
 
 
<br>
 
 
 
===Normalize to preservation format===
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|normalization
 
|
 
|-
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|
 
|-
 
|eventDetail
 
|none
 
|program="ImageMagick"; version="6.6.4.0"; command="%convertPath% %fileFullName% +compress %preservationFileDirectory%%fileTitle%.%preservationFormat%"
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|{Normalized; Not normalized}
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|
 
*Normalization failed
 
*Already in preservation format. No need to normalize
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository system
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|
 
|-
 
|}
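The ''eventDetail'' values on this page follow a <code>key="value"; key="value"</code> pattern. A small Python sketch for building and reading such strings is given below; both function names are hypothetical, and the parser assumes that values never contain a double quote.

```python
import re

# Matches one key="value" pair; values must not contain double quotes.
_PAIR = re.compile(r'(\w+)="([^"]*)"')


def build_event_detail(**fields) -> str:
    """Join fields as key="value" pairs separated by "; "."""
    return "; ".join(f'{key}="{value}"' for key, value in fields.items())


def parse_event_detail(detail: str) -> dict:
    """Split an eventDetail string back into a dictionary."""
    return dict(_PAIR.findall(detail))


detail = build_event_detail(
    program="ImageMagick",
    version="6.6.4.0",
    command="%convertPath% %fileFullName% +compress "
            "%preservationFileDirectory%%fileTitle%.%preservationFormat%",
)
parsed = parse_event_detail(detail)
print(detail)
print(parsed["program"], parsed["version"])
```

Keeping the string machine-readable in this way means software details can be recovered from the event itself, which fits the note above that ''eventDetail'' can stand in for separate agent entities for software programs.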
 
 
 
<br>
 
 
 
===Generate access copy===
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|access copy generation
 
|
 
|-
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|
 
|-
 
|eventDetail
 
|none
 
|program="ImageMagick"; version="6.6.4.0"; command="%convertPath% %fileFullName% %accessFileDirectory%%fileTitle%.%accessFormat%"
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|{Access copy generated; access copy not generated}
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|
 
*Normalization failed
 
*Already in access format. No need to normalize
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository system
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|
 
|-
 
|}
 
 
 
<br>
 
 
 
===Create file===
 
This event is recorded only for preservation and access copies, not for original files.
 
 
 
{| border="1" cellpadding="10" cellspacing="0" width=90%
 
|-
 
|- style="background-color:#cccccc;"
 
!style="width:20%"|'''Semantic unit'''
 
!style="width:20%"|'''Semantic component'''
 
!style="width:20%"|'''Sample value(s)'''
 
!style="width:20%"|'''Notes'''
 
|-
 
|eventIdentifier
 
|eventIdentifierType
 
|Archivematica ID
 
|
 
|-
 
|eventIdentifier
 
|eventIdentifierValue
 
|[alphanumeric code]
 
|
 
|-
 
|eventType
 
|none
 
|creation
 
|
 
|-
 
|eventDateTime
 
|none
 
|2010-08-01T09:08:46-01:00
 
|
 
|-
 
|eventDetail
 
|none
 
|program="ImageMagick"; version="6.6.4.0"; command="%convertPath% %fileFullName% +compress %preservationFileDirectory%%fileTitle%.%preservationFormat%"
 
|
 
|-
 
|eventOutcomeInformation
 
|eventOutcome
 
|
 
|
 
|-
 
|eventOutcomeDetail
 
|eventOutcomeDetailNote
 
|
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierType
 
|repository system
 
|
 
|-
 
|linkingAgentIdentifier
 
|linkingAgentIdentifierValue
 
|Archivematica-0.6
 
|
 
|-
 
|}
 
 
 
  
 
[[Category:Development documentation]]
 

Latest revision as of 15:34, 11 February 2020
