Difference between revisions of "Email"

From Archivematica
Jump to navigation Jump to search
Line 13: Line 13:
 
*Options:
 
*Options:
 
** [http://www.qmail.org/man/man5/mbox.html mbox] ('''implemented in 0.7''')
 
** [http://www.qmail.org/man/man5/mbox.html mbox] ('''implemented in 0.7''')
** PDF (''implement in 0.7.1''?)
 
  
 
==Attachments==
 
==Attachments==
* These should be normalized according to the media type preservation plan for each attachment file format. Attachments must remain linked to email message (option 1: inferred by physical location in AIP; option 2: METS structMap)
+
* These should be normalized according to the media type preservation plan for each attachment file format. Attachments must remain linked to email message; in 0.7.1 this is achieved by attaching the number in the filename of the individual mbox file email to the filename of its attachment(s): eg:
 +
**''4.mbox'' (the message)
 +
**''4-report.gif'' (the attachment)
 +
**''4-report.tif'' (the normalized attachment)
  
 
==Normalization tool==
 
==Normalization tool==
Line 36: Line 38:
 
*The [http://www.pedalspreservation.org/Default.aspx PEDALS (Persistent Digital Archives and Library System) project] has produced an open-source [http://sourceforge.net/projects/pedalsemailextr/ email extractor] that converts .pst files to xml. However, this tool is designed for Windows only. Users would need to extract the email outside Archivematica and submit the extracted emails as the SIP. For more information, see Library of Congress News and Events at http://www.digitalpreservation.gov/news/2010/20100924news_article_pedals_email_tool.html.
 
*The [http://www.pedalspreservation.org/Default.aspx PEDALS (Persistent Digital Archives and Library System) project] has produced an open-source [http://sourceforge.net/projects/pedalsemailextr/ email extractor] that converts .pst files to xml. However, this tool is designed for Windows only. Users would need to extract the email outside Archivematica and submit the extracted emails as the SIP. For more information, see Library of Congress News and Events at http://www.digitalpreservation.gov/news/2010/20100924news_article_pedals_email_tool.html.
 
*[http://en.wikipedia.org/wiki/Mbox MBox] might be an acceptable preservation format for email. MBox files are aggregations of email messages converted to plain text.
 
*[http://en.wikipedia.org/wiki/Mbox MBox] might be an acceptable preservation format for email. MBox files are aggregations of email messages converted to plain text.
 +
**The Bodleian Libraries at the University of Oxford use MBOX as a preservation format for mailboxes. See http://www.dpconline.org/component/docman/doc_download/640-emailthomasjul2011.
 
*A detailed report on testing conversion of email from proprietary to open formats is available at http://www.significantproperties.org.uk/email-testingreport.html. The report includes information about testing conversions from .pst to Mbox using [http://alioth.debian.org/projects/libpst/ ReadPST].
 
*A detailed report on testing conversion of email from proprietary to open formats is available at http://www.significantproperties.org.uk/email-testingreport.html. The report includes information about testing conversions from .pst to Mbox using [http://alioth.debian.org/projects/libpst/ ReadPST].
  
  
 
__NOTOC__
 
__NOTOC__

Revision as of 15:28, 26 September 2011

Main Page > Documentation > Media type preservation plans > Email

Email preservation planning is currently under development. See also Email preservation.

Significant characteristics of email

Preservation Format

Access Format

  • Options:
    • mbox (implemented in 0.7)

Attachments

  • These should be normalized according to the media type preservation plan for each attachment file format. Attachments must remain linked to email message; in 0.7.1 this is achieved by attaching the number in the filename of the individual mbox file email to the filename of its attachment(s): eg:
    • 4.mbox (the message)
    • 4-report.gif (the attachment)
    • 4-report.tif (the normalized attachment)

Normalization tool

Conversion test results

Comments