Email preservation planning is currently under development. See also Email preservation.
Significant characteristics of email
Preservation Format
- Options:
- CERP Project E-Mail Account Schema
- mbox (implemented in 0.7)
Access Format
- Options:
- mbox (implemented in 0.7)
Attachments
- These should be normalized according to the media type preservation plan for each attachment file format. Attachments must remain linked to their email message. In 0.7.1 this is achieved by prefixing the filename of each attachment with the number used in the filename of the individual mbox file for its message, e.g.:
- 4.mbox (the message)
- 4-report.gif (the attachment)
- 4-report.tif (the normalized attachment)
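The naming convention above can be checked programmatically. The following is a minimal sketch, not part of Archivematica: the helper name `link_attachments` and the two filename patterns are assumptions inferred from the example filenames only.

```python
import re
from collections import defaultdict

# Assumed patterns, inferred from the example above (not an Archivematica API):
# a message is "<number>.mbox", an attachment is "<number>-<original name>".
MESSAGE_RE = re.compile(r"^(\d+)\.mbox$")
ATTACHMENT_RE = re.compile(r"^(\d+)-(.+)$")


def link_attachments(filenames):
    """Map each message filename to the sorted list of its attachment filenames."""
    messages = {}
    attachments = defaultdict(list)
    for name in filenames:
        message_match = MESSAGE_RE.match(name)
        if message_match:
            messages[message_match.group(1)] = name
            continue
        attachment_match = ATTACHMENT_RE.match(name)
        if attachment_match:
            attachments[attachment_match.group(1)].append(name)
    # Messages without attachments map to an empty list.
    return {msg: sorted(attachments.get(num, [])) for num, msg in messages.items()}
```

Under these assumptions, calling `link_attachments(["4.mbox", "4-report.gif", "4-report.tif"])` groups both the original and the normalized attachment under `4.mbox`.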
Normalization tool
- Options:
Conversion test results
Comments
- The PEDALS (Persistent Digital Archives and Library System) project has produced an open-source email extractor that converts .pst files to XML. However, this tool is designed for Windows only, so users would need to extract the email outside Archivematica and submit the extracted emails as the SIP. For more information, see Library of Congress News and Events at http://www.digitalpreservation.gov/news/2010/20100924news_article_pedals_email_tool.html.
- mbox might be an acceptable preservation format for email. mbox files are aggregations of email messages stored as plain text.
- The Bodleian Libraries at the University of Oxford use mbox as a preservation format for mailboxes. See http://www.dpconline.org/component/docman/doc_download/640-emailthomasjul2011.
- A detailed report on testing conversion of email from proprietary to open formats is available at http://www.significantproperties.org.uk/email-testingreport.html. The report includes information about testing conversions from .pst to mbox using ReadPST.
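Because an mbox file is a plain-text aggregation of messages, it can be inspected with general-purpose tools. As an illustration only, the sketch below uses Python's standard-library `mailbox` module to list the subject of each message. The function name `list_subjects` is hypothetical, and this is not how Archivematica itself processes mbox files.

```python
import mailbox


def list_subjects(mbox_path):
    """Return the Subject header of every message in an mbox file, in order."""
    # create=False makes a missing file an error instead of creating an empty mailbox.
    box = mailbox.mbox(mbox_path, create=False)
    try:
        # A message without a Subject header yields None.
        return [message["Subject"] for message in box]
    finally:
        box.close()
```

A check like this could help confirm that a conversion from a proprietary format produced the expected number of readable messages.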