PDF to PDF/A using Ghostscript

From Archivematica

Revision as of 15:11, 23 November 2010

== File 1 ==

*File used: "A checklist for documenting PREMIS-METS decisions in a PREMIS profile", May 2010, Sally Vermaaten, OCLC, http://www.loc.gov/standards/premis/premis_mets_checklist.pdf
*Converted with Ghostscript 8.71 using the following command: gs -dPDFA -dBATCH -dNOPAUSE -sDEVICE=pdfwrite -sOutputFile=premis_mets_checklist_PDFA.pdf premis_mets_checklist.pdf

{| border="1" cellpadding="10" cellspacing="0" width=90%
|- style="background-color:#cccccc;"
!style="width:20%"|'''Property'''
!style="width:20%"|'''Original'''
!style="width:20%"|'''Normalized'''
|-
|File size
|318,500 bytes
|974,071 bytes
|-
|PageCount
|14
|14
|-
|Fonts
|
*10 embedded subsets
*2 non-embedded subsets: Arial and Arial Italic
|10 embedded subsets. Arial replaced by Helvetica and Arial Italic replaced by Helvetica Oblique.
|-
|Features
|
*Forms: no
*Metadata stream: no
*Outline: no
*Threads: no
*Tagged: yes
*Page layout: single page
*Page mode: use none
|
*Forms: no
*Metadata stream: no
*Outline: no
*Threads: no
*Tagged: no
*Page layout: single page
*Page mode: use none
|}

== File 2 ==

*File used: "IFPI Digital Music Report 2010", http://www.ifpi.org/content/library/DMR2010.pdf
*Converted with Ghostscript 8.71 using the following command: gs -dPDFA -dBATCH -dNOPAUSE -sDEVICE=pdfwrite -sOutputFile=DMR2010_PDFA.pdf DMR2010.pdf
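
The conversions recorded on this page use the same Ghostscript flags and differ only in the file names, with _PDFA appended to the input name to form the output name. The following is a minimal shell sketch of that pattern. The function name pdfa_command is hypothetical, it assumes file names that end in .pdf and contain no spaces, and it only prints the command so it can be reviewed before being run. The flags are copied from this page as tested with Ghostscript 8.71; whether they produce a valid PDF/A file with other versions has not been checked here.

```shell
# Hypothetical helper (not part of Ghostscript): print the conversion
# command for one input PDF, using the flags recorded on this page.
pdfa_command() {
    input="$1"
    # Output name follows the pattern used on this page: <name>_PDFA.pdf
    output="${input%.pdf}_PDFA.pdf"
    printf 'gs -dPDFA -dBATCH -dNOPAUSE -sDEVICE=pdfwrite -sOutputFile=%s %s\n' \
        "$output" "$input"
}

# Print the commands for the two test files.
pdfa_command premis_mets_checklist.pdf
pdfa_command DMR2010.pdf
```

Printing rather than executing keeps the sketch usable on a machine without Ghostscript installed; piping the output to sh would run the conversions.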



{| border="1" cellpadding="10" cellspacing="0" width=90%
|- style="background-color:#cccccc;"
!style="width:20%"|'''Property'''
!style="width:20%"|'''Original'''
!style="width:20%"|'''Normalized'''
|-
|File size
|1,713,072 bytes
|5,337,321 bytes
|-
|PageCount
|32
|32
|-
|Fonts
|
*8 embedded subsets
|
*Substituted font Helvetica for Arial
*Substituted font Times-Roman for TimesNewRoman
*Substituted font Times-Italic for TimesNewRoman, Italic
*Substituted font Times-Bold for TimesNewRoman, Bold
|-
|Features
|
*Forms: no
*Metadata stream: no
*Outline: no
*Threads: no
*Tagged: yes
*Page layout: single page
*Page mode: use none
|
*Forms: no
*Metadata stream: no
*Outline: no
*Threads: no
*Tagged: no
*Page layout: single page
*Page mode: use none
|}
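
In both tests the normalized file is roughly three times the size of the original. The short shell sketch below works that ratio out from the byte counts in the File size rows. The function name size_ratio is hypothetical, and the byte counts are the ones reported on this page.

```shell
# Hypothetical helper: ratio of normalized size to original size,
# rounded to two decimal places.
size_ratio() {
    awk -v orig="$1" -v norm="$2" 'BEGIN { printf "%.2f\n", norm / orig }'
}

size_ratio 318500 974071     # File 1: prints 3.06
size_ratio 1713072 5337321   # File 2: prints 3.12
```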