<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.archivematica.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Misty+De+Meo</id>
	<title>Archivematica - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.archivematica.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Misty+De+Meo"/>
	<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/Special:Contributions/Misty_De_Meo"/>
	<updated>2026-04-30T10:32:18Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.35.4</generator>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=Virtual_appliance_instructions&amp;diff=3859</id>
		<title>Virtual appliance instructions</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=Virtual_appliance_instructions&amp;diff=3859"/>
		<updated>2011-04-21T16:55:30Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: Sun --&amp;gt; Oracle; product name has changed&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Main Page]] &amp;gt; Virtual appliance instructions&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Archivematica is distributed as a [http://en.wikipedia.org/wiki/Virtual_appliance virtual appliance] which integrates a number of software tools into one common virtual machine environment. &lt;br /&gt;
&lt;br /&gt;
This allows it to be run on almost any workstation or server hardware without compromising the host machine's operating system or application software. At the same time, the virtual appliance is able to interact with any number of networked and/or external storage devices to allow for the flexible implementation of an archival storage and backup strategy.&lt;br /&gt;
&lt;br /&gt;
*See the [[software]] page for a full list of the tools used in each system release. &lt;br /&gt;
*See the [[documentation]] page for instructions on how to use the virtual appliance to accomplish the system [[requirements]].&lt;br /&gt;
*See http://digitalcommons.uconn.edu/libr_pubs/34/ for instructions on installing Archivematica v0.7 on a custom-sized xubuntu v10.04.2 virtual machine hosted in VirtualBox v4.0.4, written by Michael J. Bennett. See also the related discussion on the [https://groups.google.com/group/archivematica/browse_thread/thread/570559472f3afea4?hl=en Archivematica discussion list]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Minimum hardware requirements=&lt;br /&gt;
&lt;br /&gt;
*Processor: Intel core 2 or AMD Opteron&lt;br /&gt;
*Memory: 512MB for the virtual appliance ('guest') operating system, i.e: if the 'host' operating system has 2GB available, 512MB needs to allocated to the 'guest'. Depending on the operating system, machines with less than 1.5GB total memory will likely have trouble running Archivematica. Note that the default allocation setting in Archivematica is 512MB; however, the more that is allocated the better the system will run. The setting can be changed once Archivematica is running.&lt;br /&gt;
*Hard Drive space: a minimum of 3GB to test the system on a small scale (i.e. use the available test files or import a small set of test files); 12GB or more for larger implementations&lt;br /&gt;
&lt;br /&gt;
=Instructions for using the VM image=&lt;br /&gt;
&lt;br /&gt;
==Install Oracle Virtual Box==&lt;br /&gt;
&lt;br /&gt;
*Archivematica uses the [http://en.wikipedia.org/wiki/Open_Virtualization_Format Open Virtualization Format] and has been tested with the free and open-source [http://www.sun.com/software/products/virtualbox/index.jsp Oracle Virtual Box ] virtualization platform. &lt;br /&gt;
*There are VirtualBox versions available for every major operating system.&lt;br /&gt;
**Download and install Oracle VirtualBox: [http://dlc.sun.com/virtualbox/vboxdownload.html http://dlc.sun.com/virtualbox/vboxdownload.html]. Note that if you are installing VirtualBox on Windows you will have to click through a number of warnings that you are attempting to install non-verified software.&lt;br /&gt;
&lt;br /&gt;
==Download Archivematica ==&lt;br /&gt;
&lt;br /&gt;
*[http://archivematica.org/download Download] the latest version of the Archivematica appliance.&lt;br /&gt;
*Unzip the Archivematica file. This should result in the following two files appearing in an Archivematica folder:&lt;br /&gt;
**Archivematica-0.x.ovf&lt;br /&gt;
**disk0.vmdk&lt;br /&gt;
&lt;br /&gt;
==Start Archivematica virtual appliance==&lt;br /&gt;
&lt;br /&gt;
*Open the Oracle VirtualBox virtual machine.&lt;br /&gt;
*Click File &amp;gt; Import Appliance&lt;br /&gt;
*Click Choose&lt;br /&gt;
*Select ''Archivematica-0.x.ovf''&lt;br /&gt;
*Click Open&lt;br /&gt;
*Click Next&lt;br /&gt;
*Click Import&lt;br /&gt;
*Read and agree to the Software License Agreement (GPL2)&lt;br /&gt;
*The virtual box will open with Archivematica 0.x listed on the left-hand side. Select Archivematica 0.x and click Start (the green arrow in the menu).&lt;br /&gt;
*The image should launch, showing you a Linux Ubuntu desktop and some of the digital preservation applications which are included in Archivematica 0.x &lt;br /&gt;
**You may have to log in using the user name ''demo'' and the password ''demo''. &lt;br /&gt;
**If your mouse pointer does not appear to be working in the virtual machine, click the letter f while holding down the right-hand control button. Do the same thing to switch back to using your mouse pointer outside the virtual box.&lt;br /&gt;
*Congratulations, you have a running copy of Archivematica! See [[Documentation]] for instructions on how to use the software.&lt;br /&gt;
&lt;br /&gt;
==Fixing archivematica sources.list==&lt;br /&gt;
The following commands entered in terminal will re-add the ubuntu repositories (makes repository software installable)&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
sudo -s&lt;br /&gt;
mv /etc/apt/sources.list /etc/apt/sources.list.backup&lt;br /&gt;
wget http://archivematica.org/downloads/sources.list  &lt;br /&gt;
mv ./sources.list /etc/apt/sources.list&lt;br /&gt;
aptitude update&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Import files into virtual appliance (optional)==&lt;br /&gt;
===Using SFTP===&lt;br /&gt;
&lt;br /&gt;
* While in virtualbox right click the archivematica virtualmachine and click settings&lt;br /&gt;
* Click the 'Network' tab&lt;br /&gt;
* 'Adapter 1' should be set to NAT by default(this allows you to get to the internet), click on 'Adapter 2' tab&lt;br /&gt;
* Click enable adapter and set attached to 'host only adapter'&lt;br /&gt;
* Power up the archivematica virtualmachine&lt;br /&gt;
* Once in the xubuntu interface goto Applications &amp;gt; Accessories &amp;gt; Terminal&lt;br /&gt;
* Type the following in terminal  (the password is demo) - this will take a minute and requires internet&lt;br /&gt;
&amp;lt;pre&amp;gt;sudo aptitude install ssh &amp;lt;/pre&amp;gt;&lt;br /&gt;
* type ifconfig in terminal you should see a IP address like '192.168.56.101'  (likely eth1 interface) &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ ifconfig&lt;br /&gt;
eth1      Link encap:Ethernet  HWaddr fe:54:00:9d:92:64  &lt;br /&gt;
          inet addr:192.168.56.101  Bcast:192.168.56.255  Mask:255.255.255.0&lt;br /&gt;
          inet6 addr: fe80::1c6b:7bff:fe07:ddb6/64 Scope:Link&lt;br /&gt;
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1&lt;br /&gt;
          RX packets:24 errors:0 dropped:0 overruns:0 frame:0&lt;br /&gt;
          TX packets:45 errors:0 dropped:0 overruns:0 carrier:0&lt;br /&gt;
          collisions:0 txqueuelen:0 &lt;br /&gt;
          RX bytes:1400 (1.4 KB)  TX bytes:5815 (5.8 KB)&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
*  From here your machine should be connectable via SFTP.  Download a SFTP client, a popular opensource option is FileZilla, which works on Linux and Windows.  If using OSX cyber duck is reported to be a decent opensource SFTP client. &lt;br /&gt;
&lt;br /&gt;
* the connection information should be as follows&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
username: demo&lt;br /&gt;
password: demo&lt;br /&gt;
IP/Hostname: 192.168.56.101 &amp;lt; results of ifconfig likely '192.168.56.*'&lt;br /&gt;
port: 22&lt;br /&gt;
destination folder:   /home/demo/ &amp;lt; if this is not set you will have to navigate to /home/demo directory &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Turn off virtual machine==&lt;br /&gt;
*At the end of your session, turn off the VirtualBox virtual machine by going to Machine &amp;gt; Close &amp;gt; and selecting one of the three shutdown options. It is best to choose either &amp;quot;Save the machine state&amp;quot; or &amp;quot;Send the shutdown signal&amp;quot;. This is from the VirtualBox help manual:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;3.4.3. Saving the state of the machine&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;When you click on the &amp;quot;Close&amp;quot; button of your virtual machine window (at the top right of the window, just like you would close any other window on your system) (or press the Host key together with &amp;quot;Q&amp;quot;), VirtualBox asks you whether you want to &amp;quot;save&amp;quot; or &amp;quot;power off&amp;quot; the VM.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
  &lt;br /&gt;
&amp;lt;blockquote&amp;gt;The difference between these three options is crucial. They mean:&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
  &lt;br /&gt;
&amp;lt;blockquote&amp;gt;Save the machine state: With this option, VirtualBox &amp;quot;freezes&amp;quot; the virtual machine by completely saving its state to your local disk. When you later resume the VM (by again clicking the &amp;quot;Start&amp;quot; button in the VirtualBox main window), you will find that the VM continues exactly where it was left off. All your programs will still be open, and your computer resumes operation.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
  &lt;br /&gt;
&amp;lt;blockquote&amp;gt;Saving the state of a virtual machine is thus in some ways similar to suspending a laptop computer (e.g. by closing its lid).&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Send the shutdown signal. This will send an ACPI shutdown signal to the virtual machine, which has the same effect as if you had pressed the power button on a real computer. So long as a fairly modern operating system is installed and running in the VM, this should trigger a proper shutdown mechanism in the VM.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Power off the machine: With this option, VirtualBox also stops running the virtual machine, but without saving its state.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;This is equivalent to pulling the power plug on a real computer without shutting it down properly. If you start the machine again after powering it off, your operating system will have to reboot completely and may begin a lengthy check of its (virtual) system disks.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;As a result, this should not normally be done, since it can potentially cause data loss or an inconsistent state of the guest system on disk.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;The &amp;quot;Discard&amp;quot; button in the main VirtualBox window discards a virtual machine's saved state. This has the same effect as powering it off, and the same warnings apply.&amp;lt;/blockquote&amp;gt;&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=JPEG2000&amp;diff=3824</id>
		<title>JPEG2000</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=JPEG2000&amp;diff=3824"/>
		<updated>2011-04-07T19:32:19Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: Preservation Issues&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Main Page]] &amp;gt; [[Documentation]] &amp;gt; [[Media type preservation plans]] &amp;gt; JPEG2000&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===File extension(s)===&lt;br /&gt;
.jp2&lt;br /&gt;
&lt;br /&gt;
===Format registry information===&lt;br /&gt;
None. Closest is [http://www.nationalarchives.gov.uk/PRONOM/Format/proFormatSearch.aspx?status=detailReport&amp;amp;id=794 PRONOM (JPX - JPEG 2000 Extended)]&lt;br /&gt;
&lt;br /&gt;
===Significant characteristics===&lt;br /&gt;
See [[Significant characteristics of raster images]]&lt;br /&gt;
&lt;br /&gt;
===Preservation action plan===&lt;br /&gt;
See [[Raster images]]&lt;br /&gt;
&lt;br /&gt;
===Preservation issues===&lt;br /&gt;
*JPEG2000 image compression ranges from completely lossless to very lossy.&lt;br /&gt;
*Florida digital Archive has &amp;quot;high confidence level&amp;quot; in ability to preserve losslessly compressed JP2 files: [http://www.fcla.edu/digitalArchive/formatInfo.htm FDA Recommended Data File Formats].&lt;br /&gt;
*JPEG2000 has limited support for colour profiles: [http://jpeg2000wellcomelibrary.blogspot.com/2010/12/guest-post-ensuring-suitability-of-jpeg.html].&lt;br /&gt;
&lt;br /&gt;
===More information===&lt;br /&gt;
*[http://www.updig.org/guidelines/ph_archiving.html Universal Photographic Digital Imaging Guidelines: Archiving] and [http://www.dpconline.org/docs/reports/dpctw08-01.pdf Technology Watch Support:JPEG 2000 - a Practical Digital Preservation Standard? Digital Preservation Coalition 2008]. &lt;br /&gt;
*[http://www.digitalpreservation.gov/formats/fdd/fdd000143.shtml Library of Congress Sustainability of Digital Formats: JPEG 2000 Part 1 (Core) jp2 File Format]&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=User_talk:Sdyppoypgo&amp;diff=3764</id>
		<title>User talk:Sdyppoypgo</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=User_talk:Sdyppoypgo&amp;diff=3764"/>
		<updated>2011-03-25T14:34:19Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: Eliminate spam&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=JPEG2000&amp;diff=3753</id>
		<title>JPEG2000</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=JPEG2000&amp;diff=3753"/>
		<updated>2011-03-23T19:03:17Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Main Page]] &amp;gt; [[Documentation]] &amp;gt; [[Media type preservation plans]] &amp;gt; JPEG2000&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===File extension(s)===&lt;br /&gt;
.jp2&lt;br /&gt;
&lt;br /&gt;
===Format registry information===&lt;br /&gt;
None. Closest is [http://www.nationalarchives.gov.uk/PRONOM/Format/proFormatSearch.aspx?status=detailReport&amp;amp;id=794 PRONOM (JPX - JPEG 2000 Extended)]&lt;br /&gt;
&lt;br /&gt;
===Significant characteristics===&lt;br /&gt;
See [[Significant characteristics of raster images]]&lt;br /&gt;
&lt;br /&gt;
===Preservation action plan===&lt;br /&gt;
See [[Raster images]]&lt;br /&gt;
&lt;br /&gt;
===Preservation issues===&lt;br /&gt;
*JPEG2000 image compression ranges from completely lossless to very lossy.&lt;br /&gt;
*Florida digital Archive has &amp;quot;high confidence level&amp;quot; in ability to preserve losslessly compressed JP2 files: [http://www.fcla.edu/digitalArchive/formatInfo.htm FDA Recommended Data File Formats].&lt;br /&gt;
&lt;br /&gt;
== Encoding JPEG2000-MXF video ==&lt;br /&gt;
&lt;br /&gt;
===Method one: OpenDCP===&lt;br /&gt;
&lt;br /&gt;
*FFmpeg&lt;br /&gt;
*OpenJPEG 1.4 ([http://www.openjpeg.org/])&lt;br /&gt;
*OpenDCP ([http://code.google.com/p/opendcp/])&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This workflow is designed around digital cinema package (DCP) production.&lt;br /&gt;
Most of the open-source tools are based around DCP since they're being&lt;br /&gt;
designed by indie filmmakers.&lt;br /&gt;
&lt;br /&gt;
OpenDCP is a utility for creating DCPs. The [http://www.cinecert.com/asdcplib/ AS-DCP library] and the [http://sourceforge.net/projects/mxflib/ mxflib library] it's based on are possibilities&lt;br /&gt;
for someone with more programming skills than me to build an&lt;br /&gt;
archives-targeted tool.&lt;br /&gt;
&lt;br /&gt;
The disadvantage to this route is that DCPs use separate files for audio&lt;br /&gt;
and video, accompanied by an XML file describing the pair, which might&lt;br /&gt;
make playback less convenient than a single AV MXF. The available tools&lt;br /&gt;
are also targeted more to film framerates, and I'm not sure if it's&lt;br /&gt;
possible to obtain a video FPS like 29.97 using unmodified tools.&lt;br /&gt;
&lt;br /&gt;
Step 1: Decoding source video&lt;br /&gt;
&lt;br /&gt;
The source video should be decoded into a series of single frames before&lt;br /&gt;
being encoded into JPEG2000. This can be done using Ffmpeg.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;ffmpeg -i invideo -vcodec tiff f%10d.tif&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will produce one file for every frame in the video. Why do we do&lt;br /&gt;
this? Well, the various Motion JPEG2000 formats are essentially a series&lt;br /&gt;
of single JPEG2000 files; each frame will be encoded to JPEG2000 before&lt;br /&gt;
multiplexing into a video.&lt;br /&gt;
&lt;br /&gt;
The audio should be decoded to uncompressed WAV. This is a simple process&lt;br /&gt;
so I won't go into much detail here. The command should be a variant of&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;ffmpeg -i invideo outfile.wav&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Step 2: Create JP2 frames&lt;br /&gt;
&lt;br /&gt;
Next, these frames should be compressed into JPEG2000. Note that this will&lt;br /&gt;
'''not''' be done using OpenJPEG's support for Motion JPEG2000! Motion&lt;br /&gt;
JPEG2000 is a specific container for single JPEG2000 frames, while MXF&lt;br /&gt;
wraps its frames in a different way. Technically, JPEG2000/MXF&lt;br /&gt;
is considered a separate format.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;image_to_j2k -ImgDir directory -OutFor j2c&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will create JP2 frames using the default lossless compression frames,&lt;br /&gt;
but you may wish to customize this. Frames will be saved as .j2c files in&lt;br /&gt;
the same directory as the TIFF frames.&lt;br /&gt;
&lt;br /&gt;
Step 3: Create MXF file&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;opendcp_mxf -i image_directory -r framerate -o video.mxf&lt;br /&gt;
&lt;br /&gt;
opendcp_mxf -i audio.wav -o audio.mxf&lt;br /&gt;
&lt;br /&gt;
opendcp_xml --reel video.mxf audio.mxf&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will produce two MXFs, one containing video and one containing audio.&lt;br /&gt;
The third command creates an accompanying XML file, mainly targeted at&lt;br /&gt;
digital projectors.&lt;br /&gt;
&lt;br /&gt;
===Method two: GStreamer===&lt;br /&gt;
&lt;br /&gt;
GStreamer can encode to and from JPEG2000 MXF. By default it uses JasPer&lt;br /&gt;
as the JP2 encoder, which has some limitations. I believe there may be a&lt;br /&gt;
project underway to replace JasPer with OpenJPEG. GStreamer's commandline&lt;br /&gt;
utility also has very inconvenient syntax; it's designed primarily to be&lt;br /&gt;
used as a library to be called by custom programs. The gst-launch utility&lt;br /&gt;
is mainly meant for debug purposes. For production would be best to write&lt;br /&gt;
a custom archives-oriented converter based on Gstream rather than use the&lt;br /&gt;
gst-launch utility.&lt;br /&gt;
&lt;br /&gt;
The advantage to GStreamer is that it provides all of the required&lt;br /&gt;
functionality, and can go straight from a source video to producing an&lt;br /&gt;
output.&lt;br /&gt;
&lt;br /&gt;
On the other hand, GStreamer currently uses JasPer for JP2 encoding. It&lt;br /&gt;
may be better to use it in conjunction with an external JP2 encoder.&lt;br /&gt;
&lt;br /&gt;
Requirements:&lt;br /&gt;
&lt;br /&gt;
*GStreamer ([http://gstreamer.freedesktop.org])&lt;br /&gt;
*gst-plugins-bad&lt;br /&gt;
&lt;br /&gt;
The GStreamer debug utility is based around a chain of smaller tools&lt;br /&gt;
chained together. Here's a sample command which takes a video-only file,&lt;br /&gt;
in QuickTime format with PNG video, and turns it into a JPEG2000 MXF:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;gst-launch-0.10 filesrc location=video.mov ! qtdemux ! pngdec !&lt;br /&gt;
ffmpegcolorspace ! videoscale ! jp2kenc ! mxfmux ! filesink&lt;br /&gt;
location=video.mxf&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This goes through a series of steps to get from your source video to the&lt;br /&gt;
encoded video. This example does:&lt;br /&gt;
&lt;br /&gt;
*filesrc location=video.mov : Tells gst-launch where to find the source video&lt;br /&gt;
*qtdemux : Demultiplexes the video. This means that it &amp;quot;opens&amp;quot; the container and extracts the tracks inside it, making them available to the other GStreamer tools&lt;br /&gt;
*pngdec : Decodes PNG compressed video into uncompressed video &lt;br /&gt;
*ffmpegcolorspace : Interprets the colour space in the video. Necessary to be able to pass it to other tools&lt;br /&gt;
*videoscale : Negotiates the size of the video. When used with no arguments, it just ensures the next tool knows what the size is&lt;br /&gt;
*jp2kenc : Encodes the video stream to JPEG 2000. You can specify the settings here if you want&lt;br /&gt;
*mxfmux : Combines (multiplexes) the streams being processed into an MXF container&lt;br /&gt;
*filesink location=video.mxf : Saves the video to a &amp;quot;filesink&amp;quot;, e.g. a location on the hard drive&lt;br /&gt;
&lt;br /&gt;
The exact commands will depend on the source video. You can also do a&lt;br /&gt;
combined audio/video file, but the commands for that are kind of complex&lt;br /&gt;
and, to be honest, I haven't quite gotten my head around it yet.&lt;br /&gt;
&lt;br /&gt;
You can also decode and convert videos using GStreamer. Here's a sample&lt;br /&gt;
command to convert the previous video to MPEG-4 with default settings:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;gst-launch-0.10 filesrc location=video.mxf ! mxfdemux name=d d. !&lt;br /&gt;
jp2kdec ! ffmpegcolorspace ! videoscale ! ffenc_mpeg4 ! filesink&lt;br /&gt;
location=video.mp4&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Method three: GStreamer + OpenJPEG===&lt;br /&gt;
&lt;br /&gt;
Requirements:&lt;br /&gt;
&lt;br /&gt;
*GStreamer&lt;br /&gt;
*gstreamer-plugins-bad&lt;br /&gt;
*FFmpeg&lt;br /&gt;
*OpenJPEG&lt;br /&gt;
&lt;br /&gt;
As I mentioned before, GStreamer's JP2 encoder isn't ideal right now.&lt;br /&gt;
However, you can easily encode a video with another tool and wrap it into&lt;br /&gt;
MXF using GStreamer.&lt;br /&gt;
&lt;br /&gt;
First, extract your video to frames with ffmpeg as described above, then&lt;br /&gt;
compress them to JP2 frames using the following command:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;image_to_j2k -ImgDir directory -OutFor j2k&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You should then wrap them into an MJ2 video using the following command:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;wrap_j2k_in_mj2 directory video.mj2&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will produce a single MJ2 file using the frames you extracted. By&lt;br /&gt;
default the framerate will be 25fps, which is probably wrong - consult the&lt;br /&gt;
OpenJPEG documentation to find out how to set the framerate.&lt;br /&gt;
&lt;br /&gt;
You can then open this video file using GStreamer and rewrap the frames&lt;br /&gt;
into an MXF using a command like the following:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;gst-launch-0.10 filesrc location=video.mj2 ! mj2demux ! mxfmux ! filesink&lt;br /&gt;
location=video.mxf&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will produce an MXF with the contents as your video - no&lt;br /&gt;
recompression or loss of quality. Hooray!&lt;br /&gt;
&lt;br /&gt;
===Areas for future work===&lt;br /&gt;
&lt;br /&gt;
GStreamer seems ripe for someone to develop a custom, archives-oriented video conversion program. Any volunteers?&lt;br /&gt;
&lt;br /&gt;
==More information==&lt;br /&gt;
[http://www.digitalpreservation.gov/formats/fdd/fdd000143.shtml Library of Congress Sustainability of Digital Formats: JPEG 2000 Part 1 (Core) jp2 File Format]&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=JPEG2000&amp;diff=3752</id>
		<title>JPEG2000</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=JPEG2000&amp;diff=3752"/>
		<updated>2011-03-23T19:01:59Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: Reparent &amp;quot;more information&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Main Page]] &amp;gt; [[Documentation]] &amp;gt; [[Media type preservation plans]] &amp;gt; JPEG2000&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===File extension(s)===&lt;br /&gt;
.jp2&lt;br /&gt;
&lt;br /&gt;
===Format registry information===&lt;br /&gt;
None. Closest is [http://www.nationalarchives.gov.uk/PRONOM/Format/proFormatSearch.aspx?status=detailReport&amp;amp;id=794 PRONOM (JPX - JPEG 2000 Extended)]&lt;br /&gt;
&lt;br /&gt;
===Significant characteristics===&lt;br /&gt;
See [[Significant characteristics of raster images]]&lt;br /&gt;
&lt;br /&gt;
===Preservation action plan===&lt;br /&gt;
See [[Raster images]]&lt;br /&gt;
&lt;br /&gt;
===Preservation issues===&lt;br /&gt;
*JPEG2000 image compression ranges from completely lossless to very lossy.&lt;br /&gt;
*Florida digital Archive has &amp;quot;high confidence level&amp;quot; in ability to preserve losslessly compressed JP2 files: [http://www.fcla.edu/digitalArchive/formatInfo.htm FDA Recommended Data File Formats].&lt;br /&gt;
&lt;br /&gt;
== Encoding JPEG2000-MXF video ==&lt;br /&gt;
&lt;br /&gt;
===Method one: OpenDCP===&lt;br /&gt;
&lt;br /&gt;
*FFmpeg&lt;br /&gt;
*OpenJPEG 1.4 ([http://www.openjpeg.org/])&lt;br /&gt;
*OpenDCP ([http://code.google.com/p/opendcp/])&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This workflow is designed around digital cinema package (DCP) production.&lt;br /&gt;
Most of the open-source tools are based around DCP since they're being&lt;br /&gt;
designed by indie filmmakers.&lt;br /&gt;
&lt;br /&gt;
OpenDCP is a utility for creating DCPs. The [http://www.cinecert.com/asdcplib/ AS-DCP library] and the [http://sourceforge.net/projects/mxflib/ mxflib library] it's based on are possibilities&lt;br /&gt;
for someone with more programming skills than me to build an&lt;br /&gt;
archives-targeted tool.&lt;br /&gt;
&lt;br /&gt;
The disadvantage to this route is that DCPs use separate files for audio&lt;br /&gt;
and video, accompanied by an XML file describing the pair, which might&lt;br /&gt;
make playback less convenient than a single AV MXF. The available tools&lt;br /&gt;
are also targeted more to film framerates, and I'm not sure if it's&lt;br /&gt;
possible to obtain a video FPS like 29.97 using unmodified tools.&lt;br /&gt;
&lt;br /&gt;
Step 1: Decoding source video&lt;br /&gt;
&lt;br /&gt;
The source video should be decoded into a series of single frames before&lt;br /&gt;
being encoded into JPEG2000. This can be done using Ffmpeg.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;ffmpeg -i invideo -vcodec tiff f%10d.tif&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will produce one file for every frame in the video. Why do we do&lt;br /&gt;
this? Well, the various Motion JPEG2000 formats are essentially a series&lt;br /&gt;
of single JPEG2000 files; each frame will be encoded to JPEG2000 before&lt;br /&gt;
multiplexing into a video.&lt;br /&gt;
&lt;br /&gt;
The audio should be decoded to uncompressed WAV. This is a simple process&lt;br /&gt;
so I won't go into much detail here. The command should be a variant of&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;ffmpeg -i invideo outfile.wav&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Step 2: Create JP2 frames&lt;br /&gt;
&lt;br /&gt;
Next, these frames should be compressed into JPEG2000. Note that this will&lt;br /&gt;
'''not''' be done using OpenJPEG's support for Motion JPEG2000! Motion&lt;br /&gt;
JPEG2000 is a specific container for single JPEG2000 frames, while MXF&lt;br /&gt;
wraps its frames in a different way. Technically, JPEG2000/MXF&lt;br /&gt;
is considered a separate format.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;image_to_j2k -ImgDir directory -OutFor j2c&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will create JP2 frames using the default lossless compression frames,&lt;br /&gt;
but you may wish to customize this. Frames will be saved as .j2c files in&lt;br /&gt;
the same directory as the TIFF frames.&lt;br /&gt;
&lt;br /&gt;
Step 3: Create MXF file&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;opendcp_mxf -i image_directory -r framerate -o video.mxf&lt;br /&gt;
&lt;br /&gt;
opendcp_mxf -i audio.wav -o audio.mxf&lt;br /&gt;
&lt;br /&gt;
opendcp_xml --reel video.mxf audio.mxf&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will produce two MXFs, one containing video and one containing audio.&lt;br /&gt;
The third command creates an accompanying XML file, mainly targeted at&lt;br /&gt;
digital projectors.&lt;br /&gt;
&lt;br /&gt;
===Method two: GStreamer===&lt;br /&gt;
&lt;br /&gt;
GStreamer can encode to and from JPEG2000 MXF. By default it uses JasPer&lt;br /&gt;
as the JP2 encoder, which has some limitations. I believe there may be a&lt;br /&gt;
project underway to replace JasPer with OpenJPEG. GStreamer's commandline&lt;br /&gt;
utility also has very inconvenient syntax; it's designed primarily to be&lt;br /&gt;
used as a library to be called by custom programs. The gst-launch utility&lt;br /&gt;
is mainly meant for debug purposes. For production would be best to write&lt;br /&gt;
a custom archives-oriented converter based on Gstream rather than use the&lt;br /&gt;
gst-launch utility.&lt;br /&gt;
&lt;br /&gt;
The advantage to GStreamer is that it provides all of the required&lt;br /&gt;
functionality, and can go straight from a source video to producing an&lt;br /&gt;
output.&lt;br /&gt;
&lt;br /&gt;
On the other hand, GStreamer currently uses JasPer for JP2 encoding. It&lt;br /&gt;
may be better to use it in conjunction with an external JP2 encoder.&lt;br /&gt;
&lt;br /&gt;
Requirements:&lt;br /&gt;
&lt;br /&gt;
GStreamer ([http://gstreamer.freedesktop.org])&lt;br /&gt;
gst-plugins-bad&lt;br /&gt;
&lt;br /&gt;
The GStreamer debug utility is based around a chain of smaller tools&lt;br /&gt;
chained together. Here's a sample command which takes a video-only file,&lt;br /&gt;
in QuickTime format with PNG video, and turns it into a JPEG2000 MXF:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;gst-launch-0.10 filesrc location=video.mov ! qtdemux ! pngdec !&lt;br /&gt;
ffmpegcolorspace ! videoscale ! jp2kenc ! mxfmux ! filesink&lt;br /&gt;
location=video.mxf&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This goes through a series of steps to get from your source video to the&lt;br /&gt;
encoded video. This example does:&lt;br /&gt;
&lt;br /&gt;
*filesrc location=video.mov : Tells gst-launch where to find the source&lt;br /&gt;
video&lt;br /&gt;
*qtdemux : Demultiplexes the video. This means that it &amp;quot;opens&amp;quot; the&lt;br /&gt;
container and extracts the tracks inside it, making them available to the&lt;br /&gt;
other GStreamer tools&lt;br /&gt;
*pngdec : Decodes PNG compressed video into uncompressed video&lt;br /&gt;
*ffmpegcolorspace : Interprets the colour space in the video. Necessary to&lt;br /&gt;
be able to pass it to other tools&lt;br /&gt;
*videoscale : Negotiates the size of the video. When used with no&lt;br /&gt;
arguments, it just ensures the next tool knows what the size is&lt;br /&gt;
*jp2kenc : Encodes the video stream to JPEG 2000. You can specify the&lt;br /&gt;
settings here if you want&lt;br /&gt;
*mxfmux : Combines (multiplexes) the streams being processed into an MXF&lt;br /&gt;
container&lt;br /&gt;
*filesink location=video.mxf : Saves the video to a &amp;quot;filesink&amp;quot;, e.g. a&lt;br /&gt;
location on the hard drive&lt;br /&gt;
&lt;br /&gt;
The exact commands will depend on the source video. You can also do a&lt;br /&gt;
combined audio/video file, but the commands for that are kind of complex&lt;br /&gt;
and, to be honest, I haven't quite gotten my head around it yet.&lt;br /&gt;
&lt;br /&gt;
You can also decode and convert videos using GStreamer. Here's a sample&lt;br /&gt;
command to convert the previous video to MPEG-4 with default settings:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;gst-launch-0.10 filesrc location=video.mxf ! mxfdemux name=d d. !&lt;br /&gt;
jp2kdec ! ffmpegcolorspace ! videoscale ! ffenc_mpeg4 ! filesink&lt;br /&gt;
location=video.mp4&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Method three: GStreamer + OpenJPEG===&lt;br /&gt;
&lt;br /&gt;
Requirements:&lt;br /&gt;
&lt;br /&gt;
GStreamer&lt;br /&gt;
gstreamer-plugins-bad&lt;br /&gt;
FFmpeg&lt;br /&gt;
OpenJPEG&lt;br /&gt;
&lt;br /&gt;
As I mentioned before, GStreamer's JP2 encoder isn't ideal right now.&lt;br /&gt;
However, you can easily encode a video with another tool and wrap it into&lt;br /&gt;
MXF using GStreamer.&lt;br /&gt;
&lt;br /&gt;
First, extract your video to frames with ffmpeg as described above, then&lt;br /&gt;
compress them to JP2 frames using the following command:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;image_to_j2k -ImgDir directory -OutFor j2k&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You should then wrap them into an MJ2 video using the following command:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;wrap_j2k_in_mj2 directory video.mj2&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will produce a single MJ2 file using the frames you extracted. By&lt;br /&gt;
default the framerate will be 25fps, which is probably wrong - consult the&lt;br /&gt;
OpenJPEG documentation to find out how to set the framerate.&lt;br /&gt;
&lt;br /&gt;
You can then open this video file using GStreamer and rewrap the frames&lt;br /&gt;
into an MXF using a command like the following:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;gst-launch-0.10 filesrc location=video.mj2 ! mj2demux ! mxfmux ! filesink&lt;br /&gt;
location=video.mxf&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will produce an MXF with the contents as your video - no&lt;br /&gt;
recompression or loss of quality. Hooray!&lt;br /&gt;
&lt;br /&gt;
===Areas for future work===&lt;br /&gt;
&lt;br /&gt;
GStreamer seems ripe for someone to develop a custom, archives-oriented video conversion program. Any volunteers?&lt;br /&gt;
&lt;br /&gt;
==More information==&lt;br /&gt;
[http://www.digitalpreservation.gov/formats/fdd/fdd000143.shtml Library of Congress Sustainability of Digital Formats: JPEG 2000 Part 1 (Core) jp2 File Format]&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=JPEG2000&amp;diff=3751</id>
		<title>JPEG2000</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=JPEG2000&amp;diff=3751"/>
		<updated>2011-03-23T18:34:40Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: JP2-MXF video&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Main Page]] &amp;gt; [[Documentation]] &amp;gt; [[Media type preservation plans]] &amp;gt; JPEG2000&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===File extension(s)===&lt;br /&gt;
.jp2&lt;br /&gt;
&lt;br /&gt;
===Format registry information===&lt;br /&gt;
None. Closest is [http://www.nationalarchives.gov.uk/PRONOM/Format/proFormatSearch.aspx?status=detailReport&amp;amp;id=794 PRONOM (JPX - JPEG 2000 Extended)]&lt;br /&gt;
&lt;br /&gt;
===Significant characteristics===&lt;br /&gt;
See [[Significant characteristics of raster images]]&lt;br /&gt;
&lt;br /&gt;
===Preservation action plan===&lt;br /&gt;
See [[Raster images]]&lt;br /&gt;
&lt;br /&gt;
===Preservation issues===&lt;br /&gt;
*JPEG2000 image compression ranges from completely lossless to very lossy.&lt;br /&gt;
*Florida digital Archive has &amp;quot;high confidence level&amp;quot; in ability to preserve losslessly compressed JP2 files: [http://www.fcla.edu/digitalArchive/formatInfo.htm FDA Recommended Data File Formats].&lt;br /&gt;
&lt;br /&gt;
== Encoding JPEG2000-MXF video ==&lt;br /&gt;
&lt;br /&gt;
===Method one: OpenDCP===&lt;br /&gt;
&lt;br /&gt;
*FFmpeg&lt;br /&gt;
*OpenJPEG 1.4 ([http://www.openjpeg.org/])&lt;br /&gt;
*OpenDCP ([http://code.google.com/p/opendcp/])&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This workflow is designed around digital cinema package (DCP) production.&lt;br /&gt;
Most of the open-source tools are based around DCP since they're being&lt;br /&gt;
designed by indie filmmakers.&lt;br /&gt;
&lt;br /&gt;
OpenDCP is a utility for creating DCPs. The [http://www.cinecert.com/asdcplib/ AS-DCP library] and the [http://sourceforge.net/projects/mxflib/ mxflib library] it's based on are possibilities&lt;br /&gt;
for someone with more programming skills than me to build an&lt;br /&gt;
archives-targeted tool.&lt;br /&gt;
&lt;br /&gt;
The disadvantage to this route is that DCPs use separate files for audio&lt;br /&gt;
and video, accompanied by an XML file describing the pair, which might&lt;br /&gt;
make playback less convenient than a single AV MXF. The available tools&lt;br /&gt;
are also targeted more to film framerates, and I'm not sure if it's&lt;br /&gt;
possible to obtain a video FPS like 29.97 using unmodified tools.&lt;br /&gt;
&lt;br /&gt;
Step 1: Decoding source video&lt;br /&gt;
&lt;br /&gt;
The source video should be decoded into a series of single frames before&lt;br /&gt;
being encoded into JPEG2000. This can be done using Ffmpeg.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;ffmpeg -i invideo -vcodec tiff f%10d.tif&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will produce one file for every frame in the video. Why do we do&lt;br /&gt;
this? Well, the various Motion JPEG2000 formats are essentially a series&lt;br /&gt;
of single JPEG2000 files; each frame will be encoded to JPEG2000 before&lt;br /&gt;
multiplexing into a video.&lt;br /&gt;
&lt;br /&gt;
The audio should be decoded to uncompressed WAV. This is a simple process&lt;br /&gt;
so I won't go into much detail here. The command should be a variant of&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;ffmpeg -i invideo outfile.wav&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Step 2: Create JP2 frames&lt;br /&gt;
&lt;br /&gt;
Next, these frames should be compressed into JPEG2000. Note that this will&lt;br /&gt;
'''not''' be done using OpenJPEG's support for Motion JPEG2000! Motion&lt;br /&gt;
JPEG2000 is a specific container for single JPEG2000 frames, while MXF&lt;br /&gt;
wraps its frames in a different way. Technically, JPEG2000/MXF&lt;br /&gt;
is considered a separate format.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;image_to_j2k -ImgDir directory -OutFor j2c&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will create JP2 frames using the default lossless compression frames,&lt;br /&gt;
but you may wish to customize this. Frames will be saved as .j2c files in&lt;br /&gt;
the same directory as the TIFF frames.&lt;br /&gt;
&lt;br /&gt;
Step 3: Create MXF file&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;opendcp_mxf -i image_directory -r framerate -o video.mxf&lt;br /&gt;
&lt;br /&gt;
opendcp_mxf -i audio.wav -o audio.mxf&lt;br /&gt;
&lt;br /&gt;
opendcp_xml --reel video.mxf audio.mxf&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will produce two MXFs, one containing video and one containing audio.&lt;br /&gt;
The third command creates an accompanying XML file, mainly targeted at&lt;br /&gt;
digital projectors.&lt;br /&gt;
&lt;br /&gt;
===Method two: GStreamer===&lt;br /&gt;
&lt;br /&gt;
GStreamer can encode to and from JPEG2000 MXF. By default it uses JasPer&lt;br /&gt;
as the JP2 encoder, which has some limitations. I believe there may be a&lt;br /&gt;
project underway to replace JasPer with OpenJPEG. GStreamer's commandline&lt;br /&gt;
utility also has very inconvenient syntax; it's designed primarily to be&lt;br /&gt;
used as a library to be called by custom programs. The gst-launch utility&lt;br /&gt;
is mainly meant for debug purposes. For production would be best to write&lt;br /&gt;
a custom archives-oriented converter based on Gstream rather than use the&lt;br /&gt;
gst-launch utility.&lt;br /&gt;
&lt;br /&gt;
The advantage to GStreamer is that it provides all of the required&lt;br /&gt;
functionality, and can go straight from a source video to producing an&lt;br /&gt;
output.&lt;br /&gt;
&lt;br /&gt;
On the other hand, GStreamer currently uses JasPer for JP2 encoding. It&lt;br /&gt;
may be better to use it in conjunction with an external JP2 encoder.&lt;br /&gt;
&lt;br /&gt;
Requirements:&lt;br /&gt;
&lt;br /&gt;
GStreamer ([http://gstreamer.freedesktop.org])&lt;br /&gt;
gst-plugins-bad&lt;br /&gt;
&lt;br /&gt;
The GStreamer debug utility is based around a chain of smaller tools&lt;br /&gt;
chained together. Here's a sample command which takes a video-only file,&lt;br /&gt;
in QuickTime format with PNG video, and turns it into a JPEG2000 MXF:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;gst-launch-0.10 filesrc location=video.mov ! qtdemux ! pngdec !&lt;br /&gt;
ffmpegcolorspace ! videoscale ! jp2kenc ! mxfmux ! filesink&lt;br /&gt;
location=video.mxf&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This goes through a series of steps to get from your source video to the&lt;br /&gt;
encoded video. This example does:&lt;br /&gt;
&lt;br /&gt;
*filesrc location=video.mov : Tells gst-launch where to find the source&lt;br /&gt;
video&lt;br /&gt;
*qtdemux : Demultiplexes the video. This means that it &amp;quot;opens&amp;quot; the&lt;br /&gt;
container and extracts the tracks inside it, making them available to the&lt;br /&gt;
other GStreamer tools&lt;br /&gt;
*pngdec : Decodes PNG compressed video into uncompressed video&lt;br /&gt;
*ffmpegcolorspace : Interprets the colour space in the video. Necessary to&lt;br /&gt;
be able to pass it to other tools&lt;br /&gt;
*videoscale : Negotiates the size of the video. When used with no&lt;br /&gt;
arguments, it just ensures the next tool knows what the size is&lt;br /&gt;
*jp2kenc : Encodes the video stream to JPEG 2000. You can specify the&lt;br /&gt;
settings here if you want&lt;br /&gt;
*mxfmux : Combines (multiplexes) the streams being processed into an MXF&lt;br /&gt;
container&lt;br /&gt;
*filesink location=video.mxf : Saves the video to a &amp;quot;filesink&amp;quot;, e.g. a&lt;br /&gt;
location on the hard drive&lt;br /&gt;
&lt;br /&gt;
The exact commands will depend on the source video. You can also do a&lt;br /&gt;
combined audio/video file, but the commands for that are kind of complex&lt;br /&gt;
and, to be honest, I haven't quite gotten my head around it yet.&lt;br /&gt;
&lt;br /&gt;
You can also decode and convert videos using GStreamer. Here's a sample&lt;br /&gt;
command to convert the previous video to MPEG-4 with default settings:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;gst-launch-0.10 filesrc location=video.mxf ! mxfdemux name=d d. !&lt;br /&gt;
jp2kdec ! ffmpegcolorspace ! videoscale ! ffenc_mpeg4 ! filesink&lt;br /&gt;
location=video.mp4&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Method three: GStreamer + OpenJPEG===&lt;br /&gt;
&lt;br /&gt;
Requirements:&lt;br /&gt;
&lt;br /&gt;
GStreamer&lt;br /&gt;
gstreamer-plugins-bad&lt;br /&gt;
FFmpeg&lt;br /&gt;
OpenJPEG&lt;br /&gt;
&lt;br /&gt;
As I mentioned before, GStreamer's JP2 encoder isn't ideal right now.&lt;br /&gt;
However, you can easily encode a video with another tool and wrap it into&lt;br /&gt;
MXF using GStreamer.&lt;br /&gt;
&lt;br /&gt;
First, extract your video to frames with ffmpeg as described above, then&lt;br /&gt;
compress them to JP2 frames using the following command:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;image_to_j2k -ImgDir directory -OutFor j2k&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You should then wrap them into an MJ2 video using the following command:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;wrap_j2k_in_mj2 directory video.mj2&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will produce a single MJ2 file using the frames you extracted. By&lt;br /&gt;
default the framerate will be 25fps, which is probably wrong - consult the&lt;br /&gt;
OpenJPEG documentation to find out how to set the framerate.&lt;br /&gt;
&lt;br /&gt;
You can then open this video file using GStreamer and rewrap the frames&lt;br /&gt;
into an MXF using a command like the following:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;gst-launch-0.10 filesrc location=video.mj2 ! mj2demux ! mxfmux ! filesink&lt;br /&gt;
location=video.mxf&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This will produce an MXF with the contents as your video - no&lt;br /&gt;
recompression or loss of quality. Hooray!&lt;br /&gt;
&lt;br /&gt;
===Areas for future work===&lt;br /&gt;
&lt;br /&gt;
GStreamer seems ripe for someone to develop a custom, archives-oriented video conversion program. Any volunteers?&lt;br /&gt;
&lt;br /&gt;
===More information===&lt;br /&gt;
[http://www.digitalpreservation.gov/formats/fdd/fdd000143.shtml Library of Congress Sustainability of Digital Formats: JPEG 2000 Part 1 (Core) jp2 File Format]&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=Talk:Virtual_appliance_instructions&amp;diff=3714</id>
		<title>Talk:Virtual appliance instructions</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=Talk:Virtual_appliance_instructions&amp;diff=3714"/>
		<updated>2011-03-18T18:00:38Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I am curious why 2.5.2 Using Shared Folders was removed from the virtual appliance instructions. I wanted to try Archivematica out on local copies of some of our files, and I needed these instructions. I found them in the Google cache and they worked just fine. I would suggest putting them back! Thanks. [[User:ChristieP|ChristieP]] 10:51, 18 March 2011 (PDT)&lt;br /&gt;
&lt;br /&gt;
:Many users have been having trouble using shared folders in VirtualBox with the latest version of Archivematica, so the devs have temporarily removed the instructions. I was able to get shared folders working in VMware Fusion, so I'm planning to post a set of VMware instructions. [[User:Misty De Meo|Misty De Meo]] 11:00, 18 March 2011 (PDT)&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3645</id>
		<title>Transcoder</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3645"/>
		<updated>2011-03-10T14:51:02Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: /* Known Issues */ Changed expected fix date&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Transcode: convert (language or information) from one form of coded representation to another.[ source: [http://oxforddictionaries.com/view/entry/m_en_gb0876960#m_en_gb0876960 Oxford English Dictionary] ]&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
The transcoder is developed by artefactual, for the purpose of normalization and generating access copies in the archivematica system. In earlier versions it was called normalizer. It will try to identify the file type by the file extension, or other metadata, and look for matching configured actions for those identified. It will then perform those actions, and exit with a zero status if it believes those actions have been completed successfully.&lt;br /&gt;
&lt;br /&gt;
=Development=&lt;br /&gt;
Presently to manage the complexity of automating the link between file identification and actions, a database based implementation of the transcoder is being built to replace the current xml one.&lt;br /&gt;
&lt;br /&gt;
=Configuration=&lt;br /&gt;
Configuration files are located in the /etc/transcoder/ directory.&lt;br /&gt;
&lt;br /&gt;
==transcoderConfig.conf==&lt;br /&gt;
&lt;br /&gt;
transcoderConfig.conf is the primary transcoder configuration file. It is a bash script which defines the variables used in the various file format policy XML files; it primarily contains paths to conversion tools and standard file names.&lt;br /&gt;
&lt;br /&gt;
Variables are stored as standard bash shell script variables. Variables can be added or edited using any text editor; any new variables added become available for use in format policy XML files. They use the format:&lt;br /&gt;
&lt;br /&gt;
variableName=&amp;quot;variable contents&amp;quot;&lt;br /&gt;
&lt;br /&gt;
Default variables:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|'''Variable'''&lt;br /&gt;
|'''Description'''&lt;br /&gt;
|'''Default value'''&lt;br /&gt;
|-&lt;br /&gt;
|formatPoliciesPath&lt;br /&gt;
|Directory containing format policy XML files&lt;br /&gt;
|/etc/transcoder/archivematicaFormatPolicies/&lt;br /&gt;
|-&lt;br /&gt;
|transcoderScriptsDir&lt;br /&gt;
|Directory containing transcoder normalization scripts&lt;br /&gt;
|/usr/lib/transcoder/transcoderScripts/&lt;br /&gt;
|-&lt;br /&gt;
|convertPath&lt;br /&gt;
|Path to ImageMagick for image conversion. Requires a space at the end.&lt;br /&gt;
|/usr/bin/convert&lt;br /&gt;
|-&lt;br /&gt;
|ffmpegPath&lt;br /&gt;
|Path to ffmpeg for audio and video. Requires a space at the end.&lt;br /&gt;
|/usr/bin/ffmpeg&lt;br /&gt;
|-&lt;br /&gt;
|theoraPath&lt;br /&gt;
|Path to ffmpeg2theora script to create Ogg Theora and Vorbis files. Currently unused. Requires a space at the end.&lt;br /&gt;
|/usr/bin/ffmpeg2theora&lt;br /&gt;
|-&lt;br /&gt;
|unoconvPath&lt;br /&gt;
|Path to unoconv binary for converting document files. Currently unused. Requires a space at the end.&lt;br /&gt;
|/usr/bin/unoconv&lt;br /&gt;
|-&lt;br /&gt;
|unoconvAlternatePath&lt;br /&gt;
|Path to unoconv launcher script for converting document files. Requires a space at the end.&lt;br /&gt;
|/usr/lib/transcoder/transcoderScripts/unoconvAlternative.sh&lt;br /&gt;
|-&lt;br /&gt;
|DublinCore&lt;br /&gt;
|File name for Dublin Core metadata&lt;br /&gt;
|dublincore.xml&lt;br /&gt;
|-&lt;br /&gt;
|MD5FileName&lt;br /&gt;
|File name containing SIP MD5 checksum&lt;br /&gt;
|MD5checksum.txt&lt;br /&gt;
|-&lt;br /&gt;
|fileUUIDHumanReadable&lt;br /&gt;
|Log file containing unique IDs for items within a SIP&lt;br /&gt;
|FileUUIDs.log&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==archivematicaFormatPolicies==&lt;br /&gt;
&lt;br /&gt;
The /etc/transcoder/archivematicaFormatPolicies directory contains XML files which control how Archivematica performs normalization. Transcoder reads the file extension of a file and selects the matching XML file to determine how to perform normalization. Note that, because normalization is based on file extension, objects with an incorrect file extension or no extension will usually fail to normalize - this is [http://code.google.com/p/archivematica/issues/detail?id=156 scheduled] to change in Archivematica 0.7.2.&lt;br /&gt;
&lt;br /&gt;
The following sample configuration file illustrates the syntax:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;source lang=&amp;quot;xml&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;formatPolicy&amp;gt;&lt;br /&gt;
  &amp;lt;inherit&amp;gt;&amp;lt;/inherit&amp;gt;&lt;br /&gt;
  &amp;lt;accessFormat&amp;gt;MP3&amp;lt;/accessFormat&amp;gt;&lt;br /&gt;
  &amp;lt;preservationFormat&amp;gt;WAV&amp;lt;/preservationFormat&amp;gt;&lt;br /&gt;
  &amp;lt;accessConversionCommand&amp;gt;%ffmpegPath% -i %fileFullName% -ab 192000 %accessFileDirectory%%fileTitle%.%accessFormat%&amp;lt;/accessConversionCommand&amp;gt;&lt;br /&gt;
  &amp;lt;preservationConversionCommand&amp;gt;%ffmpegPath% -i %fileFullName% %preservationFileDirectory%%fileTitle%.%preservationFormat%&amp;lt;/preservationConversionCommand&amp;gt;&lt;br /&gt;
  &amp;lt;preservationConversionCommand&amp;gt;%xenaPath%&amp;lt;/preservationConversionCommand&amp;gt;&lt;br /&gt;
&amp;lt;/formatPolicy&amp;gt;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Every format policy XML encloses its contents in the '''&amp;lt;formatPolicy&amp;gt;&amp;lt;/formatPolicy&amp;gt;''' tag. All content intended to be read by transcoder for normalization should be between the opening and close tags.&lt;br /&gt;
&lt;br /&gt;
* '''&amp;lt;inherit&amp;gt;''' should should be used when a format shares the same normalization processes as another format. For instance, all audio files (MP3, WMA, etc.) share a common access format and preservation format and are converted using the same command. For instance, the XML files for MP3 and WMA contain only an &amp;lt;inherit&amp;gt; tag pointing to the &amp;quot;parent&amp;quot; AUDIO document that contains the commands for all audio types. If &amp;lt;inherit&amp;gt; is used, all other tags should be empty.&lt;br /&gt;
* '''&amp;lt;accessFormat&amp;gt;''' defines the access format defined for the specified object type. It should be expressed using the file format for that type, in capital letters. For instance, '''MP3''' is the accessFormat for audio files.&lt;br /&gt;
* '''&amp;lt;preservationFormat&amp;gt;''' defines the preservation format for the specified object type. It uses the same syntax as &amp;lt;accessFormat&amp;gt;&lt;br /&gt;
* '''&amp;lt;accessConversionCommand&amp;gt;''' contains the commandline options for launching the tool which creates an access copy from the specified object type. This can be customized or replaced. Any variables in use are defined by normalizationConfig.conf and should be customized there.&lt;br /&gt;
* '''&amp;lt;preservationConversionCommand&amp;gt;''' contains the commandline options for launching the tool which creates a preservation copy from the specified object type. This can be customized or replaced. Any variables in use are defined by normalizationConfig.conf and should be customized there.&lt;br /&gt;
&lt;br /&gt;
Note that all tags should be present, regardless if they are being used. For instance, if the format in question does not inherit another type, the inherit opening and closing tags should still be present without enclosing any content, e.g.: &amp;lt;inherit&amp;gt;&amp;lt;/inherit&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Known Issues===&lt;br /&gt;
&lt;br /&gt;
*Transcode will not normalize between codecs when using the same container. Transcoder does not normalize files which are already in an access/preservation format. Because it currently uses the file extension to identify file formats, this means that transcode erroneously conflates video containers with video codecs, and won't normalize from one file extension to the same file extension. This is expected to be fixed in Archivematica 0.8. See [http://code.google.com/p/archivematica/issues/detail?id=468 issue 468]&lt;br /&gt;
&lt;br /&gt;
TODO: learn why there can be two &amp;lt;preservationConversionCommand&amp;gt;s and update documentation&lt;br /&gt;
&lt;br /&gt;
[[Category:Development documentation]]&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3634</id>
		<title>Transcoder</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3634"/>
		<updated>2011-03-08T20:46:32Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: /* archivematicaFormatPolicies */ added known issues&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Transcode: convert (language or information) from one form of coded representation to another.[ source: [http://oxforddictionaries.com/view/entry/m_en_gb0876960#m_en_gb0876960 Oxford English Dictionary] ]&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
The transcoder is developed by artefactual, for the purpose of normalization and generating access copies in the archivematica system. In earlier versions it was called normalizer. It will try to identify the file type by the file extension, or other metadata, and look for matching configured actions for those identified. It will then perform those actions, and exit with a zero status if it believes those actions have been completed successfully.&lt;br /&gt;
&lt;br /&gt;
=Development=&lt;br /&gt;
Presently to manage the complexity of automating the link between file identification and actions, a database based implementation of the transcoder is being built to replace the current xml one.&lt;br /&gt;
&lt;br /&gt;
=Configuration=&lt;br /&gt;
Configuration files are located in the /etc/transcoder/ directory.&lt;br /&gt;
&lt;br /&gt;
==transcoderConfig.conf==&lt;br /&gt;
&lt;br /&gt;
transcoderConfig.conf is the primary transcoder configuration file. It is a bash script which defines the variables used in the various file format policy XML files; it primarily contains paths to conversion tools and standard file names.&lt;br /&gt;
&lt;br /&gt;
Variables are stored as standard bash shell script variables. Variables can be added or edited using any text editor; any new variables added become available for use in format policy XML files. They use the format:&lt;br /&gt;
&lt;br /&gt;
variableName=&amp;quot;variable contents&amp;quot;&lt;br /&gt;
&lt;br /&gt;
Default variables:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|'''Variable'''&lt;br /&gt;
|'''Description'''&lt;br /&gt;
|'''Default value'''&lt;br /&gt;
|-&lt;br /&gt;
|formatPoliciesPath&lt;br /&gt;
|Directory containing format policy XML files&lt;br /&gt;
|/etc/transcoder/archivematicaFormatPolicies/&lt;br /&gt;
|-&lt;br /&gt;
|transcoderScriptsDir&lt;br /&gt;
|Directory containing transcoder normalization scripts&lt;br /&gt;
|/usr/lib/transcoder/transcoderScripts/&lt;br /&gt;
|-&lt;br /&gt;
|convertPath&lt;br /&gt;
|Path to ImageMagick for image conversion. Requires a space at the end.&lt;br /&gt;
|/usr/bin/convert&lt;br /&gt;
|-&lt;br /&gt;
|ffmpegPath&lt;br /&gt;
|Path to ffmpeg for audio and video. Requires a space at the end.&lt;br /&gt;
|/usr/bin/ffmpeg&lt;br /&gt;
|-&lt;br /&gt;
|theoraPath&lt;br /&gt;
|Path to ffmpeg2theora script to create Ogg Theora and Vorbis files. Currently unused. Requires a space at the end.&lt;br /&gt;
|/usr/bin/ffmpeg2theora&lt;br /&gt;
|-&lt;br /&gt;
|unoconvPath&lt;br /&gt;
|Path to unoconv binary for converting document files. Currently unused. Requires a space at the end.&lt;br /&gt;
|/usr/bin/unoconv&lt;br /&gt;
|-&lt;br /&gt;
|unoconvAlternatePath&lt;br /&gt;
|Path to unoconv launcher script for converting document files. Requires a space at the end.&lt;br /&gt;
|/usr/lib/transcoder/transcoderScripts/unoconvAlternative.sh&lt;br /&gt;
|-&lt;br /&gt;
|DublinCore&lt;br /&gt;
|File name for Dublin Core metadata&lt;br /&gt;
|dublincore.xml&lt;br /&gt;
|-&lt;br /&gt;
|MD5FileName&lt;br /&gt;
|File name containing SIP MD5 checksum&lt;br /&gt;
|MD5checksum.txt&lt;br /&gt;
|-&lt;br /&gt;
|fileUUIDHumanReadable&lt;br /&gt;
|Log file containing unique IDs for items within a SIP&lt;br /&gt;
|FileUUIDs.log&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==archivematicaFormatPolicies==&lt;br /&gt;
&lt;br /&gt;
The /etc/transcoder/archivematicaFormatPolicies directory contains XML files which control how Archivematica performs normalization. Transcoder reads the file extension of a file and selects the matching XML file to determine how to perform normalization. Note that, because normalization is based on file extension, objects with an incorrect file extension or no extension will usually fail to normalize - this is [http://code.google.com/p/archivematica/issues/detail?id=156 scheduled] to change in Archivematica 0.7.2.&lt;br /&gt;
&lt;br /&gt;
The following sample configuration file illustrates the syntax:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;source lang=&amp;quot;xml&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;formatPolicy&amp;gt;&lt;br /&gt;
  &amp;lt;inherit&amp;gt;&amp;lt;/inherit&amp;gt;&lt;br /&gt;
  &amp;lt;accessFormat&amp;gt;MP3&amp;lt;/accessFormat&amp;gt;&lt;br /&gt;
  &amp;lt;preservationFormat&amp;gt;WAV&amp;lt;/preservationFormat&amp;gt;&lt;br /&gt;
  &amp;lt;accessConversionCommand&amp;gt;%ffmpegPath% -i %fileFullName% -ab 192000 %accessFileDirectory%%fileTitle%.%accessFormat%&amp;lt;/accessConversionCommand&amp;gt;&lt;br /&gt;
  &amp;lt;preservationConversionCommand&amp;gt;%ffmpegPath% -i %fileFullName% %preservationFileDirectory%%fileTitle%.%preservationFormat%&amp;lt;/preservationConversionCommand&amp;gt;&lt;br /&gt;
  &amp;lt;preservationConversionCommand&amp;gt;%xenaPath%&amp;lt;/preservationConversionCommand&amp;gt;&lt;br /&gt;
&amp;lt;/formatPolicy&amp;gt;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Every format policy XML encloses its contents in the '''&amp;lt;formatPolicy&amp;gt;&amp;lt;/formatPolicy&amp;gt;''' tag. All content intended to be read by transcoder for normalization should be between the opening and close tags.&lt;br /&gt;
&lt;br /&gt;
* '''&amp;lt;inherit&amp;gt;''' should should be used when a format shares the same normalization processes as another format. For instance, all audio files (MP3, WMA, etc.) share a common access format and preservation format and are converted using the same command. For instance, the XML files for MP3 and WMA contain only an &amp;lt;inherit&amp;gt; tag pointing to the &amp;quot;parent&amp;quot; AUDIO document that contains the commands for all audio types. If &amp;lt;inherit&amp;gt; is used, all other tags should be empty.&lt;br /&gt;
* '''&amp;lt;accessFormat&amp;gt;''' defines the access format defined for the specified object type. It should be expressed using the file format for that type, in capital letters. For instance, '''MP3''' is the accessFormat for audio files.&lt;br /&gt;
* '''&amp;lt;preservationFormat&amp;gt;''' defines the preservation format for the specified object type. It uses the same syntax as &amp;lt;accessFormat&amp;gt;&lt;br /&gt;
* '''&amp;lt;accessConversionCommand&amp;gt;''' contains the commandline options for launching the tool which creates an access copy from the specified object type. This can be customized or replaced. Any variables in use are defined by normalizationConfig.conf and should be customized there.&lt;br /&gt;
* '''&amp;lt;preservationConversionCommand&amp;gt;''' contains the commandline options for launching the tool which creates a preservation copy from the specified object type. This can be customized or replaced. Any variables in use are defined by normalizationConfig.conf and should be customized there.&lt;br /&gt;
&lt;br /&gt;
Note that all tags should be present, regardless if they are being used. For instance, if the format in question does not inherit another type, the inherit opening and closing tags should still be present without enclosing any content, e.g.: &amp;lt;inherit&amp;gt;&amp;lt;/inherit&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Known Issues===&lt;br /&gt;
&lt;br /&gt;
*Transcode will not normalize between codecs when using the same container. Transcoder does not normalize files which are already in an access/preservation format. Because it currently uses the file extension to identify file formats, this means that transcode erroneously conflates video containers with video codecs, and won't normalize from one file extension to the same file extension. This is expected to be fixed in Archivematica 0.7.1. See [http://code.google.com/p/archivematica/issues/detail?id=468 issue 468]&lt;br /&gt;
&lt;br /&gt;
TODO: learn why there can be two &amp;lt;preservationConversionCommand&amp;gt;s and update documentation&lt;br /&gt;
&lt;br /&gt;
[[Category:Development documentation]]&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3632</id>
		<title>Transcoder</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3632"/>
		<updated>2011-03-08T17:53:42Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: /* archivematicaFormatPolicies */ Fix bolding&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Transcode: convert (language or information) from one form of coded representation to another.[ source: [http://oxforddictionaries.com/view/entry/m_en_gb0876960#m_en_gb0876960 Oxford English Dictionary] ]&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
The transcoder is developed by artefactual, for the purpose of normalization and generating access copies in the archivematica system. In earlier versions it was called normalizer. It will try to identify the file type by the file extension, or other metadata, and look for matching configured actions for those identified. It will then perform those actions, and exit with a zero status if it believes those actions have been completed successfully.&lt;br /&gt;
&lt;br /&gt;
=Development=&lt;br /&gt;
Presently to manage the complexity of automating the link between file identification and actions, a database based implementation of the transcoder is being built to replace the current xml one.&lt;br /&gt;
&lt;br /&gt;
=Configuration=&lt;br /&gt;
Configuration files are located in the /etc/transcoder/ directory.&lt;br /&gt;
&lt;br /&gt;
==transcoderConfig.conf==&lt;br /&gt;
&lt;br /&gt;
transcoderConfig.conf is the primary transcoder configuration file. It is a bash script which defines the variables used in the various file format policy XML files; it primarily contains paths to conversion tools and standard file names.&lt;br /&gt;
&lt;br /&gt;
Variables are stored as standard bash shell script variables. Variables can be added or edited using any text editor; any new variables added become available for use in format policy XML files. They use the format:&lt;br /&gt;
&lt;br /&gt;
variableName=&amp;quot;variable contents&amp;quot;&lt;br /&gt;
&lt;br /&gt;
Default variables:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|'''Variable'''&lt;br /&gt;
|'''Description'''&lt;br /&gt;
|'''Default value'''&lt;br /&gt;
|-&lt;br /&gt;
|formatPoliciesPath&lt;br /&gt;
|Directory containing format policy XML files&lt;br /&gt;
|/etc/transcoder/archivematicaFormatPolicies/&lt;br /&gt;
|-&lt;br /&gt;
|transcoderScriptsDir&lt;br /&gt;
|Directory containing transcoder normalization scripts&lt;br /&gt;
|/usr/lib/transcoder/transcoderScripts/&lt;br /&gt;
|-&lt;br /&gt;
|convertPath&lt;br /&gt;
|Path to ImageMagick for image conversion. Requires a space at the end.&lt;br /&gt;
|/usr/bin/convert&lt;br /&gt;
|-&lt;br /&gt;
|ffmpegPath&lt;br /&gt;
|Path to ffmpeg for audio and video. Requires a space at the end.&lt;br /&gt;
|/usr/bin/ffmpeg&lt;br /&gt;
|-&lt;br /&gt;
|theoraPath&lt;br /&gt;
|Path to ffmpeg2theora script to create Ogg Theora and Vorbis files. Currently unused. Requires a space at the end.&lt;br /&gt;
|/usr/bin/ffmpeg2theora&lt;br /&gt;
|-&lt;br /&gt;
|unoconvPath&lt;br /&gt;
|Path to unoconv binary for converting document files. Currently unused. Requires a space at the end.&lt;br /&gt;
|/usr/bin/unoconv&lt;br /&gt;
|-&lt;br /&gt;
|unoconvAlternatePath&lt;br /&gt;
|Path to unoconv launcher script for converting document files. Requires a space at the end.&lt;br /&gt;
|/usr/lib/transcoder/transcoderScripts/unoconvAlternative.sh&lt;br /&gt;
|-&lt;br /&gt;
|DublinCore&lt;br /&gt;
|File name for Dublin Core metadata&lt;br /&gt;
|dublincore.xml&lt;br /&gt;
|-&lt;br /&gt;
|MD5FileName&lt;br /&gt;
|File name containing SIP MD5 checksum&lt;br /&gt;
|MD5checksum.txt&lt;br /&gt;
|-&lt;br /&gt;
|fileUUIDHumanReadable&lt;br /&gt;
|Log file containing unique IDs for items within a SIP&lt;br /&gt;
|FileUUIDs.log&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==archivematicaFormatPolicies==&lt;br /&gt;
&lt;br /&gt;
The /etc/transcoder/archivematicaFormatPolicies directory contains XML files which control how Archivematica performs normalization. Transcoder reads the file extension of a file and selects the matching XML file to determine how to perform normalization. Note that, because normalization is based on file extension, objects with an incorrect file extension or no extension will usually fail to normalize - this is [http://code.google.com/p/archivematica/issues/detail?id=156 scheduled] to change in Archivematica 0.7.2.&lt;br /&gt;
&lt;br /&gt;
The following sample configuration file illustrates the syntax:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;xml&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;formatPolicy&amp;gt;&lt;br /&gt;
  &amp;lt;inherit&amp;gt;&amp;lt;/inherit&amp;gt;&lt;br /&gt;
  &amp;lt;accessFormat&amp;gt;MP3&amp;lt;/accessFormat&amp;gt;&lt;br /&gt;
  &amp;lt;preservationFormat&amp;gt;WAV&amp;lt;/preservationFormat&amp;gt;&lt;br /&gt;
  &amp;lt;accessConversionCommand&amp;gt;%ffmpegPath% -i %fileFullName% -ab 192000 %accessFileDirectory%%fileTitle%.%accessFormat%&amp;lt;/accessConversionCommand&amp;gt;&lt;br /&gt;
  &amp;lt;preservationConversionCommand&amp;gt;%ffmpegPath% -i %fileFullName% %preservationFileDirectory%%fileTitle%.%preservationFormat%&amp;lt;/preservationConversionCommand&amp;gt;&lt;br /&gt;
  &amp;lt;preservationConversionCommand&amp;gt;%xenaPath%&amp;lt;/preservationConversionCommand&amp;gt;&lt;br /&gt;
&amp;lt;/formatPolicy&amp;gt;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Every format policy XML encloses its contents in the '''&amp;lt;formatPolicy&amp;gt;&amp;lt;/formatPolicy&amp;gt;''' tag. All content intended to be read by transcoder for normalization should be between the opening and close tags.&lt;br /&gt;
&lt;br /&gt;
* '''&amp;lt;inherit&amp;gt;''' should should be used when a format shares the same normalization processes as another format. For instance, all audio files (MP3, WMA, etc.) share a common access format and preservation format and are converted using the same command. For instance, the XML files for MP3 and WMA contain only an &amp;lt;inherit&amp;gt; tag pointing to the &amp;quot;parent&amp;quot; AUDIO document that contains the commands for all audio types. If &amp;lt;inherit&amp;gt; is used, all other tags should be empty.&lt;br /&gt;
* '''&amp;lt;accessFormat&amp;gt;''' defines the access format defined for the specified object type. It should be expressed using the file format for that type, in capital letters. For instance, '''MP3''' is the accessFormat for audio files.&lt;br /&gt;
* '''&amp;lt;preservationFormat&amp;gt;''' defines the preservation format for the specified object type. It uses the same syntax as &amp;lt;accessFormat&amp;gt;&lt;br /&gt;
* '''&amp;lt;accessConversionCommand&amp;gt;''' contains the commandline options for launching the tool which creates an access copy from the specified object type. This can be customized or replaced. Any variables in use are defined by normalizationConfig.conf and should be customized there.&lt;br /&gt;
* '''&amp;lt;preservationConversionCommand&amp;gt;''' contains the commandline options for launching the tool which creates a preservation copy from the specified object type. This can be customized or replaced. Any variables in use are defined by normalizationConfig.conf and should be customized there.&lt;br /&gt;
&lt;br /&gt;
Note that all tags should be present, regardless if they are being used. For instance, if the format in question does not inherit another type, the inherit opening and closing tags should still be present without enclosing any content, e.g.: &amp;lt;inherit&amp;gt;&amp;lt;/inherit&amp;gt;&lt;br /&gt;
&lt;br /&gt;
TODO: learn why there can be two &amp;lt;preservationConversionCommand&amp;gt;s and update documentation&lt;br /&gt;
&lt;br /&gt;
[[Category:Development documentation]]&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3631</id>
		<title>Transcoder</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3631"/>
		<updated>2011-03-08T17:52:50Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: /* transcoderConfig.conf */ Those italics were kind of ugly&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Transcode: convert (language or information) from one form of coded representation to another.[ source: [http://oxforddictionaries.com/view/entry/m_en_gb0876960#m_en_gb0876960 Oxford English Dictionary] ]&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
The transcoder is developed by artefactual, for the purpose of normalization and generating access copies in the archivematica system. In earlier versions it was called normalizer. It will try to identify the file type by the file extension, or other metadata, and look for matching configured actions for those identified. It will then perform those actions, and exit with a zero status if it believes those actions have been completed successfully.&lt;br /&gt;
&lt;br /&gt;
=Development=&lt;br /&gt;
Presently to manage the complexity of automating the link between file identification and actions, a database based implementation of the transcoder is being built to replace the current xml one.&lt;br /&gt;
&lt;br /&gt;
=Configuration=&lt;br /&gt;
Configuration files are located in the /etc/transcoder/ directory.&lt;br /&gt;
&lt;br /&gt;
==transcoderConfig.conf==&lt;br /&gt;
&lt;br /&gt;
transcoderConfig.conf is the primary transcoder configuration file. It is a bash script which defines the variables used in the various file format policy XML files; it primarily contains paths to conversion tools and standard file names.&lt;br /&gt;
&lt;br /&gt;
Variables are stored as standard bash shell script variables. Variables can be added or edited using any text editor; any new variables added become available for use in format policy XML files. They use the format:&lt;br /&gt;
&lt;br /&gt;
variableName=&amp;quot;variable contents&amp;quot;&lt;br /&gt;
&lt;br /&gt;
Default variables:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|'''Variable'''&lt;br /&gt;
|'''Description'''&lt;br /&gt;
|'''Default value'''&lt;br /&gt;
|-&lt;br /&gt;
|formatPoliciesPath&lt;br /&gt;
|Directory containing format policy XML files&lt;br /&gt;
|/etc/transcoder/archivematicaFormatPolicies/&lt;br /&gt;
|-&lt;br /&gt;
|transcoderScriptsDir&lt;br /&gt;
|Directory containing transcoder normalization scripts&lt;br /&gt;
|/usr/lib/transcoder/transcoderScripts/&lt;br /&gt;
|-&lt;br /&gt;
|convertPath&lt;br /&gt;
|Path to ImageMagick for image conversion. Requires a space at the end.&lt;br /&gt;
|/usr/bin/convert&lt;br /&gt;
|-&lt;br /&gt;
|ffmpegPath&lt;br /&gt;
|Path to ffmpeg for audio and video. Requires a space at the end.&lt;br /&gt;
|/usr/bin/ffmpeg&lt;br /&gt;
|-&lt;br /&gt;
|theoraPath&lt;br /&gt;
|Path to ffmpeg2theora script to create Ogg Theora and Vorbis files. Currently unused. Requires a space at the end.&lt;br /&gt;
|/usr/bin/ffmpeg2theora&lt;br /&gt;
|-&lt;br /&gt;
|unoconvPath&lt;br /&gt;
|Path to unoconv binary for converting document files. Currently unused. Requires a space at the end.&lt;br /&gt;
|/usr/bin/unoconv&lt;br /&gt;
|-&lt;br /&gt;
|unoconvAlternatePath&lt;br /&gt;
|Path to unoconv launcher script for converting document files. Requires a space at the end.&lt;br /&gt;
|/usr/lib/transcoder/transcoderScripts/unoconvAlternative.sh&lt;br /&gt;
|-&lt;br /&gt;
|DublinCore&lt;br /&gt;
|File name for Dublin Core metadata&lt;br /&gt;
|dublincore.xml&lt;br /&gt;
|-&lt;br /&gt;
|MD5FileName&lt;br /&gt;
|File name containing SIP MD5 checksum&lt;br /&gt;
|MD5checksum.txt&lt;br /&gt;
|-&lt;br /&gt;
|fileUUIDHumanReadable&lt;br /&gt;
|Log file containing unique IDs for items within a SIP&lt;br /&gt;
|FileUUIDs.log&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==archivematicaFormatPolicies==&lt;br /&gt;
&lt;br /&gt;
The /etc/transcoder/archivematicaFormatPolicies directory contains XML files which control how Archivematica performs normalization. Transcoder reads the file extension of a file and selects the matching XML file to determine how to perform normalization. Note that, because normalization is based on file extension, objects with an incorrect file extension or no extension will usually fail to normalize - this is [http://code.google.com/p/archivematica/issues/detail?id=156 scheduled] to change in Archivematica 0.7.2.&lt;br /&gt;
&lt;br /&gt;
The following sample configuration file illustrates the syntax:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;xml&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;formatPolicy&amp;gt;&lt;br /&gt;
  &amp;lt;inherit&amp;gt;&amp;lt;/inherit&amp;gt;&lt;br /&gt;
  &amp;lt;accessFormat&amp;gt;MP3&amp;lt;/accessFormat&amp;gt;&lt;br /&gt;
  &amp;lt;preservationFormat&amp;gt;WAV&amp;lt;/preservationFormat&amp;gt;&lt;br /&gt;
  &amp;lt;accessConversionCommand&amp;gt;%ffmpegPath% -i %fileFullName% -ab 192000 %accessFileDirectory%%fileTitle%.%accessFormat%&amp;lt;/accessConversionCommand&amp;gt;&lt;br /&gt;
  &amp;lt;preservationConversionCommand&amp;gt;%ffmpegPath% -i %fileFullName% %preservationFileDirectory%%fileTitle%.%preservationFormat%&amp;lt;/preservationConversionCommand&amp;gt;&lt;br /&gt;
  &amp;lt;preservationConversionCommand&amp;gt;%xenaPath%&amp;lt;/preservationConversionCommand&amp;gt;&lt;br /&gt;
&amp;lt;/formatPolicy&amp;gt;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Every format policy XML encloses its contents in the '''&amp;lt;formatPolicy&amp;gt;&amp;lt;/formatPolicy&amp;gt;''' tag. All content intended to be read by transcoder for normalization should be between the opening and close tags.&lt;br /&gt;
&lt;br /&gt;
* '''&amp;lt;inherit&amp;gt;''' should should be used when a format shares the same normalization processes as another format. For instance, all audio files (MP3, WMA, etc.) share a common access format and preservation format and are converted using the same command. For instance, the XML files for MP3 and WMA contain only an &amp;lt;inherit&amp;gt; tag pointing to the &amp;quot;parent&amp;quot; AUDIO document that contains the commands for all audio types. If &amp;lt;inherit&amp;gt; is used, all other tags should be empty.&lt;br /&gt;
* '''&amp;lt;accessFormat&amp;gt;''' defines the access format defined for the specified object type. It should be expressed using the file format for that type, in capital letters. For instance, '''MP3''' is the accessFormat for audio files.&lt;br /&gt;
* '''&amp;lt;preservationFormat&amp;gt;''' defines the preservation format for the specified object type. It uses the same syntax as &amp;lt;accessFormat&amp;gt;&lt;br /&gt;
* '''&amp;lt;accessConversionCommand&amp;gt;''' contains the commandline options for launching the tool which creates an access copy from the specified object type. This can be customized or replaced. Any variables in use are defined by normalizationConfig.conf and should be customized there.&lt;br /&gt;
* '''&amp;lt;preservationConversionCommand&amp;gt;contains the commandline options for launching the tool which creates a preservation copy from the specified object type. This can be customized or replaced. Any variables in use are defined by normalizationConfig.conf and should be customized there.&lt;br /&gt;
&lt;br /&gt;
Note that all tags should be present, regardless if they are being used. For instance, if the format in question does not inherit another type, the inherit opening and closing tags should still be present without enclosing any content, e.g.: &amp;lt;inherit&amp;gt;&amp;lt;/inherit&amp;gt;&lt;br /&gt;
&lt;br /&gt;
TODO: learn why there can be two &amp;lt;preservationConversionCommand&amp;gt;s and update documentation&lt;br /&gt;
&lt;br /&gt;
[[Category:Development documentation]]&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3630</id>
		<title>Transcoder</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3630"/>
		<updated>2011-03-08T17:45:12Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: /* archivematicaFormatPolicies */ Folder --&amp;gt; Directory&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Transcode: convert (language or information) from one form of coded representation to another.[ source: [http://oxforddictionaries.com/view/entry/m_en_gb0876960#m_en_gb0876960 Oxford English Dictionary] ]&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
The transcoder is developed by artefactual, for the purpose of normalization and generating access copies in the archivematica system. In earlier versions it was called normalizer. It will try to identify the file type by the file extension, or other metadata, and look for matching configured actions for those identified. It will then perform those actions, and exit with a zero status if it believes those actions have been completed successfully.&lt;br /&gt;
&lt;br /&gt;
=Development=&lt;br /&gt;
Presently to manage the complexity of automating the link between file identification and actions, a database based implementation of the transcoder is being built to replace the current xml one.&lt;br /&gt;
&lt;br /&gt;
=Configuration=&lt;br /&gt;
Configuration files are located in the /etc/transcoder/ directory.&lt;br /&gt;
&lt;br /&gt;
==transcoderConfig.conf==&lt;br /&gt;
&lt;br /&gt;
transcoderConfig.conf is the primary transcoder configuration file. It is a bash script which defines the variables used in the various file format policy XML files; it primarily contains paths to conversion tools and standard file names.&lt;br /&gt;
&lt;br /&gt;
Variables are stored as standard bash shell script variables. Variables can be added or edited using any text editor; any new variables added become available for use in format policy XML files. They use the format:&lt;br /&gt;
&lt;br /&gt;
variableName=&amp;quot;variable contents&amp;quot;&lt;br /&gt;
&lt;br /&gt;
Default variables:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|'''Variable'''&lt;br /&gt;
|'''Description'''&lt;br /&gt;
|'''Default value'''&lt;br /&gt;
|-&lt;br /&gt;
|formatPoliciesPath&lt;br /&gt;
|Directory containing format policy XML files&lt;br /&gt;
|''/etc/transcoder/archivematicaFormatPolicies/''&lt;br /&gt;
|-&lt;br /&gt;
|transcoderScriptsDir&lt;br /&gt;
|Directory containing transcoder normalization scripts&lt;br /&gt;
|''/usr/lib/transcoder/transcoderScripts/''&lt;br /&gt;
|-&lt;br /&gt;
|convertPath&lt;br /&gt;
|Path to ImageMagick for image conversion. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/convert''&lt;br /&gt;
|-&lt;br /&gt;
|ffmpegPath&lt;br /&gt;
|Path to ffmpeg for audio and video. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/ffmpeg''&lt;br /&gt;
|-&lt;br /&gt;
|theoraPath&lt;br /&gt;
|Path to ffmpeg2theora script to create Ogg Theora and Vorbis files. Currently unused. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/ffmpeg2theora''&lt;br /&gt;
|-&lt;br /&gt;
|unoconvPath&lt;br /&gt;
|Path to unoconv binary for converting document files. Currently unused. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/unoconv''&lt;br /&gt;
|-&lt;br /&gt;
|unoconvAlternatePath&lt;br /&gt;
|Path to unoconv launcher script for converting document files. Requires a space at the end.&lt;br /&gt;
|''/usr/lib/transcoder/transcoderScripts/unoconvAlternative.sh''&lt;br /&gt;
|-&lt;br /&gt;
|DublinCore&lt;br /&gt;
|File name for Dublin Core metadata&lt;br /&gt;
|''dublincore.xml''&lt;br /&gt;
|-&lt;br /&gt;
|MD5FileName&lt;br /&gt;
|File name containing SIP MD5 checksum&lt;br /&gt;
|''MD5checksum.txt''&lt;br /&gt;
|-&lt;br /&gt;
|fileUUIDHumanReadable&lt;br /&gt;
|Log file containing unique IDs for items within a SIP&lt;br /&gt;
|''FileUUIDs.log''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==archivematicaFormatPolicies==&lt;br /&gt;
&lt;br /&gt;
The /etc/transcoder/archivematicaFormatPolicies directory contains XML files which control how Archivematica performs normalization. Transcoder reads the file extension of a file and selects the matching XML file to determine how to perform normalization. Note that, because normalization is based on file extension, objects with an incorrect file extension or no extension will usually fail to normalize - this is [http://code.google.com/p/archivematica/issues/detail?id=156 scheduled] to change in Archivematica 0.7.2.&lt;br /&gt;
&lt;br /&gt;
The following sample configuration file illustrates the syntax:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;xml&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;formatPolicy&amp;gt;&lt;br /&gt;
  &amp;lt;inherit&amp;gt;&amp;lt;/inherit&amp;gt;&lt;br /&gt;
  &amp;lt;accessFormat&amp;gt;MP3&amp;lt;/accessFormat&amp;gt;&lt;br /&gt;
  &amp;lt;preservationFormat&amp;gt;WAV&amp;lt;/preservationFormat&amp;gt;&lt;br /&gt;
  &amp;lt;accessConversionCommand&amp;gt;%ffmpegPath% -i %fileFullName% -ab 192000 %accessFileDirectory%%fileTitle%.%accessFormat%&amp;lt;/accessConversionCommand&amp;gt;&lt;br /&gt;
  &amp;lt;preservationConversionCommand&amp;gt;%ffmpegPath% -i %fileFullName% %preservationFileDirectory%%fileTitle%.%preservationFormat%&amp;lt;/preservationConversionCommand&amp;gt;&lt;br /&gt;
  &amp;lt;preservationConversionCommand&amp;gt;%xenaPath%&amp;lt;/preservationConversionCommand&amp;gt;&lt;br /&gt;
&amp;lt;/formatPolicy&amp;gt;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Every format policy XML encloses its contents in the '''&amp;lt;formatPolicy&amp;gt;&amp;lt;/formatPolicy&amp;gt;''' tag. All content intended to be read by transcoder for normalization should be between the opening and close tags.&lt;br /&gt;
&lt;br /&gt;
* '''&amp;lt;inherit&amp;gt;''' should should be used when a format shares the same normalization processes as another format. For instance, all audio files (MP3, WMA, etc.) share a common access format and preservation format and are converted using the same command. For instance, the XML files for MP3 and WMA contain only an &amp;lt;inherit&amp;gt; tag pointing to the &amp;quot;parent&amp;quot; AUDIO document that contains the commands for all audio types. If &amp;lt;inherit&amp;gt; is used, all other tags should be empty.&lt;br /&gt;
* '''&amp;lt;accessFormat&amp;gt;''' defines the access format defined for the specified object type. It should be expressed using the file format for that type, in capital letters. For instance, '''MP3''' is the accessFormat for audio files.&lt;br /&gt;
* '''&amp;lt;preservationFormat&amp;gt;''' defines the preservation format for the specified object type. It uses the same syntax as &amp;lt;accessFormat&amp;gt;&lt;br /&gt;
* '''&amp;lt;accessConversionCommand&amp;gt;''' contains the commandline options for launching the tool which creates an access copy from the specified object type. This can be customized or replaced. Any variables in use are defined by normalizationConfig.conf and should be customized there.&lt;br /&gt;
* '''&amp;lt;preservationConversionCommand&amp;gt;contains the commandline options for launching the tool which creates a preservation copy from the specified object type. This can be customized or replaced. Any variables in use are defined by normalizationConfig.conf and should be customized there.&lt;br /&gt;
&lt;br /&gt;
Note that all tags should be present, regardless if they are being used. For instance, if the format in question does not inherit another type, the inherit opening and closing tags should still be present without enclosing any content, e.g.: &amp;lt;inherit&amp;gt;&amp;lt;/inherit&amp;gt;&lt;br /&gt;
&lt;br /&gt;
TODO: learn why there can be two &amp;lt;preservationConversionCommand&amp;gt;s and update documentation&lt;br /&gt;
&lt;br /&gt;
[[Category:Development documentation]]&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3629</id>
		<title>Transcoder</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3629"/>
		<updated>2011-03-08T17:43:39Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: /* archivematicaFormatPolicies */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Transcode: convert (language or information) from one form of coded representation to another.[ source: [http://oxforddictionaries.com/view/entry/m_en_gb0876960#m_en_gb0876960 Oxford English Dictionary] ]&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
The transcoder is developed by artefactual, for the purpose of normalization and generating access copies in the archivematica system. In earlier versions it was called normalizer. It will try to identify the file type by the file extension, or other metadata, and look for matching configured actions for those identified. It will then perform those actions, and exit with a zero status if it believes those actions have been completed successfully.&lt;br /&gt;
&lt;br /&gt;
=Development=&lt;br /&gt;
Presently to manage the complexity of automating the link between file identification and actions, a database based implementation of the transcoder is being built to replace the current xml one.&lt;br /&gt;
&lt;br /&gt;
=Configuration=&lt;br /&gt;
Configuration files are located in the /etc/transcoder/ directory.&lt;br /&gt;
&lt;br /&gt;
==transcoderConfig.conf==&lt;br /&gt;
&lt;br /&gt;
transcoderConfig.conf is the primary transcoder configuration file. It is a bash script which defines the variables used in the various file format policy XML files; it primarily contains paths to conversion tools and standard file names.&lt;br /&gt;
&lt;br /&gt;
Variables are stored as standard bash shell script variables. Variables can be added or edited using any text editor; any new variables added become available for use in format policy XML files. They use the format:&lt;br /&gt;
&lt;br /&gt;
variableName=&amp;quot;variable contents&amp;quot;&lt;br /&gt;
&lt;br /&gt;
Default variables:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|'''Variable'''&lt;br /&gt;
|'''Description'''&lt;br /&gt;
|'''Default value'''&lt;br /&gt;
|-&lt;br /&gt;
|formatPoliciesPath&lt;br /&gt;
|Directory containing format policy XML files&lt;br /&gt;
|''/etc/transcoder/archivematicaFormatPolicies/''&lt;br /&gt;
|-&lt;br /&gt;
|transcoderScriptsDir&lt;br /&gt;
|Directory containing transcoder normalization scripts&lt;br /&gt;
|''/usr/lib/transcoder/transcoderScripts/''&lt;br /&gt;
|-&lt;br /&gt;
|convertPath&lt;br /&gt;
|Path to ImageMagick for image conversion. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/convert''&lt;br /&gt;
|-&lt;br /&gt;
|ffmpegPath&lt;br /&gt;
|Path to ffmpeg for audio and video. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/ffmpeg''&lt;br /&gt;
|-&lt;br /&gt;
|theoraPath&lt;br /&gt;
|Path to ffmpeg2theora script to create Ogg Theora and Vorbis files. Currently unused. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/ffmpeg2theora''&lt;br /&gt;
|-&lt;br /&gt;
|unoconvPath&lt;br /&gt;
|Path to unoconv binary for converting document files. Currently unused. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/unoconv''&lt;br /&gt;
|-&lt;br /&gt;
|unoconvAlternatePath&lt;br /&gt;
|Path to unoconv launcher script for converting document files. Requires a space at the end.&lt;br /&gt;
|''/usr/lib/transcoder/transcoderScripts/unoconvAlternative.sh''&lt;br /&gt;
|-&lt;br /&gt;
|DublinCore&lt;br /&gt;
|File name for Dublin Core metadata&lt;br /&gt;
|''dublincore.xml''&lt;br /&gt;
|-&lt;br /&gt;
|MD5FileName&lt;br /&gt;
|File name containing SIP MD5 checksum&lt;br /&gt;
|''MD5checksum.txt''&lt;br /&gt;
|-&lt;br /&gt;
|fileUUIDHumanReadable&lt;br /&gt;
|Log file containing unique IDs for items within a SIP&lt;br /&gt;
|''FileUUIDs.log''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==archivematicaFormatPolicies==&lt;br /&gt;
&lt;br /&gt;
The /etc/transcoder/archivematicaFormatPolicies folder contains XML files which control how Archivematica performs normalization. Transcoder reads the file extension of a file and selects the matching XML file to determine how to perform normalization. Note that, because normalization is based on file extension, objects with an incorrect file extension or no extension will usually fail to normalize - this is [http://code.google.com/p/archivematica/issues/detail?id=156 scheduled] to change in Archivematica 0.7.2.&lt;br /&gt;
&lt;br /&gt;
The following sample configuration file illustrates the syntax:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;xml&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;formatPolicy&amp;gt;&lt;br /&gt;
  &amp;lt;inherit&amp;gt;&amp;lt;/inherit&amp;gt;&lt;br /&gt;
  &amp;lt;accessFormat&amp;gt;MP3&amp;lt;/accessFormat&amp;gt;&lt;br /&gt;
  &amp;lt;preservationFormat&amp;gt;WAV&amp;lt;/preservationFormat&amp;gt;&lt;br /&gt;
  &amp;lt;accessConversionCommand&amp;gt;%ffmpegPath% -i %fileFullName% -ab 192000 %accessFileDirectory%%fileTitle%.%accessFormat%&amp;lt;/accessConversionCommand&amp;gt;&lt;br /&gt;
  &amp;lt;preservationConversionCommand&amp;gt;%ffmpegPath% -i %fileFullName% %preservationFileDirectory%%fileTitle%.%preservationFormat%&amp;lt;/preservationConversionCommand&amp;gt;&lt;br /&gt;
  &amp;lt;preservationConversionCommand&amp;gt;%xenaPath%&amp;lt;/preservationConversionCommand&amp;gt;&lt;br /&gt;
&amp;lt;/formatPolicy&amp;gt;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Every format policy XML encloses its contents in the '''&amp;lt;formatPolicy&amp;gt;&amp;lt;/formatPolicy&amp;gt;''' tag. All content intended to be read by transcoder for normalization should be between the opening and close tags.&lt;br /&gt;
&lt;br /&gt;
* '''&amp;lt;inherit&amp;gt;''' should should be used when a format shares the same normalization processes as another format. For instance, all audio files (MP3, WMA, etc.) share a common access format and preservation format and are converted using the same command. For instance, the XML files for MP3 and WMA contain only an &amp;lt;inherit&amp;gt; tag pointing to the &amp;quot;parent&amp;quot; AUDIO document that contains the commands for all audio types. If &amp;lt;inherit&amp;gt; is used, all other tags should be empty.&lt;br /&gt;
* '''&amp;lt;accessFormat&amp;gt;''' defines the access format defined for the specified object type. It should be expressed using the file format for that type, in capital letters. For instance, '''MP3''' is the accessFormat for audio files.&lt;br /&gt;
* '''&amp;lt;preservationFormat&amp;gt;''' defines the preservation format for the specified object type. It uses the same syntax as &amp;lt;accessFormat&amp;gt;&lt;br /&gt;
* '''&amp;lt;accessConversionCommand&amp;gt;''' contains the commandline options for launching the tool which creates an access copy from the specified object type. This can be customized or replaced. Any variables in use are defined by normalizationConfig.conf and should be customized there.&lt;br /&gt;
* '''&amp;lt;preservationConversionCommand&amp;gt;contains the commandline options for launching the tool which creates a preservation copy from the specified object type. This can be customized or replaced. Any variables in use are defined by normalizationConfig.conf and should be customized there.&lt;br /&gt;
&lt;br /&gt;
Note that all tags should be present, regardless if they are being used. For instance, if the format in question does not inherit another type, the inherit opening and closing tags should still be present without enclosing any content, e.g.: &amp;lt;inherit&amp;gt;&amp;lt;/inherit&amp;gt;&lt;br /&gt;
&lt;br /&gt;
TODO: learn why there can be two &amp;lt;preservationConversionCommand&amp;gt;s and update documentation&lt;br /&gt;
&lt;br /&gt;
[[Category:Development documentation]]&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3628</id>
		<title>Transcoder</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3628"/>
		<updated>2011-03-08T17:42:48Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: /* Configuration */ added file policy XML documentation&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Transcode: convert (language or information) from one form of coded representation to another.[ source: [http://oxforddictionaries.com/view/entry/m_en_gb0876960#m_en_gb0876960 Oxford English Dictionary] ]&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
The transcoder is developed by artefactual, for the purpose of normalization and generating access copies in the archivematica system. In earlier versions it was called normalizer. It will try to identify the file type by the file extension, or other metadata, and look for matching configured actions for those identified. It will then perform those actions, and exit with a zero status if it believes those actions have been completed successfully.&lt;br /&gt;
&lt;br /&gt;
=Development=&lt;br /&gt;
Presently to manage the complexity of automating the link between file identification and actions, a database based implementation of the transcoder is being built to replace the current xml one.&lt;br /&gt;
&lt;br /&gt;
=Configuration=&lt;br /&gt;
Configuration files are located in the /etc/transcoder/ directory.&lt;br /&gt;
&lt;br /&gt;
==transcoderConfig.conf==&lt;br /&gt;
&lt;br /&gt;
transcoderConfig.conf is the primary transcoder configuration file. It is a bash script which defines the variables used in the various file format policy XML files; it primarily contains paths to conversion tools and standard file names.&lt;br /&gt;
&lt;br /&gt;
Variables are stored as standard bash shell script variables. Variables can be added or edited using any text editor; any new variables added become available for use in format policy XML files. They use the format:&lt;br /&gt;
&lt;br /&gt;
variableName=&amp;quot;variable contents&amp;quot;&lt;br /&gt;
&lt;br /&gt;
Default variables:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|'''Variable'''&lt;br /&gt;
|'''Description'''&lt;br /&gt;
|'''Default value'''&lt;br /&gt;
|-&lt;br /&gt;
|formatPoliciesPath&lt;br /&gt;
|Directory containing format policy XML files&lt;br /&gt;
|''/etc/transcoder/archivematicaFormatPolicies/''&lt;br /&gt;
|-&lt;br /&gt;
|transcoderScriptsDir&lt;br /&gt;
|Directory containing transcoder normalization scripts&lt;br /&gt;
|''/usr/lib/transcoder/transcoderScripts/''&lt;br /&gt;
|-&lt;br /&gt;
|convertPath&lt;br /&gt;
|Path to ImageMagick for image conversion. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/convert''&lt;br /&gt;
|-&lt;br /&gt;
|ffmpegPath&lt;br /&gt;
|Path to ffmpeg for audio and video. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/ffmpeg''&lt;br /&gt;
|-&lt;br /&gt;
|theoraPath&lt;br /&gt;
|Path to ffmpeg2theora script to create Ogg Theora and Vorbis files. Currently unused. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/ffmpeg2theora''&lt;br /&gt;
|-&lt;br /&gt;
|unoconvPath&lt;br /&gt;
|Path to unoconv binary for converting document files. Currently unused. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/unoconv''&lt;br /&gt;
|-&lt;br /&gt;
|unoconvAlternatePath&lt;br /&gt;
|Path to unoconv launcher script for converting document files. Requires a space at the end.&lt;br /&gt;
|''/usr/lib/transcoder/transcoderScripts/unoconvAlternative.sh''&lt;br /&gt;
|-&lt;br /&gt;
|DublinCore&lt;br /&gt;
|File name for Dublin Core metadata&lt;br /&gt;
|''dublincore.xml''&lt;br /&gt;
|-&lt;br /&gt;
|MD5FileName&lt;br /&gt;
|File name containing SIP MD5 checksum&lt;br /&gt;
|''MD5checksum.txt''&lt;br /&gt;
|-&lt;br /&gt;
|fileUUIDHumanReadable&lt;br /&gt;
|Log file containing unique IDs for items within a SIP&lt;br /&gt;
|''FileUUIDs.log''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==archivematicaFormatPolicies==&lt;br /&gt;
&lt;br /&gt;
The /etc/transcoder/archivematicaFormatPolicies folder contains XML files which control how Archivematica performs normalization. Transcoder reads the file extension of a file and selects the matching XML file to determine how to perform normalization. Note that, because normalization is based on file extension, objects with an incorrect file extension or no extension will usually fail to normalize - this is [http://code.google.com/p/archivematica/issues/detail?id=156 scheduled] to change in Archivematica 0.7.2.&lt;br /&gt;
&lt;br /&gt;
The following sample configuration file illustrates the syntax:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;xml&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;formatPolicy&amp;gt;&lt;br /&gt;
  &amp;lt;inherit&amp;gt;&amp;lt;/inherit&amp;gt;&lt;br /&gt;
  &amp;lt;accessFormat&amp;gt;MP3&amp;lt;/accessFormat&amp;gt;&lt;br /&gt;
  &amp;lt;preservationFormat&amp;gt;WAV&amp;lt;/preservationFormat&amp;gt;&lt;br /&gt;
  &amp;lt;accessConversionCommand&amp;gt;%ffmpegPath% -i %fileFullName% -ab 192000 %accessFileDirectory%%fileTitle%.%accessFormat%&amp;lt;/accessConversionCommand&amp;gt;&lt;br /&gt;
  &amp;lt;preservationConversionCommand&amp;gt;%ffmpegPath% -i %fileFullName% %preservationFileDirectory%%fileTitle%.%preservationFormat%&amp;lt;/preservationConversionCommand&amp;gt;&lt;br /&gt;
  &amp;lt;preservationConversionCommand&amp;gt;%xenaPath%&amp;lt;/preservationConversionCommand&amp;gt;&lt;br /&gt;
&amp;lt;/formatPolicy&amp;gt;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Every format policy XML encloses its contents in the '''&amp;lt;formatPolicy&amp;gt;&amp;lt;/formatPolicy&amp;gt;''' tag. All content intended to be read by transcoder for normalization should be between the opening and close tags.&lt;br /&gt;
&lt;br /&gt;
* '''&amp;lt;inherit&amp;gt;''' should should be used when a format shares the same normalization processes as another format. For instance, all audio files (MP3, WMA, etc.) share a common access format and preservation format and are converted using the same command. For instance, the XML files for MP3 and WMA contain only an &amp;lt;inherit&amp;gt; tag pointing to the &amp;quot;parent&amp;quot; AUDIO document that contains the commands for all audio types. If &amp;lt;inherit&amp;gt; is used, all other tags should be empty.&lt;br /&gt;
* '''&amp;lt;accessFormat&amp;gt;''' defines the access format defined for the specified object type. It should be expressed using the file format for that type, in capital letters. For instance, '''MP3''' is the accessFormat for audio files.&lt;br /&gt;
* '''&amp;lt;preservationFormat&amp;gt;''' defines the preservation format for the specified object type. It uses the same syntax as &amp;lt;accessFormat&amp;gt;&lt;br /&gt;
* '''&amp;lt;accessConversionCommand&amp;gt;''' contains the commandline options for launching the tool which creates an access copy from the specified object type. This can be customized or replaced. Any variables in use are defined by normalizationConfig.conf and should be customized there.&lt;br /&gt;
* '''&amp;lt;preservationConversionCommand&amp;gt;contains the commandline options for launching the tool which creates a preservation copy from the specified object type. This can be customized or replaced. Any variables in use are defined by normalizationConfig.conf and should be customized there.&lt;br /&gt;
&lt;br /&gt;
Note that all tags should be present, regardless if they are being used. For instance, if the format in question does not inherit another type, the inherit opening and closing tags should still be present without enclosing any content: &amp;lt;inherit&amp;gt;&amp;lt;/inherit&amp;gt;&lt;br /&gt;
&lt;br /&gt;
TODO: learn why there can be two &amp;lt;preservationConversionCommand&amp;gt;s and update documentation&lt;br /&gt;
&lt;br /&gt;
[[Category:Development documentation]]&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3627</id>
		<title>Transcoder</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3627"/>
		<updated>2011-03-08T16:38:04Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: /* transcoderConfig.conf */ Added additional editing documentation&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Transcode: convert (language or information) from one form of coded representation to another.[ source: [http://oxforddictionaries.com/view/entry/m_en_gb0876960#m_en_gb0876960 Oxford English Dictionary] ]&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
The transcoder is developed by artefactual, for the purpose of normalization and generating access copies in the archivematica system. In earlier versions it was called normalizer. It will try to identify the file type by the file extension, or other metadata, and look for matching configured actions for those identified. It will then perform those actions, and exit with a zero status if it believes those actions have been completed successfully.&lt;br /&gt;
&lt;br /&gt;
=Development=&lt;br /&gt;
Presently to manage the complexity of automating the link between file identification and actions, a database based implementation of the transcoder is being built to replace the current xml one.&lt;br /&gt;
&lt;br /&gt;
=Configuration=&lt;br /&gt;
Configuration files are located in the /etc/transcoder/ directory.&lt;br /&gt;
&lt;br /&gt;
==transcoderConfig.conf==&lt;br /&gt;
&lt;br /&gt;
transcoderConfig.conf is the primary transcoder configuration file. It is a bash script which defines the variables used in the various file format policy XML files; it primarily contains paths to conversion tools and standard file names.&lt;br /&gt;
&lt;br /&gt;
Variables are stored as standard bash shell script variables. Variables can be added or edited using any text editor; any new variables added become available for use in format policy XML files. They use the format:&lt;br /&gt;
&lt;br /&gt;
variableName=&amp;quot;variable contents&amp;quot;&lt;br /&gt;
&lt;br /&gt;
Default variables:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|'''Variable'''&lt;br /&gt;
|'''Description'''&lt;br /&gt;
|'''Default value'''&lt;br /&gt;
|-&lt;br /&gt;
|formatPoliciesPath&lt;br /&gt;
|Directory containing format policy XML files&lt;br /&gt;
|''/etc/transcoder/archivematicaFormatPolicies/''&lt;br /&gt;
|-&lt;br /&gt;
|transcoderScriptsDir&lt;br /&gt;
|Directory containing transcoder normalization scripts&lt;br /&gt;
|''/usr/lib/transcoder/transcoderScripts/''&lt;br /&gt;
|-&lt;br /&gt;
|convertPath&lt;br /&gt;
|Path to ImageMagick for image conversion. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/convert''&lt;br /&gt;
|-&lt;br /&gt;
|ffmpegPath&lt;br /&gt;
|Path to ffmpeg for audio and video. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/ffmpeg''&lt;br /&gt;
|-&lt;br /&gt;
|theoraPath&lt;br /&gt;
|Path to ffmpeg2theora script to create Ogg Theora and Vorbis files. Currently unused. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/ffmpeg2theora''&lt;br /&gt;
|-&lt;br /&gt;
|unoconvPath&lt;br /&gt;
|Path to unoconv binary for converting document files. Currently unused. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/unoconv''&lt;br /&gt;
|-&lt;br /&gt;
|unoconvAlternatePath&lt;br /&gt;
|Path to unoconv launcher script for converting document files. Requires a space at the end.&lt;br /&gt;
|''/usr/lib/transcoder/transcoderScripts/unoconvAlternative.sh''&lt;br /&gt;
|-&lt;br /&gt;
|DublinCore&lt;br /&gt;
|File name for Dublin Core metadata&lt;br /&gt;
|''dublincore.xml''&lt;br /&gt;
|-&lt;br /&gt;
|MD5FileName&lt;br /&gt;
|File name containing SIP MD5 checksum&lt;br /&gt;
|''MD5checksum.txt''&lt;br /&gt;
|-&lt;br /&gt;
|fileUUIDHumanReadable&lt;br /&gt;
|Log file containing unique IDs for items within a SIP&lt;br /&gt;
|''FileUUIDs.log''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
[[Category:Development documentation]]&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
	<entry>
		<id>https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3626</id>
		<title>Transcoder</title>
		<link rel="alternate" type="text/html" href="https://wiki.archivematica.org/index.php?title=Transcoder&amp;diff=3626"/>
		<updated>2011-03-08T16:32:51Z</updated>

		<summary type="html">&lt;p&gt;Misty De Meo: Added table documenting contents of transcoderConfig.conf file&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Transcode: convert (language or information) from one form of coded representation to another.[ source: [http://oxforddictionaries.com/view/entry/m_en_gb0876960#m_en_gb0876960 Oxford English Dictionary] ]&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
The transcoder is developed by artefactual, for the purpose of normalization and generating access copies in the archivematica system. In earlier versions it was called normalizer. It will try to identify the file type by the file extension, or other metadata, and look for matching configured actions for those identified. It will then perform those actions, and exit with a zero status if it believes those actions have been completed successfully.&lt;br /&gt;
&lt;br /&gt;
=Development=&lt;br /&gt;
Presently to manage the complexity of automating the link between file identification and actions, a database based implementation of the transcoder is being built to replace the current xml one.&lt;br /&gt;
&lt;br /&gt;
=Configuration=&lt;br /&gt;
Configuration files are located in the /etc/transcoder/ directory.&lt;br /&gt;
&lt;br /&gt;
==transcoderConfig.conf==&lt;br /&gt;
&lt;br /&gt;
transcoderConfig.conf is the primary transcoder configuration file. It is a bash script which defines the variables used in the various file format policy XML files; it primarily contains paths to conversion tools and standard file names.&lt;br /&gt;
&lt;br /&gt;
Default variables:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|'''Variable'''&lt;br /&gt;
|'''Description'''&lt;br /&gt;
|'''Default value'''&lt;br /&gt;
|-&lt;br /&gt;
|formatPoliciesPath&lt;br /&gt;
|Directory containing format policy XML files&lt;br /&gt;
|''/etc/transcoder/archivematicaFormatPolicies/''&lt;br /&gt;
|-&lt;br /&gt;
|transcoderScriptsDir&lt;br /&gt;
|Directory containing transcoder normalization scripts&lt;br /&gt;
|''/usr/lib/transcoder/transcoderScripts/''&lt;br /&gt;
|-&lt;br /&gt;
|convertPath&lt;br /&gt;
|Path to ImageMagick for image conversion. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/convert''&lt;br /&gt;
|-&lt;br /&gt;
|ffmpegPath&lt;br /&gt;
|Path to ffmpeg for audio and video. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/ffmpeg''&lt;br /&gt;
|-&lt;br /&gt;
|theoraPath&lt;br /&gt;
|Path to ffmpeg2theora script to create Ogg Theora and Vorbis files. Currently unused. Requires a space at the end.&lt;br /&gt;
|''/usr/bin/ffmpeg2theora''&lt;br /&gt;
|-&lt;br /&gt;
|unoconvPath&lt;br /&gt;
|Path to unoconv binary for converting document files. Currently unused. Requires a space at the end.&lt;br /&gt;
|/usr/bin/unoconv&lt;br /&gt;
|-&lt;br /&gt;
|unoconvAlternatePath&lt;br /&gt;
|Path to unoconv launcher script for converting document files. Requires a space at the end.&lt;br /&gt;
|''/usr/lib/transcoder/transcoderScripts/unoconvAlternative.sh''&lt;br /&gt;
|-&lt;br /&gt;
|DublinCore&lt;br /&gt;
|File name for Dublin Core metadata&lt;br /&gt;
|''dublincore.xml''&lt;br /&gt;
|-&lt;br /&gt;
|MD5FileName&lt;br /&gt;
|File name containing SIP MD5 checksum&lt;br /&gt;
|''MD5checksum.txt''&lt;br /&gt;
|-&lt;br /&gt;
|fileUUIDHumanReadable&lt;br /&gt;
|Log file containing unique IDs for items within a SIP&lt;br /&gt;
|''FileUUIDs.log''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
[[Category:Development documentation]]&lt;/div&gt;</summary>
		<author><name>Misty De Meo</name></author>
	</entry>
</feed>