Skip to content
preserve-cat edited this page Mar 18, 2022 · 5 revisions

The latest version of FITS is configured to use these tools for identifying, validating, and extracting technical metadata:

ADL Tool

  • Maintenance organization: Harvard Library
  • Formats supported: Audio Decision List files
  • Capabilities: Identifies file formats
  • Maintenance organization: UK National Archives
  • Details: DROID is written in Java. The FITS tool wrapper uses the provided API. The output is converted into a simple XML document and then converted to FITS XML using xml/droid/droid_to_fits.xslt. The DROID configuration file and signature file are located in the tools/droid directory.
  • Formats supported: many (> 1,000) - Listed in the DROID signature file
  • Capabilities: Identify and extract complete metadata from SMPTE DPX image files.
  • Maintenance organization: Library of Congress
  • Details: embARC is written in Java. The FITS tool wrapper uses the provided API. Raw tool output is provided in XML format.
  • Formats supported: dpx
  • Notes: Although the standalone embARC application processes DPX sequences natively, this integration with FITS only supports the processing of DPX files individually and not as a sequence.
  • Capabilities: Identifies and extracts technical metadata.
  • Maintenance: Phil Harvey
  • Details: Exiftool is written in Perl. A windows executable is also provided. The Exiftool tool wrapper detects the operating system type and calls the appropriate version of the tool. The tab-delimited output is captured, converted to a simple XML structure, and then converted to FITS XML using xslt. xml/exiftool/exiftool_xslt_map.xml is used to determine which XSLT to apply for the given identified format.
  • Formats Supported: jpg,tiff,jp2,gif,bmp,png,psd,dng,wav,mp3,mp4,m4a,aiff,rm,ogg,flac,xml,html,pdf,doc

ffident

  • Capabilities: Identifies file formats.
  • Maintenance: no longer maintained - archived site
  • Details: FFIdent is written in Java. The FITS tool wrapper uses the provided API. Output is converted into a simple XML document and then converted to FITS XML using xml/ffident/ffident_to_fits.xslt.
  • Formats Supported: Listed in the configuration file tools/ffident/formats.txt
  • Capabilities: Identifies files.
  • Details: File Utility is usually bundled with Linux, UNIX and OS X. The GnuWin32? port is provided for use on Windows. Due to variations in versions this may cause different output when run on different platforms. File Utility is called in its default mode (no arguments), and also with -i to determine the MIME type. The output is converted into a simple XML document and then converted to FITS XML using xml/fileutility/fileutility_to_fits.xslt
  • Formats Supported: many (> 1,000)
  • Capabilities: Identifies, extracts technical metadata, and validates files.
  • Maintenance: Open Preservation Foundation. See: http://jhove.openpreservation.org/
  • Details: JHOVE is written in Java. The FITS tool wrapper uses the provided API. The JHOVE XML output is converted to FITS XML using XSLT. xml/jhove/jhove_xslt_map.xml is used to determine which XSLT to apply for the given identified format.
  • Formats Supported: jpg,tiff,jp2,gif,wave,aiff,xml,html,ascii,utf-8,pdf.
  • Notes:
  • For JP2 files the JHOVE output element Transformation indicates whether the compression is lossy or lossless. The transformation values are described in Table A-20 of the JPEG2000 part 1 specification. A value of 0 maps to the 9-7 irreversible (lossy) filter. A value of 1 maps to 5-3 reversible (lossless) filter. This JHOVE element is used by FITS when it outputs the compressionScheme in the image metadata, writing it as JPEG 2000 Lossy or JPEG 2000 Lossless.
  • JHOVE does not validate the codestream but it checks the file structure.
  • Capabilities: Identifies and extracts technical metadata for video files.
  • Maintenance: MediaArea.net
  • Details: The MediaInfo API is written in C++ and is called via Java by using the JNA library. For more details on JNA, see: https://github.com/java-native-access/jna
  • Formats Supported: Although MediaInfo supports many video formats, FITS will only support the following video formats and wrappers out of the box:
  • avi, mov, mpg, mpeg, mkv, mp4, mxf, ogv, mj2, divx, dv, m4v, m2v, ism.
  • Capabilities: Identifies and extracts technical metadata.
  • Maintenance organization: National Library of New Zealand
  • Details: The FITS NLNZ tool wrapper uses the provided Java API. The NLNZ native XML output is converted to FITS XML using XSLT. xml/nlnz/fits/nlnz_xslt_map.xml is used to determine which XSLT to apply to the given identified format.
  • Formats Supported: jpg,tiff,gif,bmp,wav,mp3,xml,html,pdf,doc,wordperfect,msworks,odt

OIS Audio Information

  • Maintenance organization: Harvard Library
  • Formats supported: audio

OIS File Information

  • Capabilities: Extracts technical metadata.
  • Maintenance organization:Harvard Library
  • Details: FileInfo creates FITS XML without further normalization. It determines basic file information like file name, size, file system last modified date, and md5 checksums. It uses the fast md5 jar from http://www.twmacinta.com/myjava/fast_md5.php. Written by Harvard Library.
  • Formats Supported: Any

OIS XML Information

  • Capabilities: Identifies and extracts technical metadata.
  • Maintenance organization: Harvard Library
  • Details: XmlMetadata creates FITS XML without further normalization. Its sole purpose is to identify XML and parse out the default namespace and schema location. This is used for FITS text metadata. Written by Harvard Library.
  • Formats Supported: XML

VTT Tool

  • Capabilities: Identifies and extracts technical metadata.
  • Maintenance organization: Harvard Library
  • Details: VTT Tool creates FITS XML without further normalization. This is used for FITS text metadata. Written by Harvard Library.
  • Formats Supported: video