File Format

 Click here to show Table of Contents...

Abstract

REQUIRED

MODS Element name: <physicalDescription><internetMediaType>

Short definition: The electronic file format type/extension of the object.

Input guidelines: Select/enter the value that corresponds to file type/extension of object to be ingested. e.g. CR2, GIF, JPEG, PSD, TIFF. The file format in subelement <internetMediaType> is specified according to the template given in IANA's list of Media Types (formerly known as MIME types). Note that the expected values in the DAMS in some cases differ from the IANA specification. The expected values for JPEG and GIF images are "image/jpeg" and "image/gif" respectively, even though these values are not present in the IANA list. Example: For images in TIF (Tagged Image File) format, the expected value for internetMediaType is "image/tiff".

Definition

The electronic file format type/extension of the object (adapted from MODS standard).

DAMS input form fields

File Format

DAMS form field nameform field typerequiredMODS elementCollections Portal displaynotes
File Format

dropdown selection

Values available for selection are contingent on the broad class of asset being ingested. Note that the file type values listed in the web form for a broad class of asset do not always align with the list of file types supported for a particular Content model in the DAMS. For instance, not all image file types listed on the metadata web form are supported by the LARGE or BASIC IMAGE Content model. Instead, they would have to be ingested using the BINARY Content model.

  • Image
    • cr2
    • gif
    • jpg/jpeg
    • png
    • psd
    • tif/tiff
    • other
  • Audio
    • aif/aiff
    • au
    • m4a
    • mp3
    • wav
    • wma
    • other
  • Paged Content
    • pdf
    • tif/tiff
    • other
    • N/A
  • Text
    • csv
    • docx
    • epub
    • pdf
    • rtf
    • txt
    • xml
    • xlsx
    • xls
    • other
  • Video
    • avi
    • camrec
    • iso
    • m4v
    • mov
    • mp4
    • mts
    • swf
    • wmv
    • other
yes
N/A

MODS Element description

Element <physicalDescription>

http://www.loc.gov/standards/mods/userguide/physicaldescription.html

Guidelines for use

<physicalDescription> is a container element that contains all subelements relating to physical description information of the resource. Data is input only within each subelement.

Attributes

Currently no attributes for <physicalDescription> are implemented in the DAMS.

Subelements

The following subelements of <physicalDescription> are used in the DAMS:

  • internetMediaType
  • (extent)
  • (form)

Subelement <physicalDescription><internetMediaType>

Guidelines for use

The file format in subelement <internetMediaType> is specified according to the template given in IANA's list of Media Types (formerly known as MIME types): https://www.iana.org/assignments/media-types/media-types.xhtml. Note that the expected values in the DAMS in some cases differ from the IANA specification. The expected values for JPEG and GIF images are "image/jpeg" and "image/gif" respectively, even though these values are not present in the IANA list.

The DAMS software expects one of the following values for file format specification:

File formatinternetMediaType valueDAMS manual ingest form labelAvailable for Content Model
Canon Raw V2image/x-rawcr2Image
Graphics Interchange Formatimage/gifgifImage
JPEG (ISO/IEC 10918-1)image/jpegjpg/jpegImage
Portable Network Graphicsimage/pngpngImage
Photoshop Documentapplication/photoshoppsdImage
Tagged Image File Formatimage/tifftif/tiffImage
*otherotherImage
Audio Interchange File Formataudio/aiffaif/aiffAudio
AU Sound fileaudio/auauAudio
MPEG-4 Audio file (typically AAC, ALAC)audio/m4am4aAudio
MPEG-1/2 Audio Layer IIIaudio/mpegmp3Audio
Waveform Audio File Formataudio/x-wavwavAudio
Windows Media Audioaudio/wmawmaAudio
*otherotherAudio
Portable Document Formatapplication/pdf

pdf

Paged Content
Tagged Image File Formatimage/tifftif/tiffPaged Content
*otherotherPaged Content

not-applicable

N/A

Paged Content
Comma-separated valuestext/csvcsvText
Office Open XML (WordprocessingML)application/msworddocxText
EPUBtext/epubepubText
Portable Document Formatapplication/pdfpdfText
Rich Text Formattext/rtfrtfText
Plain texttext/txttxtText
Text file with XML contenttext/xmlxmlText
Office Open XML (SpreadsheetML)

application/vnd.openxmlformats-officedocument.spreadsheetml.sheet

xlsxText
Microsoft Excel (BIFF)

application/vnd.ms-excel

xlsText
*otherotherText
Audio Video Interleavevideo/aviaviVideo
Camtasia Studio recordingvideo/camreccamrecVideo
ISO (question)binary/isoisoVideo
M4V File (Apple)video/m4vm4vVideo
QuickTime File Formatvideo/quicktimemovVideo
MPEG-4 Part 14video/mp4mp4Video
MPEG Transport Stream (container format for different kinds of video format)video/mtsmtsVideo
Shockwave Flash

application/x-shockwave-flash

swfVideo
Windows Media Video

video/x-ms-wmv

wmvVideo
*otherotherVideo


Attributes

Element Parts

Details

XPath syntax examples

lang

values:

Enter ISO-639-2 language code (3 letters). Default value is "eng" for English.

physicalDescription/internetMediaType[@lang="eng"]
displayLabel

value:

  • File Format
physicalDescription/internetMediaType[@displayLabel="File Format"]

Subelements

No subelements for <internetMediaType>.

XML Examples

<physicalDescription>
	<internetMediaType lang="eng" displayLabel="File format">image/jpeg</internetMediaType>
</physicalDescription>

<physicalDescription>
	<internetMediaType lang="eng" displayLabel="File format">application/pdf</internetMediaType>
</physicalDescription>

Mappings

Dublin Core

Depending on the direction of mapping necessary, check

The following specific guidelines apply for the DAMS:

Dublin Core fieldMapping conditionMODS elementNotes
dc:format
internetMediaType

Approximate mapping.

Depending on the exact content, dc:format might also map to MODS elements extent or form.

MARC 21

see https://www.loc.gov/standards/mods/mods-mapping.html#physicaldescription. The following specific guidelines apply for the DAMS:

MARC 21 fieldMapping conditionMODS elementNotes
347
internetMediaType

Approximate mapping.

The information given in MARC field 347 typically does not align with the list of expected file format values. The information can be used, however, to determine a resource's file type.

352$q
internetMediaType

Approximate mapping.

The information given in MARC subfield 352$q typically does not align with the list of expected file format values. The information can be used, however, to determine a resource's file type.

856$q
internetMediaType

Approximate mapping.

Check if the value given in MARC subfield 856$q aligns with the list of expected file format values for the DAMS.

Solr

In general, all MODS metadata is imported into the DAMS Solr server upon ingest. The ingest process generates Solr fields typically named according to the following schema:

mods_value*_suffix

where

  • value* can be one or multiple element, subelement or attribute names that allow to distinguish Solr fields
  • suffix is s, t, ss, ms or mt, which refers to the type of data stored in a Solr field and how it is indexed. The Solr index usually contains multiple copies of each field with the same content, distinguished by their suffix.

The following table shows mappings between MODS elements and Solr fields for those fields that are currently used for display in the Collections portal, or where additional processing happens in Islandora or during the publishing process. Suffixes are ignored, unless relevant for the mapping.

MODS elementMapping conditionSolr DAMSSolr Collections PortalNotes
physicalDescription/ internetMediaType

mods_physicalDescription_internetMediaType

N/A