PREMIS Metadata profile
Preservation characteristics of digital objects in the CSH metadata model are expressed in a PREMIS v3.0-based schema. This document describes the CSH-PREMIS preservation profile. This profile is level 3 PREMIS-conformant--the semantic units described in this profile are internally implemented to be consistent with the PREMIS entities. The profile has modified the obligation of entities to be Mandatory (M) instead of Optional (O). Such changes are documented in the Obligation column of the table.
PREMIS
S. No | Label | Property | Range | Usage | Obligation | Vocab schema | Syntax schema | Example | Comments |
|---|---|---|---|---|---|---|---|---|---|
1 | Object | premis.object | This element contains information about the PREMIS object entity. | 1 |
|
|
|
| |
2 | Event | premis.eventList | This element contains information about the PREMIS event entity. Event instances are bundled in a list. | 0-n |
|
|
|
|
Object
Objects may either be individual scanned folios or groupings of scanned folios (for example, registers or microfilm rolls). For details, please see the documentation for the premis.object.objectCategory property.
S. No | Label | Property | Range | Usage | Obligation | Vocab schema | Syntax schema | Example | Comments |
|---|---|---|---|---|---|---|---|---|---|
1 | Object Identifier | premis.object .objectIdentifer | A container for the object identifier elements | 1 |
|
|
|
| |
2 | Object Category | premis.object .objectCategory |
| A value that indicates the type of this object. Scanned documents will be recorded as "file". Virtual groupings of scanned documents (for example, all files belonging to a register or all files belonging to a microfilm roll) will be recorded as "intellectual entity". | 1 | Controlled vocabulary available at: http://id.loc.gov/search/?q=&q=cs%3Ahttp%3A%2F%2Fid.loc.gov%2Fvocabulary%2Fpreservation%2FobjectCategory one of {intellectual entity, bitstream, file, representation}
|
|
|
|
3 | Object characteristics | premis.object .objectCharacteristics | A container to describe characteristics of this object | 1 |
|
|
|
| |
4 | Original name | premis.object .originalName |
| The path on the hard drive as received from the scanning company. This path will include the name of the hard drive. | 1 |
|
|
|
|
ObjectIdentifier
S. No | Label | Property | Range | Usage | Obligation | Vocab schema | Syntax schema | Example | Comments |
|---|---|---|---|---|---|---|---|---|---|
1 | Object id type | premis.object.objectIdentifer .objectIdentiferType |
| Type of the identifier code. This information should include sufficient detail about the program, module, and version that was used to create this identifier, separated using semi-colons. | 1 |
|
|
|
|
2 | Object id value | premis.object.objectIdentifer .objectIdentifierValue |
| The unique identified assigned to this object, which was generated using the mechanism described in Object id type. | 1 |
|
|
|
|
ObjectCharacteristics
S. No | Label | Property | Range | Usage | Obligation | Vocab schema | Syntax schema | Example | Comments |
|---|---|---|---|---|---|---|---|---|---|
1 | fixity | premis.object .objectCharacteristics.fixity | An element that records the message digest for conducting future fixity checks on this object. | 1 |
| Literal |
|
| |
2 | size | premis.object .objectCharacteristics.size |
| Size of the file as stored on the disk in bytes. | 1 |
| number (no commas or separators) |
|
|
3 | format | premis.object .objectCharacteristics.format | An element that describes the encoding format for this document. | 1 |
|
|
|
|
Fixity
S. No | Label | Property | Range | Usage | Obligation | Vocab schema | Syntax schema | Example | Comments |
|---|---|---|---|---|---|---|---|---|---|
1 | Message digest algorithm | premis.object.objectCharacteristics .fixity.messageDigestAlgorithm | The program, algorithm, and version that was used to create the message digest recorded in this element. |
| 1 |
|
|
|
|
2 | Message digest | premis.object.objectCharacteristics .fixity.messageDigest | The resulting message digest generated for this object using the mechanism described in Message digest algorithm. |
| 1 |
|
|
|
|
Format
S. No | Label | Property | Range | Usage | Obligation | Vocab schema | Syntax schema | Example | Comments |
|---|---|---|---|---|---|---|---|---|---|
1 | Format designation | premis.object.objectCharacteristics .format.formatDesignation | PREMIS container for the format name and version PREMIS 3.0 Data | 1 PREMIS 3.0 data dictionary lists all format properties as optional. However, this has the potential to record empty format instances. To prevent such occurrences, this document changes the obligation for formatDesignation to be mandatory. |
|
|
|
|
FormatDesignation
S. No | Label | Property | Range | Usage | Obligation | Vocab schema | Syntax schema | Example | Comments |
|---|---|---|---|---|---|---|---|---|---|
1 | Format name | premis.object.objectCharacteristics .format.formatDesignation.formatName |
| A mime type that describes the format of the file described by this object. The file should ideally be in a recognized preservation format. | 1 | List of IANA-recognized media type from the list located at: https://www.iana.org/assignments/media-types/media-types.xhtml |
| image/tiff |
|
2 | Format version | premis.object.objectCharacteristics .format.formatDesignation.formatVersion |
| A version descriptor for the file encoding used by the file in this object. | 0-1 |
|
|
|
|
Event
NOTE: All constituent event objects (the components of which are listed below) will be enumerated in a list called eventList within each metadata record. This is a deviation from full compliance with PREMIS, but is necessitated by limitations of our database software (MongoDB), which does not allow records to have multiple fields with the same field name.
Label | Property | Range | Usage | Obligation | Vocab schema | Syntax schema | Example | Comments | |
|---|---|---|---|---|---|---|---|---|---|
| 1 | Event Identifier | premis.event .eventIdentifier | A designation used to identify the Event uniquely within the preservation repository system. | 1 |
|
|
|
| |
| 2 | Event Type | premis.event .eventType |
| A categorization of the nature of the Event | 1 |
| ingestion, migration |
| |
| 3 | Event Date Time | premis.event .eventDateTime |
| The single date and time, or date and time range, at or during which the Event occurred. | 1 | Extended Date-Time format: EDTF (LoC) |
| 20050704T071530-0500 [July 4, 2005 at 7:15:30 a.m. EST] 2006-07-16T19:20:30+01:00 20050705T0715-0500/20050705T0720-0500 [from 7:15 a.m. EST to 7:20 a.m. EST on July 4, 2005] 2004-03-17 [March 17, 2004, only the date is known] | Recommended practice is to record the most specific time possible and to designate the time zone. |
| 4 | Event Detail Information | premis.event .eventDetailInformation |
| 0-n |
|
|
| Additional information about the Event. | |
| 5 | Event Outcome Information | premis.event .eventOutcomeInformation |
| 0-1 |
|
|
| Information about the outcome of the Event | |
| 6 | Linking Agent Identifier | premis.event .linkingAgentIdentifier |
| 0-1 |
|
|
| Identification of one or more Agents associated with the Event. |
EventIdentifier
Label | Property | Range | Usage | Obligation | Vocab schema | Syntax schema | Example |
|---|