CSH Crowdsourcing Metadata Schema

CSH Crowdsourcing Metadata Schema


Image

 

Label

Property

Range

Usage

Obligation

ID

.id

String

The ID generated by MongoDB

1

File properties

.file

File

The file properties for this image

1

Scan properties

.scan

Scan

The page from which this image was extracted and its properties

1

Can crowdsource

.canCrowdsource

String

Whether this image is approved to be crowdsourced (i.e., yes, no, maybe)

0 - 1

Transcription properties

.transcription

Transcription

The transcription status and current labels for this image

0 - 1

 


File

 

Label

Property

Range

Usage

Obligation

Height

.file.height

Number

The height of the image in pixels

1

Width

.file.width

Number

The width of the images in pixels

1

Size

.file.size

Number

The size of the image in bytes

1

Original file path

.file.origPath

String

The path with original file name

1

Anonymized file path

.file.anonPath

String

The path with anonymized file name

1

 


Scan

 

Label

Property

Range

Usage

Obligation

Line number

.scan.lineNum

Number

The line number from which this image was extracted

1

Item group number

.scan.itemGroupNum

Number

The register in which this image is a part of

1

Word number

.scan.wordNum

Number

The word number from which this image was extracted

1

Scan number

.scan.scanNum

Number

The page number from which this image was extracted

1

Pixel location

.scan.pixelLocation

PixelLocation

The pixel location of the image with respect to the page

1

 


PixelLocation

 

Label

Property

Range

Usage

Obligation

x position

.scan.pixelLocation.x

Number

The x position of the image in pixels

1

y position

.scan.pixelLocation.y

Number

The y position of the image in pixels

1

 


Transcription

 

Label

Property

Range

Usage

Obligation

Number of classifications

.transcription.numClassifications

Number

The number of classifications needed for this image

1

Subject set ID

.transcription.subjectSetId

SubjectSet_id

A pointer to the Zooniverse subject set ID to which this image belongs

1

Status

.transcription.status

String

The transcription status of the image (i.e., to send, sent, recieved, finished, gold standard)

1

Answer

.transcription.answer

Answer

The aggregated worker responses

0 - 1

 



Answer

 

Label

Property

Range

Usage

Obligation

Image type

.answer.type

String

The aggregated type of image from labeller responses

0 - 1

Image label

.answer.label

String

The aggregated label from labeller responses

0 - 1

Labeller responses

.answer.responses

List: Response

The list of labeller responses

0 - 1

 


Responses

 

Label

Property

Range

Usage

Obligation

Labeller ID

.labellerId

String

The Zooniverse ID of the labeller

0 - 1

Image type

.type

String

The type of image the labeller indicated (i.e., not a word, partial word, single word, multiple words)

1

Image label

.label

String

The word in the image indicated by the labeller

1

 

 

 

Reference Table (Not included in schema)

S. No.LabelPropertyRangeExampleUsageObligationComments
1Anonymized File nameanonymizedImageFile 

 

 0 - 1 
2Location-based filenamelocationBasedImageFile 

 

 

1

 
3Page numbernumPagexsd:string

6.0,

1.5, etc.

A data element that designates the version of the format named in "Format Name"0 or 1 
4Locatiion of imagelocationY

premis:hasMessageDigest?

nfo:hashValue?

fedora:digest?
 

xsd:string, or

xsd:anyURI
May have more than one checksum using different algorithms (differentiated with either URN syntax or separate properties for each algorithm). 
5heightheightrdfs:Literal Date of creation of the resource.0 or 1 
6Image number in the filenumWordrdfs:Literal The last modification date0 or 1 
7LabellocationXxsd:string A human-readable label or string that can be used as a simple surrogate for the resourcemin 0, max unbounded 
8segmentType----0-1 
9File Name----01 
10width of the imagewidthxsd:string A human readable string that specifies the name of the manufacturer of the scanner0 - 1Recommended
11register Numberregisterxsd:string A human readable string that specifies the name of the model of the scanner0 - 1Recommended