04: TEXT images that NEED transcription, one metadata (.xml) record

 

ONLY USE THIS BUCKET if the text requires future transcription. [currently being saved as image files (.tif) to enable display]

Not sure whether to transcribe? README

 

The following files belong to the same object.

 

a single-page document

0023_000003_000012_0000.xml

0023_000003_000012_0001.tif

 

OR:

 

a five-page document [contains 1 metadata (.xml) file, 5 image files (.tif)]:

0023_000003_000012_0000.xml

 

0023_000003_000012_0001.tif

0023_000003_000012_0002.tif

0023_000003_000012_0003.tif

0023_000003_000012_0004.tif

0023_000003_000012_0005.tif

 

0023 means that this is institution 23

000003 means that this is their collection number 3 (other institutions may have a collection 3 also)

000012 is the item number. This is the 12th item in the series

0000 is the metadata record, with the .xml extension

0001 this is the first digital object that applies to this metadata record; here the number is the sequence to be applied in display

(This may NOT be the same as the page number for the scanned image!)

0002 then is the 2nd page in the sequence 0003 is the third page in the sequence, and so on

 

The scanned pages and their corresponding metadata files thus do NOT have the same filename!

However, each tiff has a corresponding text file that DOES have the same filename.

 

These files have NO search capabilities beyond the metadata record. .

Hence, this “bucket” is to be avoided if at all possible

The likelihood of raising funds for transcription at a later date is extremely small.

 

Return to Filenaming Schemes


Page Information

  • 9 months ago [history]
  • View page source
  • You're not logged in
  • No tags yet learn more

Wiki Information

Recent PBwiki Blog Posts