03: DIGITALLY TRANSCRIBED TEXT, one metadata (.xml) record

 

The following files belong to the same object.

 

a single-page document (already transcribed):

0023_000003_000012_0000.xml

0023_000003_000012_0001.tif

0023_000003_000012_0001.txt

(this last is the text file you create by typing the words of the tiff into a plain text editor)

 

OR:

 

a five-page document [contains 1 metadata (.xml) file, 5 image files (.tif), and 5 text (.txt) files]:

0023_000003_000012_0000.xml

 

0023_000003_000012_0001.tif

0023_000003_000012_0002.tif

0023_000003_000012_0003.tif

0023_000003_000012_0004.tif

0023_000003_000012_0005.tif

 

the corresponding plain text files that you will create, one per tiff image:

0023_000003_000012_0001.txt

0023_000003_000012_0002.txt

0023_000003_000012_0003.txt

0023_000003_000012_0004.txt

0023_000003_000012_0005.txt

 

[note the different file extensions, all 3 file types belong to the same document]

 

0023 means that this is institution 23

000003 means that this is their collection number 3 (other institutions may have a collection 3 also)

000012 is the item number. This is the 12th item in the series

0000 is the metadata record, with the .xml extension

0001 this is the first digital object that applies to this metadata record; here the number is the sequence to be applied in display

(This may NOT be the same as the page number for the scanned image!)

0002 then is the 2nd page in the sequence 0003 is the third page in the sequence, and so on

 

The scanned pages and their corresponding metadata files thus do NOT have the same filename!

However, each tiff has a corresponding text file that DOES have the same filename.

 

For each page there is a corresponding plain ASCII text file, containing the transcription for that page.

Except for the extension, the filename must match that of the image which was transcribed.

Transcriptions should be done with a plain text editor (such as Notepad), with no formatting, and saved as .txt files.

These files are used for searching and for display, but do NOT add in html or other formatting without permission.

 

Return to Filenaming Schemes


Page Information

  • 9 months ago [history]
  • View page source
  • You're not logged in
  • No tags yet learn more

Wiki Information

Recent PBwiki Blog Posts