The following files belong to the same object.
a single-page document (already transcribed):
0023_000003_000012_0000.xml
0023_000003_000012_0001.tif
0023_000003_000012_0001.txt
(this last is the text file you create by typing the words of the tiff into a plain text editor)
OR:
a five-page document [contains 1 metadata (.xml) file, 5 image files (.tif), and 5 text (.txt) files]:
0023_000003_000012_0000.xml
0023_000003_000012_0001.tif
0023_000003_000012_0002.tif
0023_000003_000012_0003.tif
0023_000003_000012_0004.tif
0023_000003_000012_0005.tif
the corresponding plain text files that you will create, one per tiff image:
0023_000003_000012_0001.txt
0023_000003_000012_0002.txt
0023_000003_000012_0003.txt
0023_000003_000012_0004.txt
0023_000003_000012_0005.txt
[note the different file extensions, all 3 file types belong to the same document]
0023 means that this is institution 23
000003 means that this is their collection number 3 (other institutions may have a collection 3 also)
000012 is the item number. This is the 12th item in the series
0000 is the metadata record, with the .xml extension
0001 this is the first digital object that applies to this metadata record; here the number is the sequence to be applied in display
(This may NOT be the same as the page number for the scanned image!)
0002 then is the 2nd page in the sequence 0003 is the third page in the sequence, and so on
The scanned pages and their corresponding metadata files thus do NOT have the same filename!
However, each tiff has a corresponding text file that DOES have the same filename.
For each page there is a corresponding plain ASCII text file, containing the transcription for that page.
Except for the extension, the filename must match that of the image which was transcribed.
Transcriptions should be done with a plain text editor (such as Notepad), with no formatting, and saved as .txt files.
These files are used for searching and for display, but do NOT add in html or other formatting without permission.
Return to Filenaming Schemes
Page Information
|
Wiki Information |
Recent PBwiki Blog Posts |