Urgent OCR
About This Gig
In this folder are three folders called Bishopsgate archives, Newham archives and special collections and RIBA collections:
https://www.dropbox.com/scl/fo/ixwakdbgdhodiq9sqlbk6/AAuJLgrYLb2WB1a--E_b76A?rlkey=y0f9poa3t1rpalmemh2qljs2n&st=dfvttlf8&dl=0
Ignore the folder called riba objects called scanning. Each of those three folders has many subfolders. For each of those subfolders, perform these instructions.

OCR INSTRUCTIONS

1. One file per folder
For each archive folder (e.g. TBUK1, NEWHAM2), create ONE plain text file (.txt). Put all documents from that folder into that one file.

2. Clear document separation and stable IDs
Every time a new document starts, write:

==============================
ARCHIVE_FOLDER: TBUK1
DOCUMENT_ID: TBUK1_01
DATE: (write date exactly as shown, or Unknown)
PLACE: (write place exactly as shown, or Not stated)
------------------------------

Then paste the full OCR text of that document. For the next document:

==============================
ARCHIVE_FOLDER: TBU
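As a rough illustration of the separator and ID scheme in the instructions above, here is a minimal Python sketch. It is not part of the brief: the function names, the dictionary keys (`text`, `date`, `place`), the 30-character rule length, and the one-field-per-line layout are all assumptions made for the example. The OCR step itself is left out; the sketch only shows how already-extracted text might be wrapped in the required header.

```python
# Hypothetical helper for the header format; all names are illustrative.
SEPARATOR = "=" * 30  # assumed length of the "=====" rule
RULE = "-" * 30       # assumed length of the "-----" rule


def format_document(archive_folder, index, text, date="Unknown", place="Not stated"):
    """Wrap one document's OCR text in the header block from the brief."""
    header = [
        SEPARATOR,
        f"ARCHIVE_FOLDER: {archive_folder}",
        # Stable ID: folder name plus a zero-padded running number, e.g. TBUK1_01
        f"DOCUMENT_ID: {archive_folder}_{index:02d}",
        f"DATE: {date}",
        f"PLACE: {place}",
        RULE,
    ]
    return "\n".join(header) + "\n" + text.strip() + "\n"


def build_archive_text(archive_folder, documents):
    """Join every document of one archive folder into a single string."""
    blocks = []
    for index, doc in enumerate(documents, start=1):
        blocks.append(
            format_document(
                archive_folder,
                index,
                doc["text"],
                # Fall back to the wording the brief asks for when a field is missing
                date=doc.get("date") or "Unknown",
                place=doc.get("place") or "Not stated",
            )
        )
    return "\n".join(blocks)


if __name__ == "__main__":
    sample = [
        {"text": "First letter text", "date": "3 May 1921", "place": "London"},
        {"text": "Second letter text"},
    ]
    print(build_archive_text("TBUK1", sample))
```

The returned string could then be written to one .txt file per archive folder, for example TBUK1.txt, which matches the one-file-per-folder rule.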
About the Seller
Sasha W.
on PeoplePerHour