Reading List: Text Recognition for Digital Collections
Optical character recognition (OCR) and handwritten text recognition (HTR) are processes most libraries are familiar with from digitising text, often in large volumes. The software automatically recognises characters, which then become available for uses such as keyword search and computational analysis. The rise of machine learning applications brought a corresponding rise in HTR and improvements in OCR quality. …