Ocromore : Combining multiple OCR-engine results to improve character recognition accuracy
Full recordShow full item record
AbstractOne of the goals of the Aktienführer-Datenarchiv project is to process data from the Aktienführer and store it in a structured manner in a database. The Aktienführer is a reference work published annually between 1956-1999 as print book comprising data for companies listed at stock exchanges in Germany. A high character recognition accuracy is crucial for structure recognition and further analyses of the OCR-output. To optimize the OCR quality, "Ocromore" was developed. It is a toolset for combining multiple OCR-outputs. The best combined result is achieved with an word-wise character confidence-based multi sequence alignment (msa) approach. Our results show an character accuracy increase of 0,49% and an error reduction of 33% compared to the best single result.
TypePräsentation auf Konferenz
Kamlah, Jan ; Stegmüller, Johannes (2018) Ocromore : Combining multiple OCR-engine results to improve character recognition accuracy. <https://ub-madoc.bib.uni-mannheim.de/48756> Open Access [Präsentation auf Konferenz] [img] </48756/1/Ocromore.pdf>Vorschau