File name: 2007 - Smith - An Overview of the Tesseract OCR Engine.pdf
File size: 207 KB
File format: PDF
Last updated: 2023-07-17 10:48:24
Machine learning
A classic 2007 paper on OCR. Tesseract is an open-source OCR engine that was developed at HP between 1984 and 1994. Like a supernova, it appeared from nowhere for the 1995 UNLV Annual Test of OCR Accuracy [1], shone brightly with its results, and then vanished back under the same cloak of secrecy under which it had been developed. Now for the first time, details of the architecture and algorithms can be revealed.