文档OCR /可读性的图像处理/增强算法?

时间:2022-02-07 09:02:31

I'm looking for algorithms, papers, or software to enhance faxes, images from cell phone cameras, and other similar source for readability and OCR.

我正在寻找算法,论文或软件来增强传真,手机摄像头的图像以及其他类似的可读性和OCR来源。

I'm mainly interested in simple enhancements (eg. things you could do using ImageMagick), but I'm also interested in more sophisticated techniques. I'm already talking to vendors, so for this question I'm mostly looking for algorithms or open source software.

我主要对简单的增强功能感兴趣(例如,你可以使用ImageMagick做的事情),但我也对更复杂的技术感兴趣。我已经和供应商谈过了,所以对于这个问题,我主要是寻找算法或开源软件。

To further clarify: I'm not looking for OCR software or algorithms; I'm looking for algorithms to clean up the image so it looks more readable to the human eye, and can possibly be used for OCR.

进一步澄清:我不是在寻找OCR软件或算法;我正在寻找算法来清理图像,使其看起来对人眼更具可读性,并且可能用于OCR。

3 个解决方案

#1


I had a similar problem when I was writing some software to do book scanning; floating around on the internet is a program called pagetools that does straightening of scanned-in pages using a fairly clever mathematical trick called the Radon transform.

当我写一些软件进行书籍扫描时,我遇到了类似的问题;漂浮在互联网上的是一个名为pagetools的程序,它使用一种称为Radon变换的相当聪明的数学技巧来拉直扫描页面。

I also wrote a small routine that would white out the blank space on the page; OCR algorithms tend to do a lot better when they don't have to contend with background noise. What I did, was look for light-colored pixels that were more than a small radius away from dark-colored ones, and then boost those up to being pure white.

我还写了一个小例程,它会使页面上的空白变白;当OCR算法不必与背景噪声竞争时,它们往往会做得更好。我所做的是寻找距离深色的像素不到一个小半径的浅色像素,然后将它们提升为纯白色。

It's been a few years, though, so I don't have the exact implementation details handy.

但是,已经有几年了,所以我没有准确的实现细节。

#2


One simple image filter to look into is the "Median Filter" which is a very straightforward, easy to implement yourself, filter to help clean up scanned/photographed text. http://en.wikipedia.org/wiki/Median_filter

一个简单的图像过滤器是“中值过滤器”,它非常简单,易于实现,过滤器有助于清理扫描/拍摄的文本。 http://en.wikipedia.org/wiki/Median_filter

#3


As requested, link to Wikipedia: Optical character recognition

根据要求,链接到*:光学字符识别

Microsoft Research: Optical character recognition papers

微软研究院:光学字符识别论文

CiteSeerX : Papers on optical character recognition

CiteSeerX:关于光学字符识别的论文

#1


I had a similar problem when I was writing some software to do book scanning; floating around on the internet is a program called pagetools that does straightening of scanned-in pages using a fairly clever mathematical trick called the Radon transform.

当我写一些软件进行书籍扫描时,我遇到了类似的问题;漂浮在互联网上的是一个名为pagetools的程序,它使用一种称为Radon变换的相当聪明的数学技巧来拉直扫描页面。

I also wrote a small routine that would white out the blank space on the page; OCR algorithms tend to do a lot better when they don't have to contend with background noise. What I did, was look for light-colored pixels that were more than a small radius away from dark-colored ones, and then boost those up to being pure white.

我还写了一个小例程,它会使页面上的空白变白;当OCR算法不必与背景噪声竞争时,它们往往会做得更好。我所做的是寻找距离深色的像素不到一个小半径的浅色像素,然后将它们提升为纯白色。

It's been a few years, though, so I don't have the exact implementation details handy.

但是,已经有几年了,所以我没有准确的实现细节。

#2


One simple image filter to look into is the "Median Filter" which is a very straightforward, easy to implement yourself, filter to help clean up scanned/photographed text. http://en.wikipedia.org/wiki/Median_filter

一个简单的图像过滤器是“中值过滤器”,它非常简单,易于实现,过滤器有助于清理扫描/拍摄的文本。 http://en.wikipedia.org/wiki/Median_filter

#3


As requested, link to Wikipedia: Optical character recognition

根据要求,链接到*:光学字符识别

Microsoft Research: Optical character recognition papers

微软研究院:光学字符识别论文

CiteSeerX : Papers on optical character recognition

CiteSeerX:关于光学字符识别的论文