颜色分类leetcode-awesome-ocr:有前途的OCR资源的精选列表

时间:2024-07-26 15:11:32
【文件属性】:

文件名称:颜色分类leetcode-awesome-ocr:有前途的OCR资源的精选列表

文件大小:75.61MB

文件格式:ZIP

更新时间:2024-07-26 15:11:32

系统开源

颜色分类leetcode Call for contributor(paper summary,dataset generation,algorithm implementation and any other useful resources) awesome-ocr A curated list of promising OCR resources Librarys 有2个api 都支持图片 百度自家的 :基本可以放弃 化验单识别:也只能提取化验单上三个字段的一个 第三方和阿里自己提供的 API 集中在身份证、银行卡、驾驶证、护照、电商商品评论文本、车牌、名片、贴吧文本、视频中的文本,多输出字符及相应坐标,卡片类可输出成结构化字段,价格在0.01左右 另外有三家提供了简历的解析,输出结果多为结构化字段,支持文档和图片格式 价格在0.1-0.3次不等 目前无第三方入驻,仅有腾讯自有的api 涵盖车牌、名片、身份证、驾驶证、银行卡、营业执照、通用印刷体,价格最高可达0.2左右。 OcrKing 从哪来? OcrKing 源自2009年初 Aven 在数据挖掘中的自用项目,在对技术的执着


【文件预览】:
awesome-ocr-master
----image-acquiistion.md(235B)
----.github()
--------workflows()
----车牌识别.md(909B)
----challenge-methods.md(0B)
----trainning-data-preparing.md(1KB)
----preprocessing.md(0B)
----LICENSE(1KB)
----ocr-engines.md(0B)
----README.md(26KB)
----post-processing.md(0B)
----papers()
--------Figure 2.1- Illustration of basic template matching..png(165KB)
--------Figure 2.4- Illustration of how RNNs are used for the OCR task.png(251KB)
--------The Neural Turing Machine.pdf(2.79MB)
--------WEB SCHEMA DETECTION AND DATA EXTRACTION SYSTEM-17_chapter 7.pdf(773KB)
--------DCA-lecture06.pptx(2.27MB)
--------Statistical Language Modeling for Historical Documents using Weighted Finite-State Transducers and Long Short-Term Memory.pdf(34.26MB)
--------synthetic text-line image generation process.png(639KB)
--------UNSUPERVISED APPROACH TO DEDUCE SCHEMA AND EXTRACT DATA FROM TEMPLATE WEB PAGES.pdf(272KB)
--------Figure 2.2- Illustration of segmentation process using over-segmentation method..png(270KB)
--------jucs_20_02_0169_0192_grigalis.pdf(411KB)
--------Adaptive document image binarization.pdf(867KB)
--------Generic Text Recognition using Long Short-Term Memory Networks-PhD_Thesis_Ul-Hasan.pdf(21.76MB)
--------A Sequence Learning Approach for Multiple Script Identification.pdf(1.6MB)
--------Instructions.doc(22KB)
--------Brian Lott. Survey of Keyword Extraction Techniques.pdf(82KB)
--------Figure 3.3- Example of Fraktur script. Ersch-Gruber is an encyclopedia written in the.png(310KB)
--------Sequence prediction using recurrent neural networks(LSTM) with TensorFlow — Mourad Mourafiq.pdf(987KB)
--------1610.01178v1.pdf(480KB)
--------Towards a Robust OCR System for Indic Scripts.pdf(1.03MB)
--------HOCR specification.pdf(199KB)
--------Page - Level Web Data Extraction f rom Template Pages-Chang_FiVaTech.pdf(232KB)
--------ijsrp-p3021.pdf(395KB)
--------Document-Image-Analysis-process.png(303KB)
--------Representation Learning -A Review and New Perspectives-TPAMISI-2012-04-0260-1.pdf(919KB)
--------全家桶.jpg(330KB)
--------y-derivative of a Gaussian kernel (p. 42).pdf(2.93MB)
--------深度神经网络结构以及Pre-Training的理解 - cyq0122的专栏 - 博客频道 - CSDN.NET.pdf(4.06MB)
-------- Relation Schema Induction using Tensor Factorization with Side Information.pdf(465KB)
--------Long-Short Memory Network(LSTM长短期记忆网络) - Physcal - 博客园.pdf(308KB)
--------语义分析的一些方法(一) | 火光摇曳.pdf(764KB)
--------Figure 2.3- Figure showing the basic unit in HMM-based OCR. .png(150KB)
--------pdf_24.pdf(867KB)
--------InfoQ-JHipster-mini-book.pdf(0B)
--------V3I3-0224.pdf(432KB)
--------Review paper on “Optimized approaches for web data harvesting.pdf(268KB)
--------自然语言处理的神经网络入门学习笔记.pdf(13.52MB)
--------Figure 3.5- Document quality degradation caused during preprocessing..png(113KB)
--------Figure 3.4- Shape confusion in Fraktur script. Many characters in Fraktur resemble.png(164KB)
--------Figure 3.6- Word formation in Devanagari script.png(239KB)
--------Universum Prescription- Regularization Using Unlabeled Data1511.03719v7.pdf(400KB)
--------刘知远. 基于文档主题结构的关键词抽取方法研究.pdf(3.11MB)
--------Schema Extraction for Tabular Data on the Web-p421-adelfio.pdf(378KB)
--------文章结构.png(269KB)
--------Figure 3.7- Reading direction in Nastaleeq script.Nastaleeq script is read from right-.png(169KB)
--------1610.05567v1.pdf(5.99MB)
----resources()
--------汉语拼音码表-一级汉字拼音对照对照表.txt(31KB)
--------汉语拼音码表-二级汉字拼音对照对照表.txt(25KB)
--------special-character.txt(28KB)
--------3754个常用汉字列表.txt(11KB)
--------6039.txt(15KB)
--------chinese_5039.txt(15KB)
--------cid2code.txt(1.95MB)
----.gitignore(10B)

网友评论