英语原文共 6 页,剩余内容已隐藏,支付完成后下载完整资料
A Complete Workflow for Development of Bangla OCR
ABSTRACT
Developing a Bangla OCR requires bunch of algorithm and methods. There were many effort went on for developing a Bangla OCR. But all of them failed to provide an error free Bangla OCR. Each of them has some lacking. We discussed about the problem scope of currently existing Bangla OCRrsquo;s. In this paper, we present the basic steps required for developing a Bangla OCR and a complete workflow for development of a Bangla OCR with mentioning all the possible algorithms required.
Keywords
OCR, Bangla OCR, Bangla Font, Matra, Preprocessing, Binarization, Classification, Segmentation, Page Layout analysis, Tesseract.
INTRODUCTION
OCR means Optical Character Recognition, which is the mechanical or electronic transformation of scanned images of handwritten, typewritten or printed text into machine-encoded text.
OCR has emerged a major research field for its usefulness since 1950. Bangla is ranked 5th as speaking language in the world. With the digitization of every field, it is now necessary to digitized huge volume of old Bangla book by using an efficient Bangla OCR. However, till today there is no such OCR for Bangla is developed. The existing Bangla OCRs could not fulfill the desired result. From 80‟s Bangla OCR development has started and now for its necessity this field becomes a major research area today. For further progression, synchronization of the total system is required. Here we present a total overview of Bangla OCR and its existing challenge is to know the development procedure and to estimate what more requires to do to create a complete Bangla OCR.
RELATED WORKS
Though Bangla OCR is not a recent work, but there are very few mentionable works in this field. BOCRA and Apona-Pathak‟ was made publicly in 2006 1. But they are not open source. The Center for Research on Bangla Language Processing (CRBLP) released BanglaOCR – the first open source OCR software for Bangla – in 2007 2. BanglaOCR is a complete OCR framework, and has a recognition rate of up to 98% (in limited domains) but it also have many limitations.
STEPS OF BANGLA OCR
Although OCR system can be develop for different purposes, for different languages, an OCR system contains some basic steps. Figure-2 describes the basic steps of an OCR. A basic OCR system has the following particular processing steps:
1. Scanning.
2. Preprocessing.
3. Feature extraction or pattern recognition.
4. Recognition using one or more classifier.
5. Contextual verification or post processing
Scanning
To extract characters from scanned images it is necessary to convert the image into proper digital image. This process is called text digitization. The process of text digitization can be performed either by a Flat-bed scanner or a hand-held scanner. Hand held scanner typically has a low resolution range. Appropriate resolution level typically 300-1000 dots per inch for better accuracy of text extraction [1].
Preprocessing
Preprocessing consists of number of preliminary processing steps to make the raw data usable for the recognizer. The typical preprocessing steps included the following process:
1. Binarization
2. Noise Detection amp; Reduction
3. Skew detection amp; correction
4. Page layout analysis
5. Segmentation
Binarization methods
Binarization is a technique by which the gray scale images are converted to binary images. Some binarization methods are given below:
Global Fixed Threshold: The algorithm chooses a fixed intensity threshold value I. If the intensity value of any pixel of an input is more than I, the pixel is set to white otherwise it is black. If the source is a color image, it first has to be converted to grey level using the standard conversion [2].
Otsu Global Algorithm: This method is both simple and effective. The algorithm assumes that the image to be threshold contains two classes of pixels and calculates the optimum threshold separating those two classes so that their combined spread (intra-class variation) is minimal .
Niblackrsquo;s Algorithm: Niblacrsquo;s algorithm calculates a pixel wise threshold by sliding a rectangular window over the grey level image. The threshold is computed by using the mean and standard deviation, of all the pixels in the window[2].
Adaptive Niblackrsquo;s Algorithm: In archive document processing, it is difficult to identify suitable sliding window size SW and constant k values for all images, as the character size of both frame and stroke may vary image by image. Improper choice of SW and k values results in poor binarization. Modified Niblacrsquo;s algorithm allows automatically chosen values for k and SW, which is called adaptive Niblacrsquo;s algorithm[2].
Sauvolarsquo;s Algorithm: Sauvolarsquo;s algorithm is a modification of Niblacrsquo;s which is claimed to give improved performance on documents in which the background contains light texture, big variations and uneven illumination. In this algorithm, a threshold is computed with the dynamic range of the standard deviation[2].
Noise detectionamp;correction methods
Noise can be produced during the scanning section automatically. Two types of noises are common. They are background noise and salt amp; paper noise. Complex script like Bangla, we cannot eliminate wide pixels from upper or lower portion of a character because it may not only eliminate noise but also the difference between two characters like e and i and for some other characters[3]. Some noise detection methods are given below:
T
剩余内容已隐藏,支付完成后下载完整资料
资料编号:[148415],资料为PDF文档或Word文档,PDF文档可免费转换为Word
课题毕业论文、开题报告、任务书、外文翻译、程序设计、图纸设计等资料可联系客服协助查找。
您可能感兴趣的文章
- 饮用水微生物群:一个全面的时空研究,以监测巴黎供水系统的水质外文翻译资料
- 步进电机控制和摩擦模型对复杂机械系统精确定位的影响外文翻译资料
- 具有温湿度控制的开式阴极PEM燃料电池性能的提升外文翻译资料
- 警报定时系统对驾驶员行为的影响:调查驾驶员信任的差异以及根据警报定时对警报的响应外文翻译资料
- 门禁系统的零知识认证解决方案外文翻译资料
- 车辆废气及室外环境中悬浮微粒中有机磷的含量—-个案研究外文翻译资料
- ZigBee协议对城市风力涡轮机的无线监控: 支持应用软件和传感器模块外文翻译资料
- ZigBee系统在医疗保健中提供位置信息和传感器数据传输的方案外文翻译资料
- 基于PLC的模糊控制器在污水处理系统中的应用外文翻译资料
- 光伏并联最大功率点跟踪系统独立应用程序外文翻译资料