fix DPI (if needed) 300 DPI is minimum fix text size (e.g. 12 pt should be ok) try to fix text lines (deskew and dewarp text) try to fix illumination of image (e.g. no dark part of image binarize and de-noise image http://stackoverflow.com/questions/9480013/image-processing-to-improve-tesseract-ocr-accuracy http://en.wikipedia.org/wiki/Image_scaling User Scenario: Process digital camera pictures and OCR to extract specific […]
Also filed in
|
|
Wednesday, March 11, 2015
I have been experimenting with tasseract and occasionally get really terrible results when I know I should not. When I grab the images, change the dpi aka save them at a different resolution I get great results. Tasseract wants 300 dpi. when I google dpi of digital images or how to change dpi on android […]
Also filed in
|
|
Saturday, August 30, 2014
After getting my feet wet with the simple app in my last post I wanted to try a little bit more advanced app. Getting it running was pretty straight forward. git clone https://github.com/rmtheis/android-ocr.git android-ocr made sure it was pointing at tess two right clicked on project and ran as android app it works pretty well. […]
Also filed in
|
|
pretty boring post. basically followed this tutorial make sure you have the build toolssudo apt-get install build-essential sudo apt-get install ia32-libs sudo apt-get update sudo apt-get install ia32-libs sudo apt-get install openjdk-6-jdk sudo apt-get install icedtea-plugin install android sdk install android ndk add adt tools to your path and .bashrc build tess-two git clone git://github.com/rmtheis/tess-two […]
Also filed in
|
|
Tesseract is probably the most accurate open source OCR engine available. It differs from OpenCV because opencv is a general purpose image library. You could use it to build something like Teseeract. How well does it work? I downloaded the latest portable version to try it out. The ReadMe is very helpful. out of the […]
Also filed in
|
|