concerns:
OCR, tesseract, wrapper
depends:
tesseract, Image magick
Usage: ocube [OPTION] file1 file2 ...
converts files to tif, scans them with tesseract and outputs the text on STDOUT
OPTIONS:
-t TIF-DIR |
saves converted TIF images in target-directory TIF-DIR |
-o TESS-DIR |
saves files created by tesseract in TESS-DIR |
-i <FILE> |
define input file (otherwise STDIN) |
-f FILE |
saves all text output in file FILE (messages on STOUT) |
-s |
silent; no output on STDOUT |
-c |
convert with fill white, resize, sigmoidal-contrast, etc.. |
-l |
save error messages in ocube.error.log instead of showing them on STDERR |
-L <file> |
save error messages in <file> |
-e |
exclude files found in ocube.error.log. Avoids rescanning of files that were processed already. |
-E <file> |
same as -e but with file <file> |