OCR {imagerExtra} | R Documentation |
Optical Character Recognition with tesseract
Description
OCR and OCR_data are wrappers for ocr and ocr_data of tesseract package. You need to install tesseract package to use these functions.
Usage
OCR(imorpx, engine = tesseract::tesseract("eng"), HOCR = FALSE)
OCR_data(imorpx, engine = tesseract::tesseract("eng"))
Arguments
imorpx |
a grayscale image of class cimg or a pixel set |
engine |
a tesseract engine. See the reference manual of tesseract for detail. |
HOCR |
if TRUE return results as HOCR xml instead of plain text |
Author(s)
Shota Ochi
Examples
hello <- DenoiseDCT(papers, 0.01) %>% ThresholdAdaptive(., 0.1, range = c(0,1))
if (requireNamespace("tesseract", quietly = TRUE))
{
OCR(hello) %>% cat
OCR_data(hello)
}
[Package imagerExtra version 1.3.2 Index]