ocr {magick} | R Documentation |
Image Text OCR
Description
Extract text from an image using the tesseract package.
Usage
image_ocr(image, language = "eng", HOCR = FALSE, ...)
image_ocr_data(image, language = "eng", ...)
Arguments
image |
magick image object returned by image_read() or image_graph() |
language |
passed to tesseract. To install additional languages see instructions in tesseract_download(). |
HOCR |
if TRUE return results as HOCR xml instead of plain text |
... |
additional parameters passed to tesseract |
Details
To use this function you need to install the tesseract package first:
install.packages("tesseract")
Best results are obtained if you set the correct language in tesseract. To install additional languages see instructions in tesseract_download().
See Also
Other image: _index_, analysis, animation, attributes(), color, composite, defines, device, edges, editing, effects(), fx, geometry, morphology, options(), painting, segmentation, transform(), video
Examples
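# Run only when the tesseract package is available
if(require("tesseract")){
  # Read the sample image from the web
  img <- image_read("http://jeroen.github.io/images/testocr.png")
  # Extract the recognized text from the image
  image_ocr(img)
  # Extract the OCR results in tabular form
  image_ocr_data(img)
}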
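
# A further sketch, not part of the original examples: it shows the HOCR and
# language arguments described above. The language code "fra" is only an
# illustrative choice, and the sketch assumes its training data can be
# downloaded with tesseract_download().
if(require("tesseract")){
  img <- image_read("http://jeroen.github.io/images/testocr.png")
  # Request HOCR xml output instead of plain text
  hocr <- image_ocr(img, HOCR = TRUE)
  # Download the extra training data only if it is not installed yet
  if(!"fra" %in% tesseract_info()$available) tesseract_download("fra")
  # Pass the language code on to tesseract
  image_ocr(img, language = "fra")
}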
[Package magick version 2.8.4 Index]