Detect Text (OCR) (Image Processing)
Synopsis
This operator detects text on images.Description
This operator uses the tesseract library to detect text in images. It returns the result as a table.
Tesseract provides a large range of language specific models, which you can choose with the languages or all_languages parameter. The chosen model is downloaded and then used to detect text on the image which is given on the img port.
The downloaded model is cached on your computer within the .RapidMiner folder. If the model was already downloaded in a prior run of the operator the cached version is used.
You can define the granularity of the detection using the split_into setting. it allows the user to specify if the results are delivered on block, paragraph, line, sentence or individual character level.
Tesseract allows the user to specify more than 300 parameters. You can specify them using the 'additional parameters' setting list.
On Mac OS and Linux users need to install the tesseract library manually for example using brew or apt-get. The library is already bundled for Windows Users