uiucprescon.ocr Package
Public API
class uiucprescon.ocr.Reader(language_code, tesseract_data_path)
Reads the text from an image file.
Note: A Reader object should not be created directly. Instead, construct it with the Engine class’s Engine.get_reader() method.
read(file: str)
Generate text from an image.
Parameters: file – File path to an image.
Returns: Text extracted from the image.
class uiucprescon.ocr.Engine(data_set_path)
The engine that drives the OCR processing.
get_reader(lang: str) → uiucprescon.ocr.reader.AbsReader
Builder method for creating reader objects for a specific language.
Parameters: lang – Letter code that identifies the language of a Tesseract data set.
Returns: A Reader object that can be used to extract text from an image.
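The Engine and Reader descriptions above suggest a two-step workflow: ask the engine for a reader, then pass an image path to the reader. The following is a minimal sketch of that flow, not a definitive usage pattern. The helper name extract_text, the default language code "eng", and the paths in the commented usage are illustrative assumptions that do not come from this reference.

```python
def extract_text(engine, image_path, lang="eng"):
    """Extract text from one image with an Engine-like object.

    `engine` is expected to provide get_reader(lang), as documented for
    uiucprescon.ocr.Engine. The default "eng" is an assumed language code;
    use whichever code matches the installed Tesseract data set.
    """
    # Build the reader through the engine rather than instantiating Reader.
    reader = engine.get_reader(lang)
    # read() is documented to take a file path as a string.
    return reader.read(str(image_path))


# Possible usage, assuming the package is installed and the (hypothetical)
# data path contains the matching Tesseract language data:
#
#   from uiucprescon import ocr
#   engine = ocr.Engine("/path/to/tessdata")
#   print(extract_text(engine, "page.tif"))
```

Passing the engine in as an argument keeps the helper independent of where the Tesseract data lives, and lets it be exercised with a stand-in object when the package is not available.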
get_version() → str
Check the version of Tesseract that this Python package is linked to. An example value might be the string “3.05.02”.
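Because get_version() returns a string such as “3.05.02”, comparing versions as strings can be misleading. One way to compare them numerically is sketched below. The helper names parse_version and meets_minimum are hypothetical, and the sketch assumes the version consists only of dot-separated integers; a string carrying a suffix would need extra handling.

```python
def parse_version(version: str) -> tuple:
    """Turn a dotted numeric version string into a tuple of ints.

    Assumes every dot-separated part is an integer, as in "3.05.02".
    """
    return tuple(int(part) for part in version.split("."))


def meets_minimum(version: str, minimum: str) -> bool:
    """Return True if `version` is at least `minimum`, compared numerically."""
    return parse_version(version) >= parse_version(minimum)


# Possible usage, assuming `engine` is a uiucprescon.ocr.Engine instance:
#
#   if not meets_minimum(engine.get_version(), "3.5.0"):
#       raise RuntimeError("Linked Tesseract version is too old")
```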