Source Image: sourceImage.jpg
This report was generated by scanning the image file using Optical Character Recognition (OCR) via Tesseract. The extracted text was then compared against a predefined list of keywords. Any matched keywords were linked to their corresponding Wikipedia pages for further reference.
File Paths Used:
image_path = c:\Temp\Projects\projectOCR\JPG\sourceImage.jpgkeywords_path = c:\Temp\Projects\projectOCR\TXT\keywords.txtoutput_html_path = c:\Temp\Projects\projectOCR\HTML\OCR-Report.html