synthetics_handwritten_OCR/OCR_earsing
2024-09-16 19:11:05 +07:00
..
latent_diffusion First commit OCR_earsing and Synthetics Handwritten Recognition awesome repo 2024-09-16 19:11:05 +07:00
model_checkpoint First commit OCR_earsing and Synthetics Handwritten Recognition awesome repo 2024-09-16 19:11:05 +07:00
utils First commit OCR_earsing and Synthetics Handwritten Recognition awesome repo 2024-09-16 19:11:05 +07:00
__init__.py First commit OCR_earsing and Synthetics Handwritten Recognition awesome repo 2024-09-16 19:11:05 +07:00
ocr_eraser.py First commit OCR_earsing and Synthetics Handwritten Recognition awesome repo 2024-09-16 19:11:05 +07:00
README.md First commit OCR_earsing and Synthetics Handwritten Recognition awesome repo 2024-09-16 19:11:05 +07:00
requirements.txt First commit OCR_earsing and Synthetics Handwritten Recognition awesome repo 2024-09-16 19:11:05 +07:00

OCR Earsing

This repository contains the code for erasing the OCR text from the images. Code is written based on OCR_SAM

Have 2 method to earse the text from the image:

  • Traditional method: using the bounding box of the text to earse the text, and use function OpenCV to earse the text.
  • Deep learning method: using the bounding box of the text, segment the text based on SAM (Segment Anything) and use diffuse model to earse the text.

In my code, I use the deep learning method to earse the text from the image.

Installation

pip install -r requirements.txt

Run

python ocr_eraser.py