Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error: simplified Chinese blank bug #140

Open
DwightNotFound opened this issue Jan 2, 2025 · 1 comment
Open

Error: simplified Chinese blank bug #140

DwightNotFound opened this issue Jan 2, 2025 · 1 comment

Comments

@DwightNotFound
Copy link

it seems that the software has a great accuracy for simplified Chinese, however it introduces unnecessary blank in between characters, making it hard to use. is there any solution?
Screenshot_20250102-224626
here is an example.

@benhaotang
Copy link
Contributor

benhaotang commented Jan 9, 2025

This seems to be a problem from tesseract tesseract-ocr/tesseract#4031 . Are you using "fast" or "best" models here? for "best" maybe the only solution is to implement a space removal function.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants