
Plenty of scanners work fine with SANE, but you're not going to get OCR out of it.


tesseract is a CLI OCR program. The Arch Linux community repo also has ocrad, cuneiform and gocr, but I have not used those.


Open source OCR programs have not yet reached the level of what is commercially available, and none of this stuff is packaged together in an easy-to-use way for OCRing scanned documents.
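Since nothing ships as one package, the pieces have to be glued together by hand. Here is a rough sketch of what that glue might look like in Python, calling SANE's scanimage and then tesseract. The function names, the page.tiff file name and the 300 dpi default are my own placeholders, and --resolution is a backend-specific option that your scanner's backend may not accept, so treat this as a starting point and not a tested workflow.

```python
import subprocess
from pathlib import Path


def scan_command(resolution_dpi: int = 300) -> list[str]:
    """Build a scanimage command that writes a TIFF image to stdout."""
    # Assumption: the scanner backend accepts --resolution; not all do.
    return ["scanimage", "--format=tiff", "--resolution", str(resolution_dpi)]


def ocr_command(image: Path, language: str = "eng") -> list[str]:
    """Build a tesseract command for one image.

    tesseract takes an output base name and appends .txt itself,
    so page.tiff is recognized into page.txt.
    """
    output_base = image.with_suffix("")
    return ["tesseract", str(image), str(output_base), "-l", language]


def scan_and_ocr(image: Path, language: str = "eng") -> str:
    """Scan one page to `image`, run OCR on it, and return the text."""
    # scanimage writes the image data to stdout, so redirect it to a file.
    with image.open("wb") as tiff:
        subprocess.run(scan_command(), stdout=tiff, check=True)
    subprocess.run(ocr_command(image, language), check=True)
    return image.with_suffix(".txt").read_text(encoding="utf-8")
```

With a scanner attached and both programs installed, scan_and_ocr(Path("page.tiff")) should return the recognized text. How good that text is depends on the OCR engine and the scan quality.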



