I’m looking for something that I can scan hand-written notes into and have OCR’d. Maybe one that I can even train on my handwriting. Ideally I end up with a searchable PDF of my notes.
People use one-note for this, but I’m not really comfortable with letting microsoft see my handwriting.
Can you fine tune tesseract on a local hand writing dataset ? Or insert it in context like a pre-prompt ?
It wasn’t possible a year ago when pos6ted around with tesseract. Things might have changed during the last couple of months though.
I found the following It migth be possible and affordable
https://konfuzio.com/en/tesseract/
https://github.com/Matleo/Tesseract_fine_tuning_training
https://groups.google.com/g/tesseract-ocr/c/ZLOZpW1fD6I/m/B1Ponc0VBAAJ
https://arcruz0.github.io/posts/finetuning-tess/