How do you encode your paper scans?

Atemu@lemmy.ml · 1 year ago

How do you encode your paper scans?

lemming007@lemm.ee · 1 year ago

PDF/A

kyle@infosec.pub · edit-2 1 year ago

I’ve never used paperless but just checked it out and it looks pretty neat. My first thought would be to scan documents in a higher resolution, let the OCR happen, then convert the file to a JPEG or something smaller after you’ve extracted the text.

I spent a few minutes looking at their wiki and it looks like it might be possible.

Like I said though, no experience with this software so I’m not sure that’d actually work.

Atemu@lemmy.ml · 1 year ago

Interesting idea but I think I’d like to retain similar to original quality in case I wanted to redo OCR if/when Paperless’ OCR improves in the future.

surewhynotlem@lemmy.world · 9 months ago

By ‘paperless’, y’all mean this one? https://docs.paperless-ngx.com/

Atemu@lemmy.ml · 9 months ago

Correct. That’s the currently maintained paperless project.

surewhynotlem@lemmy.world · 9 months ago

Thanks! There’s a very interesting trail of dead projects to follow. But I got ngx working and it’s great so far.