How to Use Tesseract OCR to Convert PDFs to Text

I have some PDFs which I need to get typed up into text to edit. I decided to go with Tesseract OCR as it seems to be the best tool for the job. Here are the steps for how to use Tesseract OCR to convert PDFs to text. Installation First things first, get Tesseract CLI installed. Follow the instructions here, these are linked to from the official Tesseract docs. sudo add-apt-repository ppa:alex-p/tesseract-ocr-devel sudo apt-get update sudo apt install tesseract-ocr tesseract-ocr-eng Note: the package didn’t properly place the eng....

February 20, 2022 · 3 min · chart