How to spell check a PDF file?

Online: https://t.co/8STYFz9f6i

I would recommend using the command line for this. Here are the steps, courtesy of ChatGPT. They are for Windows:

Step 1: Install Poppler for Windows

In PowerShell, run:

 winget install --id oschwartz10612.Poppler -e

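This gives you the pdftotext program. Verify that it runs by printing the version (the banner you see depends on which pdftotext build is found first on your PATH):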

PS C:\Users\siddj> pdftotext -v
pdftotext version 4.00
Copyright 1996-2017 Glyph & Cog, LLC

Step 2: Install codespell – the spell checker

Again in PowerShell:

py -m pip install codespell

This installs the codespell spell checker, but if you try to run it you will get an error because the directory it is installed in is not added to the PATH by default. pip warns about this during the install:

 WARNING: The script codespell.exe is installed in 'C:\Users\siddj\AppData\Local\Packages\PythonSoftwareFoundation.Python.3.12_qbz5n2kfra8p0\LocalCache\local-packages\Python312\Scripts' which is not on PATH.
  Consider adding this directory to PATH or, if you prefer to suppress this warning, use --no-warn-script-location.

So add that directory to your PATH, then open a new PowerShell window so the change takes effect. Instructions for editing the PATH on Windows can be found elsewhere.
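To confirm that the PATH change worked, a small Python check can help. This is only a sketch: the helper name is_on_path is made up for illustration, and the directory you pass in is whatever pip printed in its warning on your machine.

```python
import os
import shutil


def is_on_path(directory, path_value=None):
    """Return True if `directory` is one of the entries in PATH.

    `is_on_path` is a hypothetical helper, not part of pip or codespell.
    `path_value` defaults to the PATH of the current process.
    """
    if path_value is None:
        path_value = os.environ.get("PATH", "")
    # Normalise so that trailing slashes (and, on Windows, letter case)
    # do not cause a false mismatch.
    wanted = os.path.normcase(os.path.normpath(directory))
    entries = [entry for entry in path_value.split(os.pathsep) if entry]
    return any(
        os.path.normcase(os.path.normpath(entry)) == wanted for entry in entries
    )


if __name__ == "__main__":
    # shutil.which returns the full path of the executable, or None
    # if it cannot be found on PATH.
    print("codespell found at:", shutil.which("codespell"))
    print("pdftotext found at:", shutil.which("pdftotext"))
```

Run it from a freshly opened PowerShell window. If it prints None for codespell, the Scripts directory is still not on the PATH seen by that window.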

Step 3: Run it!

pdftotext -layout resume2.pdf - | codespell -
41: · Developed a content management sysem for hosting an annual Data Science Showcase at J&J.
        sysem ==> system
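If you have several PDFs to check, the same pipeline can be wrapped in a short Python script. This is a minimal sketch, assuming both pdftotext and codespell are on the PATH. The function names build_commands and spell_check_pdf are my own, and the commands are exactly the ones used above.

```python
import subprocess
import sys


def build_commands(pdf_path):
    """Return the two commands from Step 3 as argument lists."""
    extract = ["pdftotext", "-layout", pdf_path, "-"]  # "-" writes text to stdout
    check = ["codespell", "-"]                         # "-" reads text from stdin
    return extract, check


def spell_check_pdf(pdf_path):
    """Pipe the text of one PDF into codespell and return its report."""
    extract, check = build_commands(pdf_path)
    # check=True raises CalledProcessError if pdftotext fails.
    text = subprocess.run(extract, capture_output=True, check=True).stdout
    # No check=True here: a non-zero exit status from codespell is not
    # treated as a failure of the script.
    result = subprocess.run(check, input=text, capture_output=True)
    return result.stdout.decode("utf-8", errors="replace")


if __name__ == "__main__":
    # Usage: py check_pdfs.py resume2.pdf other.pdf
    for pdf in sys.argv[1:]:
        print(f"== {pdf} ==")
        print(spell_check_pdf(pdf), end="")
```

Save it under any name (check_pdfs.py here is just an example) and pass the PDF files as arguments.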

Well worth it!
