I have a PDF file that should be converted to a plain text file. I know there are programs that can do this, but I can’t find any. Can someone help me?
You didn’t look hard enough … it’s sitting right under your nose.
Jump to a command line:
:~> pdftotext
pdftotext version 24.03.0
Copyright 2005-2024 The Poppler Developers - http://poppler.freedesktop.org
Copyright 1996-2011, 2022 Glyph & Cog, LLC
Usage: pdftotext [options] <PDF-file> [<text-file>]
-f <int> : first page to convert
-l <int> [ ... ]
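If you'd rather script it than type it each time, here is a minimal sketch that wraps the command from Python. It only uses the -f and -l options and the positional arguments shown in the usage text above; the helper names (build_pdftotext_cmd, convert) and the file name bill.pdf are made up for illustration. I'm assuming pdftotext is on your PATH; the package is usually called poppler-utils or poppler-tools, depending on the distribution.

```python
import shutil
import subprocess


def build_pdftotext_cmd(pdf, txt=None, first=None, last=None):
    """Return the argument list for a pdftotext call (nothing is executed here)."""
    cmd = ["pdftotext"]
    if first is not None:
        cmd += ["-f", str(first)]  # first page to convert
    if last is not None:
        cmd += ["-l", str(last)]   # last page to convert
    cmd.append(str(pdf))
    if txt is not None:
        # When no text file is given, pdftotext writes <name>.txt next to the PDF.
        cmd.append(str(txt))
    return cmd


def convert(pdf, txt=None, first=None, last=None):
    """Run pdftotext; raises if the tool is missing or exits with an error."""
    if shutil.which("pdftotext") is None:
        raise FileNotFoundError("pdftotext not found on PATH")
    subprocess.run(build_pdftotext_cmd(pdf, txt, first, last), check=True)


if __name__ == "__main__":
    # Hypothetical file name: prints the command for pages 1-2 of bill.pdf.
    print(" ".join(build_pdftotext_cmd("bill.pdf", "bill.txt", first=1, last=2)))
```

Calling convert("bill.pdf") with no other arguments is the same as typing pdftotext bill.pdf at the prompt.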
There’s also (as mentioned) Okular, although it’s not always successful.
You can also use the Acrobat online service - upload the PDF and it’ll offer a DOCX file.
You can even open it in LibreOffice and select the text you want, among other things.
EDIT: I figured I’d do a test … I opened my recent electricity bill with Okular and did the conversion … then ran pdftotext to do the same conversion. Okular created an empty text file (failed) … pdftotext properly converted all the text. LibreOffice was also successful.
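For anyone repeating that comparison on a batch of files, a quick way to spot the "empty text file" failure is to check whether the output contains anything other than whitespace. This is just a sketch; has_text is a name I made up, and it says nothing about whether the extracted text is actually correct, only that something came out.

```python
from pathlib import Path


def has_text(txt_path):
    """True if the file exists and holds at least one non-whitespace character."""
    path = Path(txt_path)
    if not path.is_file():
        return False
    # strip() also removes form feeds, so a file holding only page breaks counts as empty.
    content = path.read_text(encoding="utf-8", errors="replace")
    return bool(content.strip())
```

A converter that "succeeds" but leaves you with a zero-byte or whitespace-only file will return False here.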
I just compiled Stirling-PDF, which is touted as doing everything PDF-related. I’ve only just started experimenting with it, but it seems to have every function I could conceive of needing for a PDF.