PDF file to text file?

I have a PDF file that should be converted to a plain text file. I know there are programs that can do this, but I can’t find any. Can someone help me?

Operating System: openSUSE Tumbleweed 20240918
KDE Plasma Version: 6.1.5
KDE Frameworks Version: 6.6.0
Qt Version: 6.7.2
Kernel Version: 6.10.9-1-default (64-bit)
Graphics Platform: X11

Okular -> Export As -> Plain Text

You didn’t look enough … it’s sitting right under your nose :slight_smile:

Jump to a command line:

:~>   pdftotext
pdftotext version 24.03.0
Copyright 2005-2024 The Poppler Developers - http://poppler.freedesktop.org
Copyright 1996-2011, 2022 Glyph & Cog, LLC
Usage: pdftotext [options] <PDF-file> [<text-file>]
  -f <int>             : first page to convert
  -l <int> [ ... ]
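
For example, something along these lines should do it. This is only a rough sketch: the file name bill.pdf and the helper names txt_name and pdf_to_text are made up for illustration, while -f, -l and -layout are regular pdftotext options.

```shell
# Derive the output name: bill.pdf -> bill.txt (hypothetical helper)
txt_name() {
    printf '%s\n' "${1%.pdf}.txt"
}

# Convert a PDF to plain text, keeping the physical page layout
# (hypothetical helper; assumes pdftotext is on the PATH)
pdf_to_text() {
    pdftotext -layout "$1" "$(txt_name "$1")"
}

# Usage (assumes bill.pdf exists in the current directory):
#   pdf_to_text bill.pdf                      # whole document -> bill.txt
#   pdftotext -f 1 -l 2 bill.pdf first2.txt   # pages 1 to 2 only
```

If you leave out the text file name, pdftotext writes to a file named after the PDF with a .txt extension.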

There’s also Okular, as mentioned, although it’s not always successful.

You can also use the Acrobat online service - upload the PDF and it’ll offer a DOCX file.

You can even open it in LibreOffice and highlight the text you want, and more.

EDIT: I figured I’d do a test … I opened my recent electricity bill with Okular and did the conversion … then ran pdftotext on the same file. Okular created an empty text file (failed) … pdftotext properly converted all the text. LibreOffice was also successful.
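
An empty result like that is easy to check for from the shell. A rough sketch, where has_text is a made-up helper name and bill.txt is an assumed file name; it just looks for at least one letter or digit in the converted file:

```shell
# Succeeds if the file contains at least one letter or digit
# (hypothetical helper, not part of any of the tools above)
has_text() {
    grep -q '[[:alnum:]]' "$1"
}

# Usage (bill.txt is an assumed file name):
#   has_text bill.txt || echo "bill.txt looks empty, try another tool"
```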

Thanx :slightly_smiling_face:

I just compiled Stirling-PDF, which touts that it does everything PDF. I’ve only just started experimenting with it, but it seems to have every function I could conceive of needing for a PDF.

tom kosvic