Converting PDFs to EPUB - How do I?

You can do it online, for free.

Not sure how good each site is, but there are several.
I've never done it for entire books.
 
Calibre handles both EPUB-to-PDF and PDF-to-EPUB conversion on Linux.

Ubuntu / Debian:
sudo apt-get -y install calibre

Arch Linux:
sudo pacman -S calibre
or
sudo pacman -S calibre-python3

And then simply run the command:
ebook-convert <pdfName>.pdf <epubName>.epub

Or you could use the GUI application if you don't feel comfortable with the terminal.
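
If you have more than one PDF, the same ebook-convert command can be wrapped in a loop. This is only a rough sketch: epub_name and convert_all are made-up helper names, and it assumes ebook-convert is already on your PATH after installing Calibre.

```shell
#!/bin/sh
# Hypothetical helper: derive the EPUB name from a PDF name (book.pdf -> book.epub).
epub_name() {
    printf '%s\n' "${1%.pdf}.epub"
}

# Hypothetical helper: convert every PDF in a directory (default: current directory).
convert_all() {
    dir="${1:-.}"
    for pdf in "$dir"/*.pdf; do
        # Skip the unexpanded pattern when the directory contains no PDFs.
        [ -e "$pdf" ] || continue
        ebook-convert "$pdf" "$(epub_name "$pdf")"
    done
}
```

Save it to a file, source it, and run something like `convert_all ~/books`. Each EPUB ends up next to its PDF.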
 
Thanks, it seems to be fucking up a bit so I'll try different sites I guess.
 
Pretty fucking rare. I tried hitting up the IRC channels, but nothing. Just some Dostoevsky and Freud analysis written by Kaufmann. First time I've hit a roadblock, and I've found some pretty obscure books. You must be reading some seriously esoteric stuff.
It's a good book, well worth the read.
 
Calibre GUI is what I use. It's a bit clunky for my tastes, but it gets the job done. That's also what I used to combine the last two GoT books into one with the chapters all in the correct order.
 
The issue that I'm having is that it doesn't do a very good job of formatting: the table of contents bleeds into the foreword and most sentences are missing the first few words.
 
It's entirely possible that the problem is the PDF rather than the converter. There are plenty of ways to construct a PDF that looks fine to a human reader but is so profoundly fucked up under the hood that an automated parser can't handle it unless you explicitly tell it how.

A quick shit test you can do is to select all the text in the PDF, copy it, and paste it into a plain text file. If it looks normal then congrats, you now have a file you can easily convert to whatever format you need. If it looks like it has the same issues as when you converted it then you've got a sped PDF.

If you want some help un-tarding your PDF I can take a look at it.
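
The copy-and-paste test can also be done from the terminal. A minimal sketch, assuming pdftotext from poppler-utils is installed (that tool is my suggestion, not something mentioned earlier in the thread); txt_name and extract_text are made-up helper names.

```shell
#!/bin/sh
# Hypothetical helper: derive the text file name (book.pdf -> book.txt).
txt_name() {
    printf '%s\n' "${1%.pdf}.txt"
}

# Hypothetical helper: dump the PDF's text layer and preview the start of it.
# Assumes pdftotext (poppler-utils) is installed.
extract_text() {
    pdf="$1"
    txt="$(txt_name "$pdf")"
    pdftotext "$pdf" "$txt" || return 1
    # Print the first 40 lines so you can eyeball whether the text is intact.
    head -n 40 "$txt"
}
```

Run `extract_text book.pdf` and read the preview. If it looks normal, Calibre should be able to take the text file directly, e.g. `ebook-convert book.txt book.epub`.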
 
Might wanna try OCR'ing the PDF, editing the text, and then converting that to EPUB. gImageReader is probably the best OCR tool.
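
gImageReader is a GUI front-end for Tesseract, so roughly the same thing can be scripted. A rough sketch, assuming pdftoppm (poppler-utils) and tesseract with English language data are installed; ocr_pdf is a made-up helper name, and 300 dpi is just a starting value I picked, not a recommendation from anywhere.

```shell
#!/bin/sh
# Hypothetical helper: OCR a PDF into one plain text file.
# Usage: ocr_pdf input.pdf [output.txt]
ocr_pdf() {
    pdf="$1"
    out="${2:-ocr.txt}"
    work="$(mktemp -d)" || return 1
    # Render each page to a PNG (page-1.png, page-2.png, ...) at 300 dpi.
    pdftoppm -r 300 -png "$pdf" "$work/page" || { rm -rf "$work"; return 1; }
    : > "$out"
    for img in "$work"/page-*.png; do
        [ -e "$img" ] || continue
        # With "stdout" as the output base, tesseract prints the recognized text.
        tesseract "$img" stdout -l eng >> "$out"
    done
    rm -rf "$work"
}
```

After `ocr_pdf book.pdf book.txt`, fix up the text by hand and then convert it with `ebook-convert book.txt book.epub`.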
 
A quick shit test you can do is to select all the text in the PDF, copy it, and paste it into a plain text file.
I'll give that a shot and let you know, thanks.
Might wanna try OCR'ing the PDF, editing the text, and then converting that to EPUB. gImageReader is probably the best OCR tool.
If all else fails I'll give that a try.
 