Skip to content

PDF to Wordpress refactoring

  • Published: 2024-02-13 09:56
  • Updated: 2024-02-13 11:04

I tested a number of ways to transfer a series of plugin manuals for Herbert/Variety of Sound from PDF to Wordpress. First thought about Pandoc, but it only supports conversion to, not from PDF. Ultimately arrived at this. In case you’ve got an automated process: please let me know! 😙

The steps

  1. Convert PDF to Markdown > https://pdf2md.morethan.io/
  2. Switch to edit mode:
    1. Remove the title, potential meta-information and table of content
    2. Remove hyphenation, line-breaks, misinterpreted tags > codeblocks
    3. Replace '●' list markers with '-'
    4. Rebuild tables in Markdown using Obsidian
  3. When done—copy the entire Markdown from the editor
  4. In Wordpress, create a blank page:
    1. If editor = Gutenberg:
    2. Click “Type / to choose a block”
    3. Paste and watch the magic of Markdown
    4. Insert “Table of Content” block above content
    5. 🏁 Done

Well, images need manual upload and insertion. Additional styling might be nice. And yet, above process gets 90% of the job done. In about 10-15 minutes for a PDF with 10 pages.