PDF File Handling Tutorials


Learn how to handle PDF files in Python, from extracting links, images to inserting watermarks and manipulating text.

How to Convert HTML to PDF in Python
How to Convert HTML to PDF in Python

Learn how you can convert HTML pages to PDF files from an HTML file, URL or even HTML content string using wkhtmltopdf tool and its pdfkit wrapper in Python.

How to Extract Text from PDF in Python
How to Extract Text from PDF in Python

Learn how to extract text as paragraphs line by line from PDF documents with the help of PyMuPDF library in Python.

How to Split PDF Files in Python
How to Split PDF Files in Python

Learn how you can make a PDF splitter script with the help of pikepdf library in Python.

How to Extract PDF Metadata in Python
How to Extract PDF Metadata in Python

Learn how to use pikepdf library to extract useful information from PDF files in Python.

How to Sign PDF Files in Python
How to Sign PDF Files in Python

Learn how to generate self-signed certificates and sign them into PDF files as digital signatures using PyOpenSSL and PDFNetPython3 libraries in Python.

How to Merge PDF Files in Python
How to Merge PDF Files in Python

Learn how to merge two or multiple PDF files into a single PDF file using PyPDF4 library in Python

How to Encrypt and Decrypt PDF Files in Python
How to Encrypt and Decrypt PDF Files in Python

Learn how to add and remove passwords to PDF files using PyPDF4 library, as well as using pyAesCrypt to encrypt and decrypt PDF files in Python

How to Compress PDF Files in Python
How to Compress PDF Files in Python

Learn how to compress PDF files in Python using the wrapper of PDFTron SDK.

How to Convert PDF to Images in Python
How to Convert PDF to Images in Python

Learn how to use PyMuPDF library to convert PDF files into individual images per page in Python.

How to Convert PDF to Docx in Python
How to Convert PDF to Docx in Python

Learn how you can use pdf2docx library to convert PDF files to docx word files in Python