PDF File Handling Tutorials


Learn how to handle PDF files in Python, from extracting links, images to inserting watermarks and manipulating text.

How to Extract PDF Metadata in Python
How to Extract PDF Metadata in Python

Learn how to use pikepdf library to extract useful information from PDF files in Python.

How to Sign PDF Files in Python
How to Sign PDF Files in Python

Learn how to generate self-signed certificates and sign them into PDF files as digital signatures using PyOpenSSL and PDFNetPython3 libraries in Python.

How to Merge PDF Files in Python
How to Merge PDF Files in Python

Learn how to merge two or multiple PDF files into a single PDF file using PyPDF4 library in Python

How to Encrypt and Decrypt PDF Files in Python
How to Encrypt and Decrypt PDF Files in Python

Learn how to add and remove passwords to PDF files using PyPDF4 library, as well as using pyAesCrypt to encrypt and decrypt PDF files in Python

How to Compress PDF Files in Python
How to Compress PDF Files in Python

Learn how to compress PDF files in Python using the wrapper of PDFTron SDK.

How to Convert PDF to Images in Python
How to Convert PDF to Images in Python

Learn how to use PyMuPDF library to convert PDF files into individual images per page in Python.

How to Convert PDF to Docx in Python
How to Convert PDF to Docx in Python

Learn how you can use pdf2docx library to convert PDF files to docx word files in Python

How to Extract Text from Images in PDF Files with Python
How to Extract Text from Images in PDF Files with Python

Learn how to leverage tesseract, OpenCV, PyMuPDF and many other libraries to extract text from images in PDF files with Python

How to Highlight and Redact Text in PDF Files with Python
How to Highlight and Redact Text in PDF Files with Python

Learn how to use PyMuPDF library to highlight, frame, underline, strikeout and redact text in PDF Files with Python.

How to Watermark PDF Files in Python
How to Watermark PDF Files in Python

Learn how to add and remove watermarks to/from PDF files with PyPDF4 and reportlab libraries in Python.