PDF File Handling Tutorials


Learn how to handle PDF files in Python, from extracting links, images to inserting watermarks and manipulating text.

How to Lock PDFs in Python
How to Lock PDFs in Python

Discover how to utilize the PyPDF2 library to password-protect and encrypt PDF files, ensuring top-notch security for sensitive documents. Learn to create a secure PDF locker and adopt strong password practices for enhanced cybersecurity.

How to Make a PDF Viewer in Python
How to Make a PDF Viewer in Python

Learn how you can build a PDF viewer GUI from scratch using Tkinter and PyMuPDF libraries in Python.

How to Convert HTML to PDF in Python
How to Convert HTML to PDF in Python

Learn how you can convert HTML pages to PDF files from an HTML file, URL or even HTML content string using wkhtmltopdf tool and its pdfkit wrapper in Python.

How to Extract Text from PDF in Python
How to Extract Text from PDF in Python

Learn how to extract text as paragraphs line by line from PDF documents with the help of PyMuPDF library in Python.

How to Split PDF Files in Python
How to Split PDF Files in Python

Learn how you can make a PDF splitter script with the help of pikepdf library in Python.

How to Extract PDF Metadata in Python
How to Extract PDF Metadata in Python

Learn how to use pikepdf library to extract useful information from PDF files in Python.

How to Sign PDF Files in Python
How to Sign PDF Files in Python

Learn how to generate self-signed certificates and sign them into PDF files as digital signatures using PyOpenSSL and PDFNetPython3 libraries in Python.

How to Merge PDF Files in Python
How to Merge PDF Files in Python

Learn how to merge two or multiple PDF files into a single PDF file using PyPDF4 library in Python

How to Encrypt and Decrypt PDF Files in Python
How to Encrypt and Decrypt PDF Files in Python

Learn how to add and remove passwords to PDF files using PyPDF4 library, as well as using pyAesCrypt to encrypt and decrypt PDF files in Python

How to Compress PDF Files in Python
How to Compress PDF Files in Python

Learn how to compress PDF files in Python using the wrapper of PDFTron SDK.

How to Convert PDF to Images in Python
How to Convert PDF to Images in Python

Learn how to use PyMuPDF library to convert PDF files into individual images per page in Python.

How to Convert PDF to Docx in Python
How to Convert PDF to Docx in Python

Learn how you can use pdf2docx library to convert PDF files to docx word files in Python

How to Extract Text from Images in PDF Files with Python
How to Extract Text from Images in PDF Files with Python

Learn how to leverage tesseract, OpenCV, PyMuPDF and many other libraries to extract text from images in PDF files with Python

How to Highlight and Redact Text in PDF Files with Python
How to Highlight and Redact Text in PDF Files with Python

Learn how to use PyMuPDF library to highlight, frame, underline, strikeout and redact text in PDF Files with Python.

How to Watermark PDF Files in Python
How to Watermark PDF Files in Python

Learn how to add and remove watermarks to/from PDF files with PyPDF4 and reportlab libraries in Python.

How to Extract Images from PDF in Python
How to Extract Images from PDF in Python

Learn how to extract and save images from PDF files in Python using PyMuPDF and Pillow libraries.