Learn how to handle PDF files in Python, from extracting links, images to inserting watermarks and manipulating text.
Learn how to leverage tesseract, OpenCV, PyMuPDF and many other libraries to extract text from images in PDF files with Python
Learn how to use PyMuPDF library to highlight, frame, underline, strikeout and redact text in PDF Files with Python.
Learn how to add and remove watermarks to/from PDF files with PyPDF4 and reportlab libraries in Python.
Learn how to extract and save images from PDF files in Python using PyMuPDF and Pillow libraries.
Learn how you can extract links and URLs from PDF files with Python using pikepdf and PyMuPDF libraries.
Learn how you can use pikepdf, pdf2john and other tools to crack password protected PDF files in Python.
Learning how to extract tables from PDF files in Python using camelot and tabula libraries and export them into several formats such as CSV, excel, Pandas dataframe and HTML.