Skip to main content

PDF Compression Explained: How to Reduce File Size Without Losing Quality

Learn what gets removed when you compress a PDF, the best methods to reduce file size, and when compression helps versus when it hurts quality.

Documents·6 min read·
PDF Compression Explained: How to Reduce File Size Without Losing Quality

A 25 MB PDF that should be 2 MB is a common problem. It is too large to attach to an email, too slow to upload to a web form, and frustrating to share with clients or colleagues. PDF compression is the solution, but many people do not understand what compression actually does, which means they either over-compress (losing quality they needed) or avoid it entirely because they are afraid of damaging the document.

This guide explains exactly what happens when you compress a PDF, the different methods available, and when to use each approach.

What Is Inside a PDF?

Before understanding compression, it helps to know what a PDF actually contains. A PDF file can include:

  • Text content stored as vector data, which is inherently small and scales without quality loss
  • Embedded images (photographs, screenshots, scanned pages) stored as raster data, which can be very large
  • Fonts embedded so the document looks the same on every device
  • Metadata like author, creation date, and document properties
  • Color profiles and ICC color data
  • Thumbnails for preview display
  • Bookmarks and annotations like comments and highlights
  • Form fields and interactive elements
  • JavaScript for interactive PDF forms

For most documents, images are responsible for 70-90% of the file size. Compression tools primarily target images.

How PDF Compression Works

When you compress a PDF, the tool generally applies one or more of these techniques:

Image Downsampling

Images are reduced in resolution (measured in DPI — dots per inch). A photograph scanned at 600 DPI contains four times as many pixels as the same image at 300 DPI, which means it takes four times the storage. For screen viewing and standard printing, 150 DPI is usually sufficient. For professional printing, 300 DPI is the standard target.

Downsampling is irreversible. Once you reduce an image's resolution, you cannot recover the lost pixels from the compressed file.

Image Recompression

Images inside a PDF can be recompressed using different algorithms. JPEG compression at a lower quality setting reduces file size at the cost of some image detail. This is the most aggressive compression technique and produces the largest size reductions, but also the most visible quality loss on photographs.

Font Subsetting

If a PDF embeds a complete font (all characters), the compressor may replace it with a subset containing only the characters actually used in the document. A document that only uses basic Latin characters does not need the entire Unicode range embedded.

Removing Metadata and Hidden Content

Many PDF creation tools embed metadata, undo history, hidden layers, and other overhead data that is not visible to the reader. Stripping this can meaningfully reduce file size without affecting the visible document at all.

Flattening Transparency

PDFs with transparent overlapping elements (common in design software exports) require complex transparency calculations during rendering. Flattening transparency pre-renders these calculations and can reduce both file size and rendering complexity.

Common Compression Scenarios

Scanned Documents

Scanned PDFs are almost always larger than they need to be. A flatbed scanner at default settings typically produces 300-600 DPI images. For document archiving, 300 DPI is appropriate. For email sharing, 150 DPI is usually fine and reduces file size by 75% compared to 300 DPI.

Photo-Heavy Reports and Presentations

Marketing reports, product catalogs, and presentation decks exported from PowerPoint or InDesign often contain high-resolution photographs that were appropriate for print but excessive for digital sharing. Reducing embedded images to 150 DPI typically produces excellent screen results at a fraction of the original size.

Legal and Financial Documents

These typically contain mostly text and simple graphics. Text-only PDFs are already small and do not compress much further. If a legal document is large, it is usually because it includes scanned signature pages or attached exhibits. Compress the attachments separately if possible.

Quality vs. Size Trade-off

Every compression level represents a trade-off between file size and visual quality:

Compression levelTypical size reductionBest for
Low (gentle)20-40%Professional printing, archiving
Medium50-70%Email sharing, web upload
High (aggressive)70-90%Web thumbnails, previews

For documents where visual quality matters (photography books, design portfolios, medical imaging), use minimal compression. For documents where readability matters but photographic quality is secondary (contracts, reports, invoices), medium compression is the right balance.

What Compression Cannot Do

Compression cannot make a PDF smaller than its actual text and vector content. A 100-page text-only PDF might already be 500 KB and have nowhere to go. Compression tools also cannot recover data that was never there — a low-resolution source image will not look better after compression.

Compressing PDFs Without Uploading to a Server

Privacy matters when compressing sensitive documents. Many online PDF tools upload your files to cloud servers for processing, which is a concern for legal documents, financial records, medical records, and confidential business materials.

Our PDF compressor processes your file entirely in your browser using client-side JavaScript. Your document never leaves your device, which means no server ever sees your content.