Allscreenshots Docs

Content archiving

Preserve web content for research, posterity, or organizational records

Archive web content by capturing screenshots for long-term preservation, research, or organizational record-keeping.

The problem

Web content is ephemeral. Pages get updated, redesigned, or taken offline entirely. For researchers, historians, and organizations, losing access to web content can mean:

  • Lost research data and citations
  • Missing historical records
  • Broken references in documentation
  • No proof of what was published

The Internet Archive helps, but it doesn't capture everything, and you can't control the timing or coverage.

How Allscreenshots helps

Build your own web archive by capturing screenshots on demand or on a schedule. Preserve exactly the content you need, when you need it.

  • Full page capture — Archive entire pages including all scrollable content
  • PDF generation — Create document-format archives
  • Scheduled archiving — Automatically capture pages at regular intervals
  • Bulk capture — Archive hundreds of pages in a single operation

For comprehensive archiving, combine full page screenshots with PDF generation to preserve both visual appearance and text content.

Quick example

Archive a web page as both a screenshot and PDF:

# Capture as full page screenshot
curl -X POST https://api.allscreenshots.com/v1/screenshots \
  -H "Authorization: Bearer $API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com/important-article",
    "fullPage": true,
    "format": "png",
    "waitUntil": "networkidle"
  }'

# Capture as PDF
curl -X POST https://api.allscreenshots.com/v1/screenshots \
  -H "Authorization: Bearer $API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com/important-article",
    "format": "pdf",
    "pdfOptions": {
      "printBackground": true,
      "format": "A4"
    }
  }'

Set up weekly archiving:

curl -X POST https://api.allscreenshots.com/v1/schedules \
  -H "Authorization: Bearer $API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "Weekly archive",
    "url": "https://example.com/important-page",
    "schedule": "0 0 * * 0",
    "fullPage": true
  }'

Key features for this use case

Next steps

On this page