
ArchiveBox

Preserve the web. On infrastructure you control.

Utilities | web-archiving, self-hosted, open-source, docker, bookmarks, data-preservation, cli

About

ArchiveBox is an open-source, self-hosted web archiving tool that saves websites, bookmarks, RSS feeds, social posts, and media into durable formats including HTML, PDF, PNG, WARC, and SQLite. It can be run via Docker, CLI, or a self-hosted web UI, giving individuals and organizations full control over their archived data. It is designed for personal archivists, journalists, researchers, and institutions who want portable, long-lasting captures without relying on third-party services.

Problem

Link rot, platform churn, censorship, and disappearing media permanently erase valuable web content unless it has been captured in a reliable local archive.

For

Individuals, professionals (lawyers, journalists), and institutions (researchers, libraries, governments) who want to self-host web archives

How it works

Users install ArchiveBox via Docker Compose or other methods, then feed it URLs or bookmark exports; it captures pages using tools like Chrome, wget, and yt-dlp and stores outputs as ordinary files organized in a local data directory.
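
The "feed it URLs or bookmark exports" step can be sketched roughly as follows. This is a minimal illustration, not ArchiveBox's own importer: the helper names (extract_urls, build_add_command, archive) and the URL regex are hypothetical, and the only ArchiveBox feature assumed is the `archivebox add` CLI subcommand, run from inside an already-initialized data directory. By default the sketch only builds the command and does not execute it.

```python
import re
import subprocess
from typing import Iterable, List

# Hypothetical, deliberately simple pattern: grabs http(s) URLs out of plain
# text or a bookmark export. ArchiveBox's real parsers are more thorough.
URL_PATTERN = re.compile(r"https?://[^\s\"'<>]+")


def extract_urls(text: str) -> List[str]:
    """Return the unique http(s) URLs found in text, in first-seen order."""
    seen = set()
    urls = []
    for url in URL_PATTERN.findall(text):
        if url not in seen:
            seen.add(url)
            urls.append(url)
    return urls


def build_add_command(urls: Iterable[str]) -> List[str]:
    """Build an argv list that passes the URLs to `archivebox add`."""
    return ["archivebox", "add", *urls]


def archive(urls: Iterable[str], data_dir: str, dry_run: bool = True) -> List[str]:
    """Return the command; run it in data_dir only when dry_run is False.

    Assumes the archivebox CLI is on PATH and data_dir was already
    initialized as an ArchiveBox data directory.
    """
    cmd = build_add_command(urls)
    if not dry_run:
        subprocess.run(cmd, cwd=data_dir, check=True)
    return cmd


if __name__ == "__main__":
    sample_export = (
        '<DT><A HREF="https://example.com/a">A</A>\n'
        '<DT><A HREF="https://example.com/b">B</A>\n'
        "https://example.com/a"
    )
    print(archive(extract_urls(sample_export), data_dir="."))
```

Keeping dry_run on by default lets the command be inspected before anything is fetched; under Docker Compose the same arguments would instead be passed through the container rather than run on the host.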

Business model

open-source

Status

launched

Similar projects