It looks like you're offline.
Open Library logo
additional options menu
Last edited by raybb
January 10, 2025 | History

Open Library Data Dumps

Open Library provides dumps of all its data, generated every month. Most of the data dumps are formatted as tab separated files with the following columns:

Dumps

For past dumps, see: https://archive.org/details/ol_exports?sort=-publicdate

Downloading the dumps take too long? Checkout the link above and download via torrent for higher speeds!

Format of JSON records

A JSON schema for the various types is located at https://github.com/internetarchive/openlibrary-client/tree/master/olclient/schemata

Using Open Library Data Dumps

This guide by contributor on the LibrariesHacked GitHub about how to load Open Library's data dumps into PostgreSQL to make it more easily queriable:
https://github.com/LibrariesHacked/openlibrary-search

DuckDB

DuckDB is another easy tool to query the dump without much work.

For example:
If you wanted to get all the Wikidata IDs currently in the authors table:
`<br /> SELECT json_extract(column4, '$.remote_ids.wikidata') as wikidata_id<br /> FROM read_csv('ol_dump_authors_2024-07-31.txt.gz') <br /> WHERE wikidata_id IS NOT NULL <br /> LIMIT 100;<br /> `

GraphQL

DiFronzo on GitHub has produced a GraphQL proxy to search books using work, edition and ISBN with the Open Library API. Deployed with Deno and GraphQL:

https://github.com/DiFronzo/OpenLibrary-GraphQL
DiFronzo/OpenLibrary-GraphQL

OL Covers Dump

We do not yet have rolling monthly dumps of our book covers, despite a shared desire for their existence. Some historical cover dumps may be explored here:
https://archive.org/details/ol_data?tab=collection&query=identifier%3Acovers&sort=-addeddate

Most covers are archived in the following items. Note covers_0006 and covers_0007 are presently unavailable.

History

January 10, 2025 Edited by raybb fix typos
January 4, 2025 Edited by raybb add DuckDB note
August 7, 2024 Edited by Drini Fix dump sizes / instructions
August 7, 2024 Edited by Drini New dumps are now available!
December 14, 2011 Created by Anand Chitipothu Documented Open Library Data Dumps