Navigation and service

Overview

The sets of German National Library's metadata, the authority data from the Integrated Authority Files (GND) as well as the bibliographic and holdings data of the German Union Catalogue of Serials (ZDB) on this site are updated regularly. They are provided free of charge and for free subsequent use.

Along with these data dumps, we also offer other selections of bibliographic data relating to individual digital collections in the DNBLab.

An overview of all available metadata as well as the different options of obtaining data is given here.

Complete sets

You can view the current data version after accessing the download link in the respective data description.

Full copy Integrated Authority Files
Full copyProvision (update)Format, encoding
UTF-8 decomposed
Number of records/file size (zipped)
Integrated Authority File (GND)February/March,
June/July (cancelled 2024)
October/November
MARC 21
MARC21-xml
approx. 9.6 million/approx 2 GB
Integrated Authority File (GND)February/March,
June/July (cancelled 2024)
October/November
RDF (RDF/XML)
RDF (Turtle)
RDF (JSON-LD)
HDT file
N-Triples
approx. 9.6 million/approx 1.5 GB
Entity Factsonce a monthRDF (JSON-LD)approx. 8.9 million/approx 1.2 GB
Cross-concordancesOctober/NovemberRDF (RDF/XML)
RDF (Turtle)
RDF (JSON-LD)
N-Triples
*

* Contains only terminology-specific datasets from cross-concordances.

Some geographical authority data contains unchanged coordinates from the GeoNames database and has done so since the end of January 2014.

The dataset authorities-gnd_umlenk_loesch_JJJJMMTT with abbreviated data records is provided as an additional service to track all redirects ("x" in position 5 of the leader in MARC 21) and deletions ("d" in position 5 of the leader in MARC 21) made since the previous full copy of the GND. This file is useful for customers who used the control number in MARC field 001 to determine the valid GND number for redirects when transferring the GND data.
The file is not relevant for customers who synchronise their GND collection with the GND, i.e. who constantly track redirects and deletions via the OAI interface or the weekly change service.
Further information on the process of redirects and deletions


Full copy Bibliographic data
Full copyProvision (update)

Format, encoding

UTF-8 decomposed

Number of records/file size (zipped)
Bibliographic data of DNB*/** February/March,
June/July (cancelled 2024)
October/November
MARC 21
MARC21-xml
approx. 31 million/approx. 10.3 GB
Bibliographic data of DNB*/***February/March,
June/July (cancelled 2024)
October/November
RDF (RDF/XML)
RDF (Turtle)
RDF (JSON-LD)
HDT file
N-Triples
approx. 29.8 million/approx. 4.9 GB

* Contains records that are not part of Deutsche Nationalbibliografie (German National Bibliography).
** A full copy of the bibliographic data with hyperlinks to digitised tables of contents is provided yearly in February free of charge in MARC 21 (also in XML structure).
*** All bibliographic data that has been converted to RDF format.

Some bibliographic data contains class information from the Thema subject classification system for books; this has been the case since October 2015.


Full copy of the German Union Catalogue of Serials (ZDB)
Full copy Provision (update)

Format, encoding

UTF-8 decomposed

Number of records/file size (zipped)
Bibliographic data of ZDBMarch
October/November
MARC 21
MARC21-xml
approx. 2.1 million/approx. 700 MB
Bibliographic data of ZDBMarch
October/November
RDF (RDF/XML)
RDF (Turtle)
RDF (JSON-LD)
HDT file
N-Triples
approx. 2.1 million/approx. 350 MB
ZDB holding dataMarch
October/November
MARC 21
MARC21-xml
approx. 19.6 million/approx. 1.5 GB
Address data (ISIL- and Library Code Directory)March
October/November
RDF (RDF/XML)
RDF (Turtle)
RDF (JSON-LD)
HDT file
N-Triples
approx. 20.900/approx. 4 MB

Along with these data dumps, we also offer (theme-based) bibliographic data sets and open access digital object collections. You will find more information about these in the DNBLab.

Ongoing updates

Full copies can be regularly updated free of charge through the OAI interface as well as through the WWW and SFTP servers.

Archived full copies

Are you interested in archived full copies for research purposes?

The German National Library has been archiving full copies of its bibliographic data, the authority data of the Integrated Authority File (GND), its address data (ISIL and Library Code Directory) and the bibliographic and holdings data of the German Union Catalogue of Serials (ZDB) every year since 2021.
If you are interested, please send an email to metadatendienste@dnb.de.

Terms of use and provision

Detailed information on terms of use and provision is given here.

Frequently asked questions (FAQ)

Where can I get test data?

Currently valid test data in the MARC 21 format (also in XML structure) can be accessed at any time via the respective link to the current deliveries of the desired metadata.
In addition the catalogue as well as the metadata shop offer access to data. In order to illustrate the regular format changes test data is available in the format MARC 21 (also in XML structure), and RDF (various serialisations). All formats are offered in UTF-8 decomposed character encoding.

Further processing

If you are just getting started with processing metadata, useful programmes include the software suite Catmandu, OpenRefine or Metafacture, while data can be analysed with "Konstanz Information Miner" (KNIME) or the Metadata Quality Assurance Framework. A more detailed overview is provided in the presentation slides "Open Source Software zur Verarbeitung und Analyse von Metadaten" (available only in German) and the article "Survey of Tools for Linked Data Consumption".

Contact

metadatendienste@dnb.de

News

Last changes: 04.11.2024
Short-URL: https://www.dnb.de/dumps
Contact: metadatendienste@dnb.de

to the top