Web archiving
The German National Library's web archive contains archival copies of websites on selected subjects, institutions or events. Our reading rooms also provide access to websites in the .DE domain which the Internet Archive archives, filters and makes available as a separate collection. The German National Library also contributes to the cooperative website collections of the International Internet Preservation Consortium (IIPC).
German National Library's web archive
The German National Library’s legal collection mandate includes the collection, indexing and archiving of websites. Using an automated process known as web harvesting, we create snapshots of the websites, index them in our catalogue and archive them in our web archive.
We collect websites according to specific formal and content-related criteria. In our web archive, you will find the websites of federal authorities and universities, blogs, websites on topics such as history, literature and music, and websites for events such as the federal elections or the 500th anniversary of the Reformation in 2017. We create joint collections in collaboration with libraries that are obligated to maintain web archives at the regional level. One example is the Thuringia Web Archive.
Our web archive is structured by subject category and has a full-text search function. You can also access the content of our web archive through the catalogue.
For copyright reasons, the collected websites can usually only be accessed in our reading rooms in Leipzig and Frankfurt am Main. However, certain web archive content for which we have the rights holder’s consent can also be used outside the reading rooms.
Archiving German-language Twitter
On 20 February 2023, an initiative launched by the Science Data Center for Literature and the German National Library called for a concerted effort to download as many German-language tweets as possible from the Twitter archive. The goal was to build as complete an archive of German-language tweets as possible through crowdsourcing. The German National Library has made archive servers available for permanent storage.
"Sustainable archiving of social media data - Twitter and beyond"
Conference at the German National Library Frankfurt am Main on 19 and 20 March 2024
Archiving, cataloguing and making available dynamic data from social media pose challenges that affect researchers, research institutions, libraries and archives in equal measure. These challenges are best solved through collaboration and partnership, because the wide-ranging efforts required would be impossible for a single data community or discipline. A conference at the German National Library in Frankfurt am Main on 19 and 20 March 2024 will explore these questions.
Call for Participation: Twitter Datasprint
Do you have research questions for which you are keen to analyse large volumes of German-language tweets? Are Twitter data of interest for your research in the humanities or social, natural or life sciences? Or do you have a passion for visualising social media data?
Then come to our two-day data sprint on 21 and 22 March 2024.
Frequently asked questions (FAQ)
Web archive of international thematic and event-based collections
The German National Library contributes to the cooperative collections of the International Internet Preservation Consortium (IIPC). The IIPC’s member organisations collate websites on globally relevant topics such as climate change, the refugee crisis and the coronavirus pandemic, and on events such as European elections and the Olympic Games. The result is an international perspective on the latest events and the way in which they are depicted on the internet. These thematic collections are freely accessible from anywhere in the world.
Contact
Online publication service
Phone +49 341 2271-282
Last changes:
20.12.2023
Short-URL:
https://www.dnb.de/webarchiv