Website-Archivierung an der BSB
Web Archiving: FAQs
Why is the Bavarian State Library interested in archiving my website?
Based on their scientific relevance websites are selected by Bavarian State Library employees with subject expertise and recommended for web archiving. This is supposed to guarantee the future scientific use of the selected resources by means of long term archiving and providing accessibility. Digital knowledge and cultural heritage can thus be preserved for the future.
How are the archived websites made accessible to the public?
All archived websites can be found through the OPACplus-Catalog of the Bavarian State Library. Please use URLs with full http statement only, e.g. http://www.weisse-rose-stiftung.de. Additionally, the archived websites are recorded in the Internet resource guides of the Bavarian State Library’s subject libraries. They are freely accessible for all users worldwide via the Internet.
As an example, please see the record in the BSB-Catalog and then click on ‚Online lesen‘ (read online) or ‚Volltext‘ (full text). This is the overview page of the archive of the website bayerische-landesbibliothek-online.de.
Is there a reference from the archived version to the original website?
Yes, the URL of the original offer is recorded in the OPACplus-Catalog as well as in the Internet resource guides of the Bavarian State Library’s subject gateways and finally in the overview page for each archived website.
Does archiving by the Bavarian State Library result in a competition for my own offer?
No, the web archive of the BSB is not indexed by search engines. Thus the original offer will neither lose any visitors nor will a user unintentionally enter an archived version of your website. Moreover, archived websites are clearly identified as such. Firstly, a ‘banner’ is displayed above each archived website with the information that this is an archived version and secondly, this can also be seen from the archive-URLs, each beginning with langzeitarchivierung.bib-bvb.de.
Which kind of technical conditions has my website to comply with to be archived?
For web archiving no technical preparations or adjustments are necessary on the part of the website operator.
How does web archiving work technically?
The Bavarian State Library employs the Web Curator Tool with the integrated crawler ‘Heritrix’ for the collection or so called ‘crawling’ of websites. This Web crawler follows the link structure of the websites and collects all data found. Thereby external links to other offers are not further tracked. This process is repeated twice a year on a regular basis; thus your website is archived at different points in time. All individual archive copies are then equally made available for use.
Does web archiving have an impact on my server?
The crawler of the Bavarian State Library is configured in a way that the server load is kept as low as possible. If, contrary to expectations, problems occur due to the crawl process please contact us immediately at webarchivierung(at)bsb-muenchen.de. In the log files of your server you can recognise the access of the crawler of the Bavarian State Library by the following signature: Mozilla/5.0 (compatible; crawler long-term preservation project +http://www.babs-muenchen.de).
What is happening with password protected content?
Password protected content is not collected and archived.
Will all contents, design elements and functionalities be preserved when a website is archived?
The Bavarian State Library is always concerned about the complete archiving of websites and the preservation of their original representation. It is tried to collect and archive all objects with a website including HTML, pictures, PDF files, audio and video files and other objects (e.g. scripts). Current Web crawlers though are not capable yet to capture and archive dynamically generated content (e.g. Flash animations), database contents, the Deep Web or streamed contents (i.e. audio or video files which are transferred in real-time such as YouTube). Therefore it can sometimes happen that not all elements of a website are represented in the archived version. External links, forms and search functions will not work in most cases.
What kind of legal requirements have to be met for the archiving of my website?
For legal reasons the Bavarian State Library only collects, archives and provides access to websites if an explicit authorisation has been received. You as the rights owner of a website or as the authorized representative can grant this to us as an answer to our email inquiry or agree with the archiving and provision of access by completing and returning the approval form to the Bavarian State Library. The granting of an authorization must not be opposed to any third party rights (as e.g. copyrights). The approval form is only available in German, because in this case German copyright law applies. You can find it here.
Only the websites of authorities, departments or agencies of the State of Bavaria as official publications can be archived and made accessible without obtaining an explicit authorization according to the notice of the Bavarian State Government - Bavarian State Promulgation of 2. December 2008 (Az.: B II 2-480-30). These institutions are provided with information about the archiving process in advance.
Is it possible to exclude copyright protected contents such as pictures, videos, audios on my website from web archiving?
In principle the Bavarian State Library aims for the complete archiving of a website, i.e. generally no contents are excluded. If it is necessary in a particular case to leave out specific contents or files from archiving, e.g. for copyright reasons, please contact us at webarchivierung(at)bsb-muenchen.
Whom can I contact with further questions?
Please send us a short email to webarchivierung(at)bsb-muenchen.de. An employee of the Bavarian State Library will contact you as soon as possible.
July 18, 2014
V. 4.2.1 en