National Library of France
Because there is a need to preserve this new form of communication, which is now present in all areas of knowledge and all parts of society
E-administration, digital arts, online publications, e-learning, e-business, virtual exhibitions and digital libraries, blogs and new public spaces dedicated to discussion and chat… So many activities have moved to the Web and new ones have been created. With more than 30 million Internet users and an increasing number of French Websites, France has finally entered the Information Society and Heritage institutions must face this new reality.
Because archiving the Web through legal deposit extends the historical mission of collecting French cultural heritage
Legal deposit is the legal obligation for every publisher, printer, producer, distributor, or importer of documents to deposit copies of all published materials in the mandated institutions. Originally promulgated for printed books in 1537, legal deposit has been progressively extended to all types of materials of expression and creation, including new technologies as they appeared in France. After books, engravings, music scores, photographs, posters, audiovisual and multimedia documents, the time has come to archive Websites as well.
Because Web legal deposit is a legal obligation
The French Heritage Law (Code du patrimoine) now incorporates the DADVSI law (DADVSI stands for Droit d’auteur et droits voisins dans la société de l’information - loi 2006-961) which was officially published on August 3rd, 2006. Title III (Articles L131-1 to L133-1) officially establishes legal deposit of the Web.
Anyone who produces or publishes online material in order to communicate with the public by electronic channels is under the obligation of legal deposit. The law is to be applied to all those who have some connection to the national territory – this has always been the case for other types of documents: the National Library of France collects “everything published (or imported) in France”.
Unlike what is done for other materials, the law will not involve any particular procedures for producers because the Web legal deposit will be essentially managed through automatic harvesting techniques run by the mandated institutions. The only obligation for the producer will be to give, when requested by the Library, access codes and technical information if automatic harvesting has failed. A specific deposit procedure may also be implemented at the request of the BnF in any case where the selected site architecture or data format used is not compatible with automatic harvesting.
The size of the Web is exponential: it is not possible to aim for exhaustiveness nor to undertake a manual selection of sites. To respond fully yet pragmatically to the challenges addressed by Web legal deposit, the Library has chosen to combine two complementary collecting methods:
bulk automatic harvesting of French Websites
Bulk harvesting is done by robots. In the past the BnF worked in partnership with Internet Archive (IA) to collect five annual “snapshots” of websites belonging to the French domain, beginning in 2004. Historical collections representing snapshots from 1995 to 2004 have also been acquired.
In 2010 the bulk harvesting procedures are performed by the BnF itself, with the constant aim of providing a better coverage of French domain sites at a large scale.
Focused harvesting is based on a selection of sites by subject librarians at the BnF. Focus crawls can be based around an event (French Elections in 2002, 2004 and 2007 have been covered, as well as the European elections in 2009) or be on a given theme (blogs, sustainable development, Web activism…).
The Web archives are accessible to authorized users of the BnF, in the reading rooms of the Research Library only. This restriction is the same as that which applies to all legal deposit collections. As of June 22th, 2009, the BnF offers 350 computers to consult its Web archives across all its sites, in Paris and in Avignon.
Wednesday, September 11, 2013
ContactLegal Deposit Department
For more info