Innovative Approaches of Historical Newspapers: Data Mining, Data Visualization, Semantic Enrichment - BnF - Bibliothèque nationale de France Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

Innovative Approaches of Historical Newspapers: Data Mining, Data Visualization, Semantic Enrichment

Approches innovantes pour la presse ancienne: fouille de données, visualisation de données, enrichissements sémantiques

Résumé

In this age of Big Data this paper describes how digital librairies can apply at large scale innovative approaches to better valorize and bring better experiences of old newspapers. On the first hand, the state-of-the-art OLR (optical layout recognition) technique in one of the largest heritage press digitization projects in Europe (Europeana Newspapers, www.europeana-newspapers.eu, 2012-2015) was used in a data mining experiment. Data analysis was applied to quantitative metadata derived from a 850K pages subset of six XIX th-XX th c. French newspaper titles from the BnF collection. The METS/ALTO XML data was analyzed with data mining and data visualization techniques that show promising ways for the production of knowledge about historical newspapers that are of great interest for library professionals (digitization programs management, curation and mediation of newspaper collections) and for end-users, particularly the digital humanities community. On the other hand, the Retronews web portal showcases how advanced semantic annotation techniques can improve the retrieval efficiency on a digital newspapers collection; thus the rediscovery and reappropriation of these documents by various types of users: teachers, students, researchers, general public.
Fichier principal
Vignette du fichier
000-moreux-V3.pdf (1.97 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01389455 , version 1 (28-10-2016)

Identifiants

  • HAL Id : hal-01389455 , version 1

Citer

Jean-Philippe Moreux. Innovative Approaches of Historical Newspapers: Data Mining, Data Visualization, Semantic Enrichment : Facilitating Access for various Profiles of Users. IFLA News Media Section, Lexington, August 2016, At Lexington, USA, IFLA, Aug 2016, Lexington, United States. ⟨hal-01389455⟩

Collections

BNF
423 Consultations
726 Téléchargements

Partager

Gmail Facebook X LinkedIn More