Anthology of Computers and the Humanities · Volume 4

Structuration, exploration et valorisation d’archives archéologiques par l’intelligence artificielle au sein d’un lac de données

Rajae El-Idrissi1 , Josefina Simon Reig2 , Laura Romero2 , Juba Agoun1 , Jean-Pierre Girard3 , Gabriel de Prado2 , Jérôme Darmont1 and Sabine Loudcher1

  • 1 Université Lumière Lyon 2, ERIC, Lyon, France
  • 2 Musée d’Archéologie de Catalogne, Site d’Ullastret, Espagne
  • 3 Université Lumière Lyon 2, Archéorient, Lyon, France

Permanent Link: https://doi.org/10.63744/B2JRjY8IpI43

Published: 21 May 2025

Keywords: archaeology, data science, data lake, metadata, excavation diaries

Mots clés : archéologie, science des données, lac de données, métadonnées, carnets de fouille

Abstract

The DataLAC project focus on the use of artificial intelligence for the alignment, annotation and interpretation of heterogeneous documents enriched with semantic metadata and aggregated within a data lake. This project seeks to digitize, unify and make accessible over thirty years of field notes (1947–1977), scientific publications, and archives related to the Iberian archaeological site of Ullastret.These diverse materials are integrated into an interoperable data lake designed to manage the heterogeneity of sources and to support complex research queries. The DataLAC project thus constitutes a generalizable proof of concept, demonstrating the potential of data lake architectures and AI-driven methodologies for the exploitation and interpretation of archaeological archives.