Les formats d’encodage de la notation musicale à l’épreuve d’un
objectif de transcription manuscrite automatique

Scheithauer, Hugo; Romero-García, Gonzalo; Romary, Laurent; Clérice, Thibault

doi:10.63744/K2fvu4Nzsv6t

Abstract

The development of Optical Music Recognition (OMR) faces fragmentation in representation formats, divided between editorial requirements (MEI), demands for graphical rendering and semantics (MusicXML), and computational analysis (Humdrum **kern). This tension is particularly evident in handwritten sources, where musical semantics are coupled with a complex documentary materiality. Building on recent advances in ATR (Automatic Text Recognition), this article evaluates the ability of current standards to serve as ground truth. Using Claude Debussy’s piano manuscripts as a case study, we advocate a graphematic approach to transcription. We justify repurposing MusicXML toward a “structural” model capable of preserving, to some extent, the physical anchoring and non-standard phenomena of the source. This proposal aims to contribute to the stabilization of annotation practices for the creation of shareable datasets, an essential condition for the development of robust historical OMR.

Les formats d’encodage de la notation musicale à l’épreuve d’un objectif de transcription manuscrite automatique