Camille Voudrin: Difference between revisions

Latest revision as of 13:40, 6 May 2026

Camille Voudrin (born 1988) is a French computational linguist and epigraphist based at the Laboratoire d'Épigraphie Numérique et Linguistique Computationnelle (LENLiC) at the Institut national des humanités numériques (INHN) in Limoux, France. She studied mathematics and linguistics at the École Normale Supérieure before completing a doctorate at Paris-Diderot in 2015 on Hidden Markov Models applied to undeciphered writing systems. Her collaborators on the 2024 study, Solène Marchand and Hadrien Leclerc, are doctoral researchers at the same laboratory.

Voudrin's declared research programme is the development of "script-agnostic" methods: statistical tools that can characterise the internal structure of an unknown writing system without prior knowledge of the language, the sign inventory's size, or the script's typological classification. She has expressed particular interest in featural scripts, in which the graphic form of a sign encodes phonological properties, as a test case for whether structural methods alone can recover design logic.

Work

Voudrin, Marchand, and Leclerc published "A distributional test of vowel–consonant structure in an undeciphered signary suggests robust class separation" (Language Codes 7, March 2024: 1281–1294). The study applied a constrained two-state Hidden Markov Model to three extended inscriptions in the Kristiansen coding system, finding that approximately 90% of sign types fell at near-deterministic class probabilities. The consonant-like class comprised box, corner, and triangle sign families; the vowel-like class comprised tee, line, meander, and barred-post families. Kristiansen's visual families showed zero cross-class mixing on a chi-square test.

The paper also derived a sonority hierarchy from onset and coda cluster directionality alone, consistent with obstruent > nasal > liquid > glide ordering, and identified C05 as the dominant cluster-initial element, a profile the authors described as "consistent with a coronal fricative or sibilant, though no phonetic claim is made."

Voudrin has been careful to frame the result as orthogonal to rather than conflicting with Rubergskier's arithmetic reading. "A text can simultaneously show word-level syllabic structure and clause-level arithmetic templating," she noted in a brief response to correspondence in Language Codes. "We are describing different levels of the same object."

The paper is notable for its methodological appendix, which provides full pseudocode for the constrained Viterbi algorithm and the sonority-learning procedure, and for nine diagnostic figures including vector tracings of the three corpus inscriptions used in the study.

@@ Line 1: / Line 1: @@
-'''Camille Voudrin''' is a researcher at the Laboratoire d'Épigraphie Numérique et Linguistique Computationnelle (LENLiC), Institut national des humanités numériques (INHN), Limoux, France.
+'''Camille Voudrin''' (born 1988) is a French computational linguist and epigraphist based at the Laboratoire d'Épigraphie Numérique et Linguistique Computationnelle (LENLiC) at the Institut national des humanités numériques (INHN) in Limoux, France. She studied mathematics and linguistics at the École Normale Supérieure before completing a doctorate at Paris-Diderot in 2015 on Hidden Markov Models applied to undeciphered writing systems. Her collaborators on the 2024 study, Solène Marchand and Hadrien Leclerc, are doctoral researchers at the same laboratory.
+Voudrin's declared research programme is the development of "script-agnostic" methods: statistical tools that can characterise the internal structure of an unknown writing system without prior knowledge of the language, the sign inventory's size, or the script's typological classification. She has expressed particular interest in featural scripts, in which the graphic form of a sign encodes phonological properties, as a test case for whether structural methods alone can recover design logic.
 == Work ==
-Voudrin, together with Solène Marchand and Hadrien Leclerc, published "A distributional test of vowel–consonant structure in an undeciphered signary suggests robust class separation" (''[[Language Codes]]'' 7, March 2024: 1281–1294). The study applied a constrained two-state Hidden Markov Model to three extended inscriptions in the [[Kristiansen coding system]], seeking to determine whether the signs partition into vowel-like and consonant-like classes on distributional grounds alone—without reference to phonetic values or an external key.
+Voudrin, Marchand, and Leclerc published "A distributional test of vowel–consonant structure in an undeciphered signary suggests robust class separation" (''[[Language Codes]]'' 7, March 2024: 1281–1294). The study applied a constrained two-state Hidden Markov Model to three extended inscriptions in the [[Kristiansen coding system]], finding that approximately 90% of sign types fell at near-deterministic class probabilities. The consonant-like class comprised box, corner, and triangle sign families; the vowel-like class comprised tee, line, meander, and barred-post families. [[Jan-Tage Kristiansen|Kristiansen]]'s visual families showed zero cross-class mixing on a chi-square test.
-The analysis found that approximately 90% of sign types fell at near-deterministic probabilities (p(V) ≤ 0.05 or ≥ 0.95), yielding a clean partition. The consonant-like class comprised box, corner, and triangle sign families; the vowel-like class comprised tee, line, meander, and barred-post families. Crucially, Kristiansen's visual families showed zero cross-class mixing, confirmed by chi-square test.
+The paper also derived a sonority hierarchy from onset and coda cluster directionality alone, consistent with obstruent > nasal > liquid > glide ordering, and identified C05 as the dominant cluster-initial element, a profile the authors described as "consistent with a coronal fricative or sibilant, though no phonetic claim is made."
-The paper also derived a sonority hierarchy from onset and coda cluster directionality alone, consistent with obstruent > nasal > liquid > glide ordering, and identified the sign C05 as the dominant cluster-initial element—a profile consistent with a coronal fricative or sibilant.
+Voudrin has been careful to frame the result as orthogonal to rather than conflicting with [[Ginevra Rubergskier|Rubergskier]]'s arithmetic reading. "A text can simultaneously show word-level syllabic structure and clause-level arithmetic templating," she noted in a brief response to correspondence in ''[[Language Codes]]''. "We are describing different levels of the same object."
-The authors noted that their findings are orthogonal to rather than in conflict with [[Ginevra Rubergskier|Rubergskier]]'s arithmetic reading: a text may simultaneously exhibit word-level vowel–consonant structure and clause-level arithmetic templating.
+The paper is notable for its methodological appendix, which provides full pseudocode for the constrained Viterbi algorithm and the sonority-learning procedure, and for nine diagnostic figures including vector tracings of the three corpus inscriptions used in the study.
 == See also ==
@@ Line 15: / Line 17: @@
 * [[Jan-Tage Kristiansen]]
 * [[Ginevra Rubergskier]]
+* [[Language Codes]]
-[[Category:Kristiansen corpus]]
+[[Category:Researchers]][[Category:Epigraphy]][[Category:Kristiansen corpus]]
-[[Category:Researchers]][[Category:Epigraphy]]