After a very long process, my former student Clayton Marr (no realationship to Nikolay Marr) and I have published a new #histling paper in #diachronica on the use of large-scale #forward #reconstruction to discover previously neglected patterns of #sound #change (in this case, in #French).

https://benjamins.com/catalog/dia.20027.mar

Regrettably, we couldn't make it open access but I would be happy to share an offprint.
#historical #linguistics #diachronic #Latin

Large-scale computerized forward reconstruction yields new perspectives in French diachronic phonology:

Traditionally, historical phonologists have relied on tedious manual derivations to sequence the sound changes that have shaped the phonological evolution of languages. However, humans are prone to errors, and cannot track thousands of parallel derivations in any efficient manner. We demonstrate computerized forward reconstruction (CFR), deriving each etymon in parallel, as a task with metrics to optimize, and as a tool which drastically facilitates inquiry. To this end we present DiaSim, an application which simulates “cascades” of diachronic developments over a language’s lexicon and provides various diagnostics for “debugging” those cascades. We test our method on a Latin-to-French reflex prediction task, using a newly compiled, publicly available dataset FLLex consisting of 1368 paired Latin and Modern French forms. We also introduce a second dataset, FLLAPS, which maps 310 reflexes from Latin through five attested intermediate stages up to Modern French, derived from Pope’s (1934) periodic development tables. We present publicly available rule cascades: the baseline BaseCLEF and BaseCLEF* cascades, based on Pope’s (1934) widely-cited view of French development, and DiaCLEF, made from incremental corrections to BaseCLEF aided by DiaSim’s diagnostics. DiaCLEF outperforms the baselines by large margins, improving raw accuracy on FLLex from 3.2% to 84.9% of etyma, with similarly large improvements for each of FLLAPS’ periods. Changes were made to build DiaCLEF considering only the baseline and DiaSim’s diagnostics, but they often independently reproduced past work in French diachronic phonology, corroborating both our procedure and past endeavors; we discuss the implications of some of our findings in detail.

John Benjamins Publishing Catalog

Introduction with hashtags: I'm a #language researcher at #CMU's Language Technologies Institute (#LTI) operating at the intersection between #computational #linguistics and #NLP.. I'm interested in anything dealing with #sounds (#phonetics and #phonology), the structure of words (#morphology), and change in language over time (#histling).

I'm also a big advocate of functional programming and #rustlang.

Politically, I am a very #moderate #anarchist.

#introduction time! I'm @anggarrgoon/ Claire (she/they); I'm a #linguistics professor who works on language change, particularly how we can use #language in investigating Holocene human dynamics. I also work on language #revitalization with #Indigenous #Australian communities. Also interested in #gender #indigenousAI #histling