Dr Matteo Pellegrini
Pronouns: he/him
About
Biography
I am a Research Fellow at the Surrey Morphology Group, where I work as the Language Data Scientist of the NILOMORPH project, funded by an ERC Synergy Grant (cPI: Matthew Baerman; PIs: Bert Remijsen, Lameen Souag). Before that, I worked as a Posdoctoral Researcher at the CIRCSE Research Centre (Catholic University of the Sacred Heart, Milan), within the LiLa project, funded by an ERC Consolidator Grant (PI: Marco Passarotti).
PhD: University of Bergamo / University of Pavia (joint PhD program, supervisor: Pierluigi Cuzzolin)
Master's and Bachelor's degrees: University of Turin (supervisor: Davide Ricca)
ResearchResearch interests
My research interests include:
- the structure of implication between word forms in morphological paradigms, mostly focusing on inflection, but also exploring interactions with derivation;
- overabundance, i.e. the availability of several variants to express the same grammatical properties for a given lexeme;
- how those aspects can be modelled with computational techniques.
To investigate such issues, I also value and enjoy the hard work necessary for the creation of large, high quality datasets, and I participate in efforts to make them openly available and interoperable with other linguistic resources, by developing and applying open standards and vocabularies and by adhering the principles of the framework of Linguistic Linked Open Data.
During my PhD, I applied information-theoretic methodology and used conditional entropy to estimate uncertainty in predicting word forms from each other in Latin verb and noun paradigms.
Within the LiLa project, I worked on the creation of PrinParLat a large lexicon of inflected forms of Latin verbs, nouns and adjectives, featuring a systematic documentation of overabundance using the theoretical notion of flexemes.
Within the NILOMORPH project, I curate the data necessary for a reconstruction of how the complex non-concatenative morphology of West Nilotic languages has developed.
Research interests
My research interests include:
- the structure of implication between word forms in morphological paradigms, mostly focusing on inflection, but also exploring interactions with derivation;
- overabundance, i.e. the availability of several variants to express the same grammatical properties for a given lexeme;
- how those aspects can be modelled with computational techniques.
To investigate such issues, I also value and enjoy the hard work necessary for the creation of large, high quality datasets, and I participate in efforts to make them openly available and interoperable with other linguistic resources, by developing and applying open standards and vocabularies and by adhering the principles of the framework of Linguistic Linked Open Data.
During my PhD, I applied information-theoretic methodology and used conditional entropy to estimate uncertainty in predicting word forms from each other in Latin verb and noun paradigms.
Within the LiLa project, I worked on the creation of PrinParLat a large lexicon of inflected forms of Latin verbs, nouns and adjectives, featuring a systematic documentation of overabundance using the theoretical notion of flexemes.
Within the NILOMORPH project, I curate the data necessary for a reconstruction of how the complex non-concatenative morphology of West Nilotic languages has developed.