Stylo visualisations of Middle English documents

Research output: Contribution to journalArticleScientificpeer-review


Automated approaches to identifying authorship of a text have become commonplace in the stylometric studies. The current article applies an unsupervised stylometric approach on Middle English documents using the script Stylo in R, in an attempt to distinguish between texts from different dialectal areas. The approach is based on the distribution of character 3-grams generated from the texts of the corpus of Middle English Local Documents (MELD). The article adopts the middle ground in the study of Middle English spelling variation, between the concept of relational linguistic space and the real linguistic continuum of medieval England. Stylo can distinguish between Middle English dialects by using the less frequent character 3-grams.
Original languageEnglish
Peer-reviewed scientific journalJournal of Data Mining & Digital Humanities
VolumeSpecial issue on Visualisations in Historical Linguistics
Pages (from-to)1-10
Number of pages10
Publication statusPublished - 23.12.2020
MoE publication typeA1 Journal article - refereed


  • 612,1 Languages
  • Middle English
  • non-standard spelling
  • historical dialectology
  • diatopical variation
  • unattended analysis
  • stylometry
  • authorship attribution
  • R


Dive into the research topics of 'Stylo visualisations of Middle English documents'. Together they form a unique fingerprint.

Cite this