Over 32,000 medieval manuscripts transcribed in four months using AI

Medievalists can now access automated transcriptions of 32,763 digitised medieval manuscripts, produced in just four months as part of a project called CoMMA—a large-scale corpus designed to make manuscript texts searchable and analysable at a scale that would be impossible to tackle by hand.

medievalists.net/2026/01/32000

Original paper:
inria.hal.science/hal-05299220

The CoMMA website:
comma.inria.fr/homepage

First page of Chronicon Pictum, the "Illuminated Chronicle" from the court of King Louis the Great of Hungary from 1358.

Main illumination shows:

A multi-paneled scene within an elaborate architectural frame
Central figure: A crowned king (likely representing Hungarian royalty) seated on a throne
Left panel: Armed warriors or knights
Right panel: Groups of courtiers or nobles
Rich colors: deep blues, reds, golds, and ochres
Gothic architectural elements including towers and arches

Latin text in two columns
Red lettering (rubrics) for important headings and initial words
Black text for the main chronicle
Beautiful calligraphy in Gothic script

https://en.wikipedia.org/wiki/Illuminated_manuscript#/media/File:K%C3%A9pes_kr%C3%B3nika_els%C5%91_lapja.jpg
0
0
1

If you have a fediverse account, you can quote this note from your own instance. Search https://mastodon.social/users/gutenberg_org/statuses/115983651663698761 on your instance and quote it. (Note that quoting is not supported in Mastodon.)

RE: mastodon.social/@gutenberg_org

The method has nothing to do with the marketing slopWord AI. It is = which is often used in science. Better read the study.

0