Change the repository type filter
All
Repositories list
9 repositories
- Segmentation data used in multilingual alignment tasks across English, French, Spanish, and other languages. Includes raw and segmented text files for training and evaluation.
Aquilign
PublicAQUILIGN is a multilingual alignment and collation tool for 📜 medieval texts. It uses ✂️ clause-level segmentation and 🔗 contextual alignment based on BERT models, with applications in 🌍 historical linguistics, 📖 philology, and 🤖 premodern NLP.CorpusTemporis
PublicApp for medieval multilingual metadata — feeds the Segmentation Dataset and trains the Aquilign alignerMultilingual_Aegidius
PublicMultilingual alignment and collation of the De Regimine Principum in latin and vernacular