Loading…

Loading grant details…

Completed HORIZON European Commission

High Performance Language Technologies

€4.06M EUR

Funder European Commission
Recipient Organization Univerzita Karlova
Country Czech Republic
Start Date Sep 01, 2022
End Date Aug 31, 2025
Duration 1,095 days
Number of Grantees 8
Roles Participant; Coordinator; Associated Partner
Data Source European Commission
Grant ID 101070350
Grant Description

High Performance Language Technologies (HPLT) is a space combining petabytes of natural language data with large-scale model training. With trillions of words of text, the space will be the largest open text collection. Cleaning and privacy protecting services improve the quality and ethical properties of the text.

Going beyond static repositories that require the user to individually analyze each data set, the project will rate data sets by how much they improve end-to-end language models and machine translation systems.

Continuous integration of models and data will result in free downloadable high-quality models for all official European Union languages and beyond. The models will be reproducible with information and evaluation metrics shown in a publicly available dashboard.

By focusing on training at scale, the project complements the inference-focused European Language Grid, which in turn will be used for model deployment.

Datasets, models and information about them will be published in recognized FAIR data repositories, aggregation catalogues and marketplaces for easy discovery, access, replication, and exploitation.

All Grantees

Cesnet Zajmove Sdruzeni Pravnickych Osob; Helsingin Yliopisto; Prompsit Language Engineering, Sl; Universitetet I Oslo; Sigma2 As; Univerzita Karlova; Turun Yliopisto; The University of Edinburgh

Advertisement
Apply for grants with GrantFunds
Advertisement
Browse Grants on GrantFunds
Interested in applying for this grant?

Complete our application form to express your interest and we'll guide you through the process.

Apply for This Grant