GoURMET | Global Under-Resourced MEdia Translation

Summary
Machine translation (MT) is an increasingly important technology for supporting communication in a globalised world. Use of MT technology has gradually increased over the last ten years, but recent advances in neural machine translation (NMT) have attracted significant interest in industry and have led to very rapid adoption of the new paradigm (e.g. Google, Facebook, UN, World Intellectual Property Organization). Although these models have shown significant advances in state-of-the-art performance, they are data-intensive and require parallel corpora of many millions of human-translated sentences for training. Neural machine translation is currently not able to deliver usable translations for the vast majority of language pairs in the world. This is especially problematic for our user partners, the BBC and DW, who need access to fast and accurate translation for languages with very few resources.

The aim of GoURMET is to significantly improve the robustness and applicability of neural machine translation for low-resource language pairs and domains.

GoURMET has five objectives:
- Development of high-quality machine translation for under-resourced language pairs and domains;
- Adaptability to new and emerging languages and domains;
- Development of tools for analysts and journalists;
- A sustainable, maintainable platform and services;
- Dissemination and communication of project results to stakeholders and user groups.

The project will focus on two use cases:
- Global content creation - managing content creation in several languages efficiently by providing machine translations for correction by humans;
- Media monitoring for low-resource language pairs - tools to address the challenge of international news monitoring.

The outputs of the project will be field-tested at partners BBC and DW, and the platform will be further validated through innovation intensives such as the BBC NewsHack.
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/825299
Start date: 01-01-2019
End date: 30-06-2022
Total budget: 2 906 098,00 Euro
Public funding: 2 906 098,00 Euro
Cordis data

Status

CLOSED

Call topic

ICT-29-2018

Update Date

27-10-2022
Geographical location(s)
Structured mapping
EU-Programme-Call
Horizon 2020
H2020-EU.2. INDUSTRIAL LEADERSHIP
H2020-EU.2.1. INDUSTRIAL LEADERSHIP - Leadership in enabling and industrial technologies
H2020-EU.2.1.1. INDUSTRIAL LEADERSHIP - Leadership in enabling and industrial technologies - Information and Communication Technologies (ICT)
H2020-EU.2.1.1.0. INDUSTRIAL LEADERSHIP - ICT - Cross-cutting calls
H2020-ICT-2018-2
ICT-29-2018 A multilingual Next Generation Internet