Data collection and curation

Provided by:
European Language Resource Coordination (ELRC)


Description

The European Commission has launched a comprehensive European Language Resource Coordination (ELRC) effort to identify and gather language and translation data relevant to national public services, administrations and governmental institutions across all 30 European countries participating in the CEF programme. These resources are needed in order to improve the quality and the coverage of the machine translation engines in eTranslation. All data resources gathered in this ELRC initiative will therefore be used to develop a high-quality machine translation service.

Features

  • Increased language coverage: collection of resources to cover the languages from the 30 European countries participating in the CEF programme. Additional coverage for languages with currently less resources.
  • Increased domain coverage: collection of domain-specific corpora and terminology resources (e.g. lexica and dictionaries) in the fields of consumer rights, culture, legal domain, social security, health, public procurement, etc. allowing the training of domain specific engines for the eTranslation service.
  • The technical processing services guarantee that the provided language resources will lead to higher quality automated translation systems.

Geographic Coverage
Austria, Belgium, Bulgaria, Croatia, Cyprus, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Iceland, Ireland, Italy, Latvia, Lithuania, Luxembourg, Malta, Netherlands, Norway, Poland, Portugal, Romania, Slovakia, Slovenia, Spain, Sweden, United Kingdom
Language Coverage
Bulgarian, Czech, Croatian, Danish, Dutch; Flemish, English, Estonian, Finnish, French, German, Hungarian, Irish, Italian, Latvian, Lithuanian, Maltese, Modern Greek (1453-), Polish, Portuguese, Romanian; Moldavian; Moldovan, Slovak, Slovenian, Spanish; Castilian, Swedish, Norwegian, Icelandic

Target Users
  • Data owners and contributors

    Providers of language and translation data relevant to national public services, administrations and governmental institutions across all 30 European countries participating in the CEF programme in the form of large general domain corpora, whether monolingual or multilingual parallel corpora as well as domain-specific corpora and terminology resources (e.g. lexica and dictionaries) in the fields of consumer rights, culture, legal domain, social security, health, public procurement, etc.

Get Started with the service

Support

Helpdesk: http://www.lr-coordination.eu/helpdesk

Interaction Language: English

Documentation: Data collection and curation Service Offering Description

Access the service

Request - Other : info@lr-coordination.eu