Speech-to-Text is a speech recognition service that converts audio or video files to text. It currently supports English. It is offered as part of the CEF eTranslation platform and is available to eligible eTranslation users: EU institutions, public administrations in CEF-affiliated countries, and European SMEs.

Language Coverage
English (Latin)

Target Users
  • Public administrations

    National public administrations in the EU Member States, Iceland and Norway

  • EU staff members
  • European Small and Medium-sized Enterprises
  • Universities

    Language faculties in all EU countries, Iceland and Norway

Get Started with the service

Free of charge



Interaction Language: English (Latin)

Access the service
Web Service
Input / Output
Media Type
Data format (other): .aac, .aiff, .flac, .m4a, .mp3, .oga, .ogg, .opus, .wav, .weba, .wma, .avi, .mov, .mp4, .mpg, .ogv, .webm, .wmv
Languages: English (input), English (output)
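
Before submitting a file, it can help to check locally that its extension is among the data formats listed above. The following is a minimal sketch in Python, not part of the service itself: the helper name `is_supported_media` is hypothetical, the check looks only at the file name, and the service may apply further limits (for example on size or duration) that are not described here.

```python
from pathlib import Path

# Extensions copied from the "Data format" row above.
SUPPORTED_EXTENSIONS = frozenset({
    # audio
    ".aac", ".aiff", ".flac", ".m4a", ".mp3", ".oga", ".ogg", ".opus",
    ".wav", ".weba", ".wma",
    # video
    ".avi", ".mov", ".mp4", ".mpg", ".ogv", ".webm", ".wmv",
})


def is_supported_media(filename: str) -> bool:
    """Hypothetical helper: True if the file extension is in the listed formats.

    The comparison is case-insensitive and inspects only the final suffix.
    """
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS
```

For example, `is_supported_media("interview.MP3")` evaluates to `True`, while `is_supported_media("notes.txt")` evaluates to `False`.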