Source

www.statmt.org

1 dataset

http://www.statmt.org/europarl/

Source for dataset “Statistical Machine Translation – Europarl Parallel Corpus” as listed by CKAN.

  • Statistical Machine Translation - Europarl Parallel Corpus

    Offsite — About Overview: > The Europarl parallel corpus is extracted from the proceedings of the European Parliament. It includes versions in 11 European languages: Romanic (French, Italian, Spanish, Portuguese), Germanic (English, Dutch, German, Danish, Swedish), Greek and Finnish. > The goal of the extraction and processing was to generate sentence aligned text for ...