Some data from the EU DGT (all languages)
Some people are apparently not aware that the ZIP files with EU legislation, etc., which are downloadable from the European Commission site, contain individual TMX files of the documents with names corresponding to the CELEX numbers. That is rather useful to know and can avoid a lot of wasted time on alignments.
Each TMX file has all the translation languages maintained by the DGT. It contains segments from the Acquis Communautaire, the body of European legislation, comprising all the treaties, regulations and directives adopted by the European Union (EU). Since each new country joining the EU is required to accept the whole Acquis Communautaire, this body of legislation has been translated into 24 official languages.
When a TMX file from this collection is imported to memoQ, the data are extracted in bilingual form only. So if the file is imported to a Portuguese-Spanish TM or LiveDocs corpus, only the PT-ES pairs will be imported.


0 comments