20 Language Resources
ACCURAT balanced test corpus for under resourced languages
69
372
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
79
438
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
68
448
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
Bilingual term pairs extracted from comparable Web resources using the TaaS Bilingual Term Extraction System
0
420
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Italian
- Latvian
- Lithuanian
- Polish
- Portuguese
- Romanian
- Russian
- Slovak
- Slovenian
- Spanish
- Swedish
Bilingual term pairs extracted from Wikipedia using the TaaS Bilingual Term Extraction System
0
172
- Bulgarian
- Croatian
- Danish
- English
- Estonian
- Greek, Modern (1453-)
- Irish
- Latvian
- Lithuanian
- Maltese
- Romanian
- Slovak
- Slovenian
Collins Multilingual database (MLD) - PhraseBank
0
103
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hindi
- Italian
- Japanese
- Korean
- Norwegian
- Persian
- Polish
- Portuguese
- Russian
- Spanish
- Swedish
- Thai
- Turkish
- Vietnamese
Collins Multilingual database (MLD) – PhraseBank with audio files
0
86
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Finnish
- French
- German
- Greek
- Hindi
- Italian
- Japanese
- Korean
- Norwegian
- Persian
- Polish
- Portuguese
- Russian
- Spanish
- Swedish
- Thai
- Turkish
- Vietnamese
Collins Multilingual database (MLD) – WordBank with audio files
0
84
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Finnish
- French
- German
- Greek
- Italian
- Japanese
- Korean
- Norwegian
- Polish
- Portuguese
- Russian
- Spanish
- Swedish
- Thai
- Turkish
- Vietnamese
eSpeak
0
111
- Afrikaans
- Albanian
- Armenian
- Catalan
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Esperanto
- Estonian
- Finnish
- French
- Georgian
- German
- Greek, Modern (1453-)
- Hindi
- Hungarian
- Icelandic
- Indonesian
- Italian
- Kannada
- Kurdish
- Latvian
- Lojban
- Macedonian
- Malayalam
- Mandarin
- Norwegian
- Polish
- Portuguese
- Romanian
- Russian
- Serbian
- Slovak
- Spanish; Castilian
- Swahili
- Swedish
- Tamil
- Turkish
- Vietnamese
- Welsh
EuroTermBank
0
199
- Basque
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- German
- Greek
- Hungarian
- Italian
- Latin
- Latvian
- Lithuanian
- Maltese
- Norwegian
- Polish
- Portuguese
- Romanian
- Russian
- Slovak
- Slovenian
- Spanish
- Swedish
EUROVOC thesaurus (v4.2)
0
159
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- German
- Greek
- Hungarian
- Italian
- Latvian
- Lithuanian
- Maltese
- Polish
- Portuguese
- Romanian
- Slovak
- Slovenian
- Spanish
- Swedish
GlobalPhone 2000 Speaker Package
0
61
- Arabic
- Bulgarian
- Chinese
- Croatian
- Czech
- French
- German
- Hausa
- Japanese
- Korean
- Polish
- Portuguese
- Russian
- Spanish
- Swahili
- Swedish
- Tamil
- Thai
- Turkish
- Ukrainian
- Vietnamese
LEXACC - Lucene-based parallel phrase EXtractor from Comparable Corpora
0
17
- Croatian
- English
- German
- Greek, Modern (1453-)
- Latvian
- Lithuanian
- Romanian
- Slovenian
- Spanish; Castilian
Microsoft Terminology Collection
0
194
- Basque
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- German
- Greek
- Hungarian
- Italian
- Latvian
- Lithuanian
- Maltese
- Polish
- Portuguese
- Romanian
- Russian
- Slovak
- Slovenian
- Spanish
- Swedish
Tilde MODEL - Multilingual Open Data for EU Languages
65
226
- Croatian
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Greek, Modern (1453-)
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Norwegian
- Polish
- Portuguese
- Romanian
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish