Filter by:
English (923)
Spanish (480)
French (320)
German (319)
Estonian (305)
Finnish (235)
Swedish (209)
Portuguese (149)
Italian (146)
Russian (119)
Danish (97)
Latvian (85)
Chinese (76)
Lithuanian (69)
Icelandic (67)
Arabic (55)
Romanian (55)
Catalan (54)
Basque (46)
Polish (46)
Czech (43)
Dutch (43)
Spanish; Castilian (41)
Hungarian (39)
Japanese (38)
Bulgarian (30)
Maltese (29)
Latin (28)
Vietnamese (28)
Finland Swedish (26)
Galician (25)
Korean (25)
Norwegian (23)
Croatian (20)
Multiple languages (20)
Slovenian (20)
Dutch; Flemish (19)
Turkish (17)
Slovak (14)
Persian (12)
Thai (12)
Erzya (11)
Greek (10)
Hindi (10)
Northern Sami (9)
Moksha (8)
Pushto (8)
Swahili (8)
Faroese (7)
Macedonian (7)
Esperanto (6)
Serbian (6)
Sign Languages (6)
Tamil (6)
Karelian (5)
Ukrainian (5)
Hebrew (4)
Hill Mari (4)
Ingrian (4)
Khanty (4)
Kurdish (4)
Modern Greek (4)
Tundra Nenets (4)
Udmurt (4)
Albanian (3)
Bengali (3)
Hausa (3)
Inari Sami (3)
Irish (3)
Kildin Sami (3)
Komi Zyrian (3)
Ludian (3)
Malayalam (3)
Nepali (3)
No Language (3)
Panjabi (3)
Sami languages (3)
Urdu (3)
Uzbek (3)
Votic (3)
Võro (3)
Welsh (3)
Afrikaans (2)
Armenian (2)
Asturian (2)
Avaric (2)
Chukchi (2)
Chuvash (2)
Eastern Mari (2)
Even (2)
Evenki (2)
Gujarati (2)
Available - Restricted Use (1793)
Under Negotiation (90)
ELRA_VAR (973)
ELRA_END_USER (814)
CC - BY (379)
Proprietary (237)
Other (159)
Under Negotiation (126)
CC - BY - SA (116)
GPL (114)
CLARIN_RES (74)
CLARIN_ACA (71)
ELRA_EVALUATION (56)
CC - BY - NC - SA (45)
CLARIN_ACA - NC (44)
CC - BY - NC (35)
LGPL (25)
Apache Licence_2.0 (13)
MS Commons - BY - NC (13)
BSD - Style (8)
GFDL (7)
MS Commons - BY (7)
CC - BY - ND (5)
AGPL (3)
CLARIN_PUB (3)
CC - ZERO (2)
BSD (1)
MS - C - No Re D (1)
Commercial Use (1013)
Attribution (376)
Other (203)
No Redistribution (107)
Share Alike (98)
Evaluation Use (57)
Inform Licensor (48)
No Derivatives (35)
Redeposit (15)
Only M Smembers (4)
Nlp Applications (311)
Human Use (251)
Information Retrieval (155)
Linguistic Research (55)
Pos Tagging (34)
Text Mining (33)
Machine Translation (31)
Other (26)
Parsing (23)
Lemmatization (15)
Annotation (12)
Spell Checking (12)
Speech Recognition (10)
Lexicon Access (8)
Speech Synthesis (8)
Speech Analysis (6)
Web Services (6)
Event Extraction (5)
Text Generation (3)
Summarisation (2)
Voice Control (2)
Opinion Mining (1)
Written Language (622)
Spoken Language (125)
Voice (78)
Body Gesture (32)
Facial Expression (32)
Sign Language (20)
Other (3)
Text/xml (132)
Text/plain (61)
Plain text (56)
Other (25)
Text (15)
Xml (14)
Audio/x-wav (12)
Text / plain (12)
Text / xml (12)
Audio/wav (9)
Text/tsv (8)
Wav (8)
Audio (4)
Rdf+xml (3)
Text/turtle (3)
MS Excel (2)
WAV (2)
Plain/text (2)
Text/csv (2)
Txt (2)
20 (1)
Audio/wav (1)
HTML (1)
MS Word (1)
US- ASCII (1)
XML (1)
Application/xml (1)
Audio/ AMR (1)
Audio/mp3 (1)
Audio/mp4 (1)
Audio/mpeg (1)
Audio/mpeg3 (1)
Audio/ogg (1)
Mp3 (1)
Sgml (1)
Text/html (1)
Txt/xml (1)
TBX (144)
LMF (95)
Word Net (18)
Other (17)
TEI_P5 (14)
XCES (7)
TEI (6)
OWL (3)
RDF (3)
Gr AF (2)
MULTEXT (2)
TMX (2)
EAGLES (1)
Penn Tree Bank (1)
Time ML (1)
General (122)
Environment (53)
Labour legislation (35)
News (10)
Medicine (10)
Health (9)
Law (8)
Economics (7)
Novels (6)
Test (6)
Economy (6)
Humanities (6)
Communications (5)
Computer science (5)
Science (5)
Education (4)
Energy (4)
Finance (4)
Law_politics (4)
Movies (4)
Social questions (4)
Taxation (4)
Community law (3)
Marketing (3)
Social affairs (3)
Teaching (3)
Tourism (3)
Wood industry (3)
General (2)
Political (2)
Accounting (2)
Blog (2)
Chemistry (2)
Civil law (2)
Consumption (2)
Documentation (2)
Forum (2)
Government (2)
Informative (2)
Periodicals (2)
Religion (2)
Tariff policy (2)
Transport (2)
Unknown (2)
Accomodation (1)
Automotive (1)
Biodiversity (1)
Europarl (1)
Fiction (1)
General language (1)
Geographic (1)
IT (1)
Legal news (1)
Medical History (1)
Renewable energy (1)
Science (1)
Wikipedia (1)
Agriculture (1)
Animal product (1)
Books (1)
Budget (1)
Camera (1)
Cars (1)
Computers (1)
Construction (1)
Criminal law (1)
Defence (1)
Family (1)
Other (25)
Portugal (20)
Iceland (11)
Finland (7)
Helsinki (3)
Is (2)
Around the world (1)
Brasil (1)
Brazil (1)
Espoo, Finland (1)
Estonia (1)
Europe (1)
IS (1)
Karelia (1)
Mozambique (1)
Rantasalmi (1)
Scotland (1)
Sääminki (1)
UK (1)
Vantaa, Finland (1)
English (1)
Portuguese (1)
Pt (1)
Other (25)
1996-2011 (9)
2003 (3)
2000-2008 (2)
2001-2015 (2)
2012-2014 (2)
Early 1990s (2)
1410-1681 (1)
1540-1750 (1)
1543-1810 (1)
1543–1810 (1)
1562-1563 (1)
1564-1939 (1)
16.-19. century (1)
1620-1630 (1)
1770-1949 (1)
1770-2011 (1)
1785 (1)
1800-2000 (1)
1809-1899 (1)
1810-1940 (1)
1820-2000 (1)
1840 - 2013 (1)
1855-1871 (1)
1880-1949 (1)
1895-1909 (1)
1920-1939 (1)
1934-1935 (1)
1935–2007 (1)
1955-1968 (1)
1967-2008 (1)
1970 - 2002 (1)
1970 to 2001 (1)
1970-1974 (1)
1970-1975 (1)
1970-1989 (1)
1970-2000 (1)
1970-2001 (1)
1970-2002 (1)
1972-2013 (1)
1974-2004 (1)
1978-2000 (1)
1980-1990 (1)
1981-1990 (1)
1986 (1)
1986 - 1987 (1)
1986-1994 (1)
1987-2000 (1)
1989-1998 (1)
1989-2007 (1)
1990-2015 (1)
1993-2012 (1)
1995-2003 (1)
1996-1997 (1)
2001 (1)
2001-2005 (1)
2001-2014 (1)
2002-2003 (1)
2003-2015 (1)
2004-2005 (1)
2005, 2010 (1)
2006-2015 (1)
2006-2016 (1)
2008-2014 (1)
2011-2012 (1)
2011-2014 (1)
2013 (1)
2013- (1)
2015 (1)
771 - 1884 (1)
Years 2010-2011 (1)
Ca. 730–1710 (1)
Castilian (367)
Flemish (35)
Valencian (15)
Brazil (5)
Legalese (4)
Punjabi (3)
American English (2)
British English (2)
Mandarin Chinese (2)
Mexico (2)
Modern (1453-) (2)
Venezuela (2)
American English (1)
American Finnish (1)
American Spanish (1)
American Spanish (1)
Australian (1)
British English (1)
Costa Rica (1)
European Spanish (1)
European Spanish (1)
Finland Swedish (1)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
2620 Language Resources (Page 1 of 131)
« Previous | Next »Order by:
2006 CoNLL Shared Task - Ten Languages
0
200
- Bulgarian
- Danish
- Dutch
- German
- Japanese
- Portuguese
- Slovenian
- Spanish
- Swedish
- Turkish
ACCURAT balanced test corpus for under resourced languages
68
372
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of comparable sentences
79
438
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT corpus of Wikipedia texts
68
448
- Croatian
- English
- Estonian
- German
- Greek
- Latvian
- Lithuanian
- Romanian
- Slovenian
ACCURAT Toolkit for for Multi-Level Alignment and Information Extraction from Comparable Corpora
9
597
« Previous | Next »