|
The New York Times Annotated Corpus
|
1 |
2022-07-01 |
3.23GB |
3,110 | 15+ |
0 |
|
BOLT Egyptian Arabic SMS/Chat and Transliteration - LDC2017T07
|
1 |
2023-05-06 |
8.91MB |
203 | 10+ |
0 |
|
Abstract Meaning Representation AMR Annotation Release 3.0 LDC2017T10
|
1 |
2022-07-11 |
38.82MB |
179 | 8 |
0 |
|
BOLT Egyptian Arabic Treebank - Discussion Forum - LDC2018T23
|
1 |
2023-05-06 |
60.02MB |
67 | 8+ |
0 |
|
Penn Treebank II 2 - LDC95T7
|
1 |
2022-08-14 |
134.99MB |
914 | 7+ |
0 |
|
OntoNotes 5.0 Annotated Text Corpus LDC2013T19
|
1 |
2022-07-02 |
839.11MB |
456 | 6+ |
0 |
|
TIMIT Acoustic-Phonetic Continuous Speech Corpus - LDC93S1
|
1 |
2023-05-06 |
385.20MB |
128 | 5+ |
0 |
|
BOLT Egyptian Arabic SMS/Chat Parallel Training Data - LDC2021T15
|
1 |
2023-05-06 |
10.21MB |
63 | 5+ |
0 |
|
BOLT Chinese SMS/Chat Parallel Training Data - LDC2021T11
|
1 |
2023-05-06 |
14.43MB |
27 | 5+ |
0 |
|
BOLT Chinese SMS/Chat - LDC2018T15
|
1 |
2023-05-06 |
9.83MB |
77 | 5+ |
0 |
|
DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus LDC93S1
|
1 |
2022-07-02 |
385.52MB |
2,727 | 4+ |
0 |
|
BOLT Egyptian Arabic Treebank Conversational Telephone Speech NLP LDC2021T12
|
1 |
2022-07-11 |
19.42MB |
226 | 4 |
0 |
|
French Gigaword 3rd edition LDC2011T10
|
1 |
2022-07-12 |
2.08GB |
1,944 | 4+ |
0 |
|
Penn Treebank 1 - ACL / DCI - LDC93T1
|
1 |
2022-08-15 |
139.95MB |
291 | 4+ |
0 |
|
TAC KBP Entity Discovery and Linking - Comprehensive Training and Evaluation Data LDC2019T02
|
1 |
2022-07-05 |
20.36GB |
100 | 3 |
0 |
|
English Gigaword 5th edition LDC2011T07
|
1 |
2022-07-12 |
9.76GB |
872 | 3+ |
0 |
|
Chinese Gigaword 5th edition LDC2011T13
|
1 |
2022-07-12 |
4.39GB |
113 | 3+ |
0 |
|
TAC KBP Comprehensive English Source Corpora LDC2018T03
|
1 |
2022-07-12 |
6.80GB |
99 | 3 |
0 |
|
LORELEI Tigrinya Language Pack NLP LDC2020T22
|
1 |
2022-07-11 |
122.79MB |
54 | 2 |
0 |
|
Spanish Gigaword 3rd edition LDC2011T12
|
1 |
2022-07-12 |
2.78GB |
104 | 2+ |
0 |
|
Penn Treebank Revised: English News Text Treebank LDC2015T13
|
1 |
2022-08-14 |
6.86MB |
117 | 2+ |
0 |
|
CCGBank: CCG Combinatory Categorical Grammar for Penn Treebank 2 - LDC2005T13
|
1 |
2022-08-14 |
27.90MB |
81 | 2+ |
0 |
|
Penn Treebank III 3 LDC99T42
|
1 |
2022-08-15 |
29.83MB |
144 | 2+ |
0 |
|
RST Discourse Treebank LDC2002T07
|
1 |
2022-08-15 |
2.71MB |
811 | 1+ |
0 |