Language resource #: 3330
Results 1371 - 1380 of 2023
-
C-003882: EAT-ALL
EAT corpus containing three groups of channels: PSTN, MIC16K and GSM was stored in three DVD discs. PSTN and GSM corpora were stored in the same DVD disc which is label as “PSTN +GSM”. Because the sampling rate of MIC16K speech data was high, the resulting storage requirement was huge. We stored MIC16K speech in two DVD discs labeled by “Mic16K English” and “Mic16K NonEnglish” for English Department and non-English Department, respectively.
- hasVersion: C-003883: EAT-200
-
C-003883: EAT-200
EAT corpus containing three groups of channels: PSTN, MIC16K and GSM was stored in one DVD discs. PSTN and GSM corpora were stored in the same DVD disc which is label as “PSTN +GSM”. Because the sampling rate of MIC16K speech data was high, the resulting storage requirement was huge. We stored MIC16K speech in two DVD discs labeled by “Mic16K English” and “Mic16K NonEnglish” for English Department and non-English Department, respectively.
- hasVersion: C-003882: EAT-ALL
-
C-003884: MATBN
The MATBN Mandarin Chinese broadcast news corpus is a product of a joint project
sponsored by the National Science Council, Taiwan. It contains a total of 198 one-hour
news shows from the Public Television Service Foundation, Taiwan with corresponding
transcripts. The primary purpose of this collection is to provide training and testing data
for continuous speech recognition evaluation in the broadcast news domain. -
C-003885: Affix Database
This sub-corpus is composed of the following high-frequency initial and final morphemes retrieved from Sinica Corpus.
* Initial Morpheme in Noun Compound : 1,135 (words with ambivalent meanings: 1,197)
* Final Morpoheme in Noun Compound : 1,427 (words with ambivalent meanings: 1,610)
* Initial Morpheme in Verb Compound : 735 (words with ambivalent meanings: 918 )
* Final Morpoheme in Verb Compound : 282 (words with ambivalent meanings: 300)
There are 4,025 morphemes in total.
English meaning, POS, cilin, and examples are provided in each morpheme.
For Verb Compound, its English meaning, morphological rules, and examples are provided in each morpheme. The number of morphological rules varies in Verb Compound per se.- isPartOf: C-003865: Sinica Balanced Corpus
-
C-003887: The NIE Corpus of Spoken Singapore English
The NIE Corpus of Spoken Singapore English aims to provide high-quality recordings of Singaporean speakers. The aim of the corpus is to facilitate acoustic/phonetic analysis of Singapore English. In order to eliminate background noise and thereby facilitate acoustic/phonetic measurement, all recordings were made directly onto the computer in the NIE Phonetics Laboratory.
-
C-003888: The Lim Siew Hwee Corpus of Informal Singapore Speech
The corpus provides recordings of young Singaoreans talking informally. All recordings were made directly onto the computer in the Phonetics Laboratory at NIE in Singapore.
-
C-003889: A Corpus of Spoken PRC English
The corpus aims to provide high-quality recordings of speakers from the People's Republic of China. All recordings were made directly onto the computer in the Phonetics Laboratory at NIE in Singapore.
-
C-003890: The Yeo (2001) Corpus of Sec 2 Compositions
The Yeo (2001) Corpus of Sec 2 Compositions is a set of short essays written by pupils in the Express Stream of the 2nd year of a Secondary School in Singapore. Most of the pupils were between 13 and 14 years of age at the time they wrote the essays.http://videoweb.nie.edu.sg/phonetic/yeo-2001/index.htm
-
C-003891: Gyan Nidhi
Corpus collected from various domains.Useful resource for applications such as improving translation system, translation memory, spell checkers, dictionaries, statistical text analyzer, language related research, Writing style analysis, morphological analyzer & CLIR.
-
C-003893: Annotated Speech Corpora-DRDO
Annotated Speech corpora for Indian Languages