Language resource #: 3330 Results 1371 - 1380 of 2023
  • C-003882: EAT-ALL
    The EAT corpus, which contains three groups of channels (PSTN, MIC16K, and GSM), was stored on three DVD discs. The PSTN and GSM corpora were stored on the same DVD disc, which is labeled “PSTN +GSM”. Because the sampling rate of the MIC16K speech data was high, its storage requirement was large. We stored the MIC16K speech on two DVD discs, labeled “Mic16K English” and “Mic16K NonEnglish”, for the English Department and the non-English Department, respectively.
  • C-003883: EAT-200
    The EAT corpus, which contains three groups of channels (PSTN, MIC16K, and GSM), was stored on one DVD disc. The PSTN and GSM corpora were stored on the same DVD disc, which is labeled “PSTN +GSM”. Because the sampling rate of the MIC16K speech data was high, its storage requirement was large. We stored the MIC16K speech on two DVD discs, labeled “Mic16K English” and “Mic16K NonEnglish”, for the English Department and the non-English Department, respectively.
  • C-003884: MATBN
    The MATBN Mandarin Chinese broadcast news corpus is a product of a joint project
    sponsored by the National Science Council, Taiwan. It contains a total of 198 one-hour
    news shows from the Public Television Service Foundation, Taiwan with corresponding
    transcripts. The primary purpose of this collection is to provide training and testing data
    for continuous speech recognition evaluation in the broadcast news domain.
  • C-003885: Affix Database
    This sub-corpus is composed of the following high-frequency initial and final morphemes retrieved from Sinica Corpus.

    * Initial Morpheme in Noun Compound : 1,135 (words with ambivalent meanings: 1,197)

    * Final Morpheme in Noun Compound : 1,427 (words with ambivalent meanings: 1,610)

    * Initial Morpheme in Verb Compound : 735 (words with ambivalent meanings: 918)

    * Final Morpheme in Verb Compound : 282 (words with ambivalent meanings: 300)

    There are 4,025 morphemes in total.

    The English meaning, POS, cilin, and examples are provided for each morpheme.

    For Verb Compound morphemes, the English meaning, morphological rules, and examples are provided for each morpheme. The number of morphological rules varies from one Verb Compound morpheme to another.
  • C-003887: The NIE Corpus of Spoken Singapore English
    The NIE Corpus of Spoken Singapore English aims to provide high-quality recordings of Singaporean speakers. The aim of the corpus is to facilitate acoustic/phonetic analysis of Singapore English. In order to eliminate background noise and thereby facilitate acoustic/phonetic measurement, all recordings were made directly onto the computer in the NIE Phonetics Laboratory.
  • C-003888: The Lim Siew Hwee Corpus of Informal Singapore Speech
    The corpus provides recordings of young Singaporeans talking informally. All recordings were made directly onto the computer in the Phonetics Laboratory at NIE in Singapore.
  • C-003889: A Corpus of Spoken PRC English
    The corpus aims to provide high-quality recordings of speakers from the People's Republic of China. All recordings were made directly onto the computer in the Phonetics Laboratory at NIE in Singapore.
  • C-003890: The Yeo (2001) Corpus of Sec 2 Compositions
    The Yeo (2001) Corpus of Sec 2 Compositions is a set of short essays written by pupils in the Express Stream of the 2nd year of a Secondary School in Singapore. Most of the pupils were between 13 and 14 years of age at the time they wrote the essays. http://videoweb.nie.edu.sg/phonetic/yeo-2001/index.htm
  • C-003891: Gyan Nidhi
    Corpus collected from various domains. A useful resource for applications such as improving translation systems, translation memory, spell checkers, dictionaries, statistical text analyzers, language-related research, writing style analysis, morphological analyzers, and CLIR.
  • C-003893: Annotated Speech Corpora-DRDO
    Annotated Speech corpora for Indian Languages