Language resource #: 3330
Results 1051 - 1060 of 2023
-
C-003260: RIKEN Spoken Dialogue Corpus (Word processing task,Japanese)
The RIKEN Spoken Dialogue Corpus consists of simulated dialogues (word processor specialist - the user), lasting up to two hours at longest, and monologues (usage guidance by word processor specialists). The corpus comes with corresponding morphologically tagged transcripts.
-
C-003261: ASJ Continuous Speech Corpus : Vols. 1 - 16 - Japanese Newspaper Article Sentences (JNAS) -
The corpus contains speech recordings and their orthographic transcriptions, with Kana or Romaji representing pronunciation of sentences, of 306 speakers reading excerpts from the Mainichi Newspaper and the ATR 503 Phonetically Balanced Sentences. All utterances and sentences are in the Japanese language.
- isReferencedBy: C-003262: S-JNAS : Large-scale Database for Speech Recognition of Elderly People
- references: ATR 503 Phonetically Balanced Sentences
- references: C-001601: CD-Mainichi Shimbun Data Collection
- references: C-001602: CD-ROM Mainichi Shimbun '92 Data Collection
- references: C-001599: CD-Mainichi Shimbun '93 Data Collection
- references: C-001603: CD-ROM Mainichi Shimbun '94 Data Collection
-
C-003262: S-JNAS : Large-scale Database for Speech Recognition of Elderly People
The S-JNAS is a collection of read speech data of Japanese elderly people reading newspaper article sentences and ATR phonemically balanced sentences from the read-out text of the JNAS (Japanese Newspaper Article Sentence).
-
C-003263: ATR Chinese Hotel Reservation Dialogue
The ATR Chinese Hotel Reservation Dialogue is the Chinese parallel speech corpus included in the multi-lingual project, which has collected multi-lingual hotel reservation dialogues, consisting of Japanese, English, and Chinese speeches.
-
C-003264: Singapore Primary School Chinese Language Text
12 volumes of manually tagged (lemma, POS, syntax) Chinese language text used in the Singapore schools, covering levels from Primary 1 to Primary 6.
-
C-003265: CSTSC-Flight Corpus
The CSTC-Flight is a domain-specific corpus containing flight enquiry and reservation domain telephone conversations, taken from the real life. All the dialogues are fully spontaneous since the coustomers are not aware of being monitored and recorded. Corresponding transcripts are included in the corpus for only about half of the speech data.
-
C-003266: CUCorpora
CUCorpora is a large scale Cantonese spoken language corpora, made up of several sub-corpora designed for different specific domain of applications. The sub-corpoora are; (1) 1800 Cantonese syllables with pitch-marking (CUSYL), (2) Multi-syllabe short phrases covering most common Cantonese syllables (CUWORD), (3) Phoneticaly-rich read Cantonese sentences (CUSENT), (4) Cantonese digit strings of length from single digit to 16 digit (CUDIGIT), and (5) Cantonese command words simulating a navigation control scenario (CUCMD). All sub-corpora are manually transcribed.
- hasPart: C-003267: CUSYL (Version 1.0)
- hasPart: C-003268: CUWORD (Version 1.0)
- hasPart: C-003269: CUSENT (Version 1.0)
- hasPart: C-003271: CUDIGIT (Version 1.0)
- hasPart: C-003272: CUCMD (Version 1.0)
- hasVersion: C-003279: CUCall
-
C-003267: CUSYL (Version 1.0)
CUSYL is a part of CUCopora, a large scale Cantonese spoken language corpora. It is a collection of 1800 Cantonese syllables with pitch-marking, covering all valid syllables as well as common lazy and colloquial pronunciations. The corpus also includes manual transcripts. The package comes with CUWORD (Version 1.0).
- isPartOf: C-003266: CUCorpora
- hasVersion: C-003268: CUWORD (Version 1.0)
- hasVersion: C-003269: CUSENT (Version 1.0)
- hasVersion: C-003271: CUDIGIT (Version 1.0)
- hasVersion: C-003272: CUCMD (Version 1.0)
- hasVersion: C-003274: CUCall Cantonese Words (Version 1.0)
-
C-003268: CUWORD (Version 1.0)
CUWORD is a part of CUCopora, a large scale Cantonese spoken language corpora. It is a collection of 2500 multi-syllabe short phrases covering most common Cantonese syllables. The corpus also includes manually verified phonemic transcription provided for each utterance. The package comes with CUSYL. (Version 1.0).
- isPartOf: C-003266: CUCorpora
- hasVersion: C-003267: CUSYL (Version 1.0)
- hasVersion: C-003269: CUSENT (Version 1.0)
- hasVersion: C-003271: CUDIGIT (Version 1.0)
- hasVersion: C-003272: CUCMD (Version 1.0)
- hasVersion: C-003274: CUCall Cantonese Words (Version 1.0)
-
C-003269: CUSENT (Version 1.0)
CUSENT is a part of CUCopora, a large scale Cantonese spoken language corpora. It is a large collection of spoken Cantonese sentences designed to be phonetically rich. The corpus also includes manually verified phonemic transcription.
- isPartOf: C-003266: CUCorpora
- hasVersion: C-003267: CUSYL (Version 1.0)
- hasVersion: C-003268: CUWORD (Version 1.0)
- hasVersion: C-003271: CUDIGIT (Version 1.0)
- hasVersion: C-003272: CUCMD (Version 1.0)
- hasVersion: C-003273: CUCall Cantonese Sentences (Version 1.0)
- isReferencedBy: C-003273: CUCall Cantonese Sentences (Version 1.0)