Language Resource Search - SHACHI: Language Resource Metadata Database

Language resource #: 3330 Results 1041 - 1050 of 2023

Select items

description_language
language_area
language
type
subject_monoMultilingual
subject_resourceSubject
type_style
type_form
type_sentence
type_linguisticType
type_discourseType
type_purpose
subject_linguisticField
contributor_author_level
contributor_speaker_level
contributor_author_motherTongue
contributor_speaker_motherTongue
contributor_author_dialect
contributor_speaker_dialect
contributor_author_age
contributor_speaker_age
contributor_author_gender
contributor_speaker_gender
type_annotation

C-003250: RWCP-SP99 News Speech Database for Information Retrieval and Speech Summarization Research
The RWCP News Speech Database contains professionally read broadcast news data. The news scripts for reading were also written by a professional broadcast reporter based on the actual event. Each announcer read a total of 40 stories, totaling about an hour, and then Set A (50 sentences) of ATR phonetically balanced sentences.
C-003251: RWCP-SP01 Meeting Speech Corpus
The RWCP Meeting Speech Corpus contains speech data of simulated meetings by three or more participants. The subjects of meeting includes; package tour plan, making a website of a travel agent, mail magazine planning of a travel agent, and research on people's opinion about travel.
C-003252: RWCP Real Environment Speech and Acoustic Database
The database contains; "RWCP Sound Scene Database in Real Acoustical Environments" (Vol.1), speech data of fixed sound sources measured be microphone-array (Vol.2), and speech data of moving sound sources measured by microphone-array (Vol.3).
- references: C-001303: TIMIT Acoustic-Phonetic Continuous Speech Corpus
C-003253: CIAIR Children Voice Speech Corpus
CIAIR-VCV is a collection of 288 elementary school students' voices (words and sentences) inside a room in a normal living environment.
C-003254: CENSREC-1 (AURORA-2-J) Noisy Speech Recognition Evaluation Environments
CENSREC-1(AURORA-2J) is a noisy digit speech database and a Japanized version of the AURORA-2 database. "CENSREC" stands for "Corpus and Environments for Noisy Speech RECognition. The CENSREC-1 database was created in exactly the same way as the AURORA-2 database, but it was uttered in Japanese, like "ichi, ni, san," for "one, two, three" in AURORA-2. The number of speakers is the same, and the digit strings for each speaker are identical as the AURORA-2, too.
- conformsTo: C-001326: AURORA Project Database 2.0 - Evaluation Package
- references: C-001326: AURORA Project Database 2.0 - Evaluation Package
- isReferencedBy: C-003255: CENSREC-1-C Noisy Speech Detection Evaluation Environments
- isReferencedBy: C-003256: CENSREC-2 In-car Spoken Digits Data and Environments for Noisy Speech Recognition
C-003255: CENSREC-1-C Noisy Speech Detection Evaluation Environments
CENSREC-1-C is a speech database for the evaluation of voice activity detection in several noise environment. The simulated speech data of CENSREC-1-C are constructed by concatenating several utterances spoken by one speaker. The vocabulary of simulated data included in the CENSREC-1-C consist of eleven Japanese digits ("ichi," "ni," "san," ...), which are the same as in CENSREC-1(AURORA-2J). The noise environments for simulated data are the same as CENSREC-1, too.
C-003256: CENSREC-2 In-car Spoken Digits Data and Environments for Noisy Speech Recognition
CENSREC-2 is a database for the evaluation of continuous digit recognition in real car driving environments. The digit sequence of each utterance and the pronunciation of Japanese digits in this database are the same as the CENSREC-1(AURORA-2J) database. The data were recorded under 11 environmental conditions using combinations of three kinds of vehicle speeds and four kinds of in-car environments.
C-003257: CENSREC-3 In-car Isolated Words Data and Environments for Noisy Speech Recognition
CENSREC-3 is an in-car speech database for the evaluation of isolated word recognition in real driving car environments. The data was recorded under 16 environmental conditions using combinations of three kinds of vehicle speeds and six kinds of in-car environments . For training data, driver's speech of phonetically-balanced sentences was recorded. 50 words were recorded as testing data basically for each person in each environment.
- hasVersion: C-003256: CENSREC-2 In-car Spoken Digits Data and Environments for Noisy Speech Recognition
- hasVersion: C-003255: CENSREC-1-C Noisy Speech Detection Evaluation Environments
- conformsTo: AURORA Project Database 3.0
C-003258: UME-ERJ English Speech Database Read by Japanese Students
The UME-ERJ corpus was made in support of the Priority Areas Project on "Advanced Utilization of Multimedia to Promote Higher Education Reform" during 2000-2002. It is an English speech database read by Japanese undergraduate and graduate students. Grading lists by native English teachers are also included in the corpus.
- hasVersion: C-003259: UME-JRF Japanese Speech Database Read by Foreign Students
C-003259: UME-JRF Japanese Speech Database Read by Foreign Students
The UME-JRF corpus was made in support of the Priority Areas Project on "Advanced Utilization of Multimedia to Promote Higher Education Reform" during 2000-2002. It is a Japanese speech database read by foreign undergraduate and graduate students studying in Japan. Grading lists by native Japanese teachers are also included in the corpus.
- hasVersion: C-003258: UME-ERJ English Speech Database Read by Japanese Students

SHACHI - Language Resource Metadata Database