Language Resource Search - SHACHI: Language Resource Metadata Database

Language resource #: 3330 Results 2011 - 2020 of 2023

C-005069: CHiME2 WSJ0
*Introduction*

CHiME2 WSJ0 was developed as part of The 2nd CHiME Speech Separation and Recognition Challenge and contains approximately 166 hours of English speech from a noisy living room environment. The CHiME Challenges focus on distant-microphone automatic speech recognition (ASR) in real-world environments.

CHiME2 WSJ0 reflects the medium vocabulary track of the CHiME2 Challenge. The target utterances were taken from CSR-I (WSJ0) Complete (LDC93S6A), specifically, the 5,000 word subset of read speech from Wall Street Journal news text.

*Data*

Data is divided into training, development and test sets. All data is provided as 16 bit WAV files sampled at 16 kHz. The noisy utterances are in isolated form and in embedded form. The latter involves five seconds of background noise before and after the utterance. Seven hours of noise background not part of the training set are also included.

Also included are baseline scoring, decoding and retraining tools based on Cambridge University' s tool, HTK (the Hidden Markov Toolkit) and related recipes. These tools include three baseline speaker-independent recognition systems trained on clean, reverberated and noisy data, respectively, and a number of scripts.
- references: C-000677: CSR-I (WSJ0) Complete
- hasVersion: C-005008: CHiME2 Grid
C-005070: LibriSpeech ASR corpus
LibriSpeech is a corpus of approximately 1000 hours of 16kHz read English speech, prepared by Vassil Panayotov with the assistance of Daniel Povey. The data is derived from read audiobooks from the LibriVox project, and has been carefully segmented and aligned.
C-005073: The AMI Corpus
The AMI Meeting Corpus consists of 100 hours of meeting recordings. The recordings use a range of signals synchronized to a common timeline. These include close-talking and far-field microphones, individual and room-view video cameras, and output from a slide projector and an electronic whiteboard. During the meetings, the participants also have unsynchronized pens available to them that record what is written. The meetings were recorded in English using three different rooms with different acoustic properties, and include mostly non-native speakers.
C-005076: Quoted Speech Attribution Corpus
This corpus collects over 3,000 instances of quoted speech from 6 works of 19th and 20th century literature, along with annotations for the speaker (if any) of each quote among the character names and nominals present in the text. Related publication: Elson and McKeown, Automatic Attribution of Quoted Speech in Literary Narrative, AAAI 2010. This material is based on research supported in part by the U.S. National Science Foundation (NSF) under IIS-0935360. Any opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NSF.
C-005077: The Kiel Corpus of Read Speech Vol. I
The Kiel Corpus is a growing collection of read and spontaneous German which has been collected and labelled segmentally at the ipds since 1990. At present the Kiel Corpus available on CD-ROM comprises over four hours of labelled read speech on The Kiel Corpus of Read Speech Vol. I as well as four hours of labelled spontaneous speech on The Kiel Corpus of Spontaneous Speech Vol. I, Vol. II and Vol. III.
- hasVersion: C-005078: The Kiel Corpus of Spontaneous Speech Vol. I
- hasVersion: C-005079: The Kiel Corpus of Spontaneous Speech Vol. II
- hasVersion: C-005080: The Kiel Corpus of Spontaneous Speech Vol. III
C-005078: The Kiel Corpus of Spontaneous Speech Vol. I
The Kiel Corpus is a growing collection of read and spontaneous German which has been collected and labelled segmentally at the ipds since 1990. At present the Kiel Corpus available on CD-ROM comprises over four hours of labelled read speech on The Kiel Corpus of Read Speech Vol. I as well as four hours of labelled spontaneous speech on The Kiel Corpus of Spontaneous Speech Vol. I, Vol. II and Vol. III.
- hasVersion: C-005077: The Kiel Corpus of Read Speech Vol. I
- hasVersion: C-005079: The Kiel Corpus of Spontaneous Speech Vol. II
- hasVersion: C-005080: The Kiel Corpus of Spontaneous Speech Vol. III
C-005079: The Kiel Corpus of Spontaneous Speech Vol. II
The Kiel Corpus is a growing collection of read and spontaneous German which has been collected and labelled segmentally at the ipds since 1990. At present the Kiel Corpus available on CD-ROM comprises over four hours of labelled read speech on The Kiel Corpus of Read Speech Vol. I as well as four hours of labelled spontaneous speech on The Kiel Corpus of Spontaneous Speech Vol. I, Vol. II and Vol. III.
- hasVersion: C-005077: The Kiel Corpus of Read Speech Vol. I
- hasVersion: C-005078: The Kiel Corpus of Spontaneous Speech Vol. I
- hasVersion: C-005080: The Kiel Corpus of Spontaneous Speech Vol. III
C-005080: The Kiel Corpus of Spontaneous Speech Vol. III
The Kiel Corpus is a growing collection of read and spontaneous German which has been collected and labelled segmentally at the ipds since 1990. At present the Kiel Corpus available on CD-ROM comprises over four hours of labelled read speech on The Kiel Corpus of Read Speech Vol. I as well as four hours of labelled spontaneous speech on The Kiel Corpus of Spontaneous Speech Vol. I, Vol. II and Vol. III.
- hasVersion: C-005077: The Kiel Corpus of Read Speech Vol. I
- hasVersion: C-005078: The Kiel Corpus of Spontaneous Speech Vol. I
- hasVersion: C-005079: The Kiel Corpus of Spontaneous Speech Vol. II
C-005081: Japanese Isolated Word Database Read by Children
The database contains read speech of 24 Japanese children.
Each speaker read 653 isolated words listed in Kyoto University's database of phonetically balanced sentences.
C-005082: Persian Speech Corpus
This about 2.5-hour Single-Speaker Speech corpus has been developed using the same methodologies used in the PhD work carried out by Nawar Halabi at the University of Southampton. The corpus was recorded in Persian (Tehrani accent) by one male speaker using a professional studio, through a "Blubbery" model microphone of "Blue" brand with "Presonus Studio Channel” as preamp and compressor. It has been recorded by "Reaper" software, and some plugins for enhancing his voice. Synthesized speech as an output using this corpus has produced a high quality, natural voice.

This package includes:
- 399 .wav files containing spoken utterances.
- 399 .lab files containing phonetic utterances.
- 399 .TextGrid files containing the phoneme labels with time stamps of the boundaries where these occur in the .wav files. These files can be opened using Praat software (see http://www.fon.hum.uva.nl/praat).
- aligned.mlf which contains the HTS friendly alignments.
- orthographic transcriptions are gathered in one single text file (orthographic-transcript.txt) which has the form "[wav_filename]" "[Orthographic Transcript]" in every line.

Persian Speech Corpus by Nawar Halabi is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

SHACHI - Language Resource Metadata Database