-
C-000041: Mandarin Chinese Speech Recognition Corpus (desktop) - single Chinese sentence (200 people)
Desktop/Microphone
This corpus comprises 8,000 Chinese sentences uttered by 200 speakers of different dialects, ages and educational levels, recorded over 2 channels. Speech samples are stored as 16-bit 44.1 kHz WAV files, with 12.21 hours of speech per channel. The total size of the data is 7.2 GB.
Each speaker read 40 items. Text files are stored in Unicode format. All data have been proofread manually.
The corpus is intended for testing and for telephone natural speech recognition systems.
- hasVersion: C-000043: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - Chinese single sentence (100 people)
- hasVersion: C-000045: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - person name (100 people)
- hasVersion: C-000046: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - place name (100 people)
- hasVersion: C-000044: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - digit string (100 people)
- hasVersion: C-000042: Chinese dialect Mandarin Speech Recognition Corpus (desktop)- person name (200 people)
- hasVersion: C-001386: Chinese dialect Mandarin Speech Recognition Corpus (desktop) - place name (200 people)
- hasVersion: C-001385: Chinese dialect Mandarin Speech Recognition Corpus (desktop) - digit string (200 people)
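The figures quoted for C-000041 can be cross-checked with a little arithmetic. The sketch below is only an estimate under stated assumptions: it takes "12.21 hours of speech per channel" to mean two channels of 12.21 hours each, ignores WAV header overhead, and reads the listed size as binary gigabytes. The function name `pcm_size_bytes` is hypothetical and not part of the corpus documentation.

```python
def pcm_size_bytes(hours, sample_rate_hz, bits_per_sample, channels=1):
    """Approximate size in bytes of uncompressed linear PCM audio (headers ignored)."""
    seconds = hours * 3600
    bytes_per_sample = bits_per_sample // 8
    return seconds * sample_rate_hz * bytes_per_sample * channels


GIB = 2 ** 30  # assumption: the catalogue's size figure is in binary gigabytes

# C-000041: 16-bit, 44.1 kHz, 12.21 hours per channel, 2 channels
size = pcm_size_bytes(hours=12.21, sample_rate_hz=44_100, bits_per_sample=16, channels=2)
print(f"{size / GIB:.2f} GiB")  # prints "7.22 GiB"
```

Under these assumptions the estimate agrees with the listed 7.2 GB, which suggests the size refers to bytes rather than bits.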
-
C-000042: Mandarin Chinese Speech Recognition Corpus (desktop)- person name (200 people)
Desktop/Microphone
This corpus comprises 8,000 person names uttered by 200 speakers of different dialects, ages and educational levels, recorded over 2 channels. Speech samples are stored as 16-bit 44.1 kHz WAV files, with 10 hours of speech per channel. The total size of the data is 5.92 GB.
Each speaker read 40 items. Text files are stored in Unicode format. All data have been proofread manually.
The corpus is intended for testing and for telephone natural speech recognition systems.
- hasVersion: C-000041: Chinese dialect Mandarin Speech Recognition Corpus (desktop) - single Chinese sentence (200 people)
- hasVersion: C-001386: Chinese dialect Mandarin Speech Recognition Corpus (desktop) - place name (200 people)
- hasVersion: C-001385: Chinese dialect Mandarin Speech Recognition Corpus (desktop) - digit string (200 people)
-
C-000043: Mandarin Chinese Speech Recognition Corpus (telephone channel) - Chinese single sentence (100 people)
Desktop/Microphone
This corpus comprises sentences uttered by 100 speakers of different dialects, ages and educational levels. Speech samples are stored as 16-bit 8 kHz WAV files, for a total of 7.3 hours of speech. The total size of the data is 400 MB.
Each speaker read 40 items. Text files are stored in Unicode format. All data have been proofread manually.
The corpus is intended for testing and for telephone natural speech recognition systems.
- hasVersion: C-000045: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - person name (100 people)
- hasVersion: C-000046: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - place name (100 people)
- hasVersion: C-000044: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - digit string (100 people)
-
C-000044: Mandarin Chinese Speech Recognition Corpus (telephone channel) - digit string (100 people)
Desktop/Microphone
This corpus comprises digit strings uttered by 100 speakers of different dialects, ages and educational levels. Speech samples are stored as 16-bit 8 kHz WAV files, for a total of 7.5 hours of speech. The total size of the data is 410 MB.
Each speaker read 40 items. Text files are stored in Unicode format. All data have been proofread manually.
The corpus is intended for testing and for telephone natural speech recognition systems.
- hasVersion: C-000043: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - Chinese single sentence (100 people)
- hasVersion: C-000045: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - person name (100 people)
- hasVersion: C-000046: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - place name (100 people)
-
C-000045: Mandarin Chinese Speech Recognition Corpus (telephone channel) - person name (100 people)
Desktop/Microphone
This corpus comprises person names uttered by 100 speakers of different dialects, ages and educational levels. Speech samples are stored as 16-bit 8 kHz WAV files, for a total of 6 hours of speech. The total size of the data is 328 MB.
Each speaker read 40 items. Text files are stored in Unicode format. All data have been proofread manually.
The corpus is intended for testing and for telephone natural speech recognition systems.
- hasVersion: C-000043: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - Chinese single sentence (100 people)
- hasVersion: C-000046: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - place name (100 people)
- hasVersion: C-000044: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - digit string (100 people)
-
C-000046: Mandarin Chinese Speech Recognition Corpus (telephone channel) - place name (100 people)
Desktop/Microphone
This corpus comprises place names uttered by 100 speakers of different dialects, ages and educational levels. Speech samples are stored as 16-bit 8 kHz WAV files, for a total of 6.2 hours of speech. The total size of the data is 338 MB.
Each speaker read 40 items. Text files are stored in Unicode format. All data have been proofread manually.
The corpus is intended for testing and for telephone natural speech recognition systems.
- hasVersion: C-000043: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - Chinese single sentence (100 people)
- hasVersion: C-000045: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - person name (100 people)
- hasVersion: C-000044: Chinese dialect Mandarin Speech Recognition Corpus (telephone channel) - digit string (100 people)
-
C-000050: Corpus of Contemporaneous Spanish Novels
Written Corpora
This corpus consists of 11 novels written in Castilian Spanish by Inmaculada Ferrer-Vidal Turull, a contemporary author. The novels are:
- La búsqueda: 113,639 words
- Tristeza: 41,125 words
- Cuarto menguante: 42,419 words
- Recuerdos: 55,694 words
- Sucedió en Abril: 46,040 words
- Viejos amigos: 84,082 words
- Soledad & Cia: 69,848 words
- El chispazo, la hoguera y las brasas: 108,877 words
- Un giro en la vida: 70,736 words
- Adiós: 2,016 words
- Vacaciones: 3,623 words
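The listing gives no overall size for the corpus, but one can be derived from the per-novel figures above. The short sketch below simply totals them; the variable names are illustrative, and the counts are copied from the list as given.

```python
# Word counts per novel, as listed in the catalogue entry
word_counts = {
    "La búsqueda": 113_639,
    "Tristeza": 41_125,
    "Cuarto menguante": 42_419,
    "Recuerdos": 55_694,
    "Sucedió en Abril": 46_040,
    "Viejos amigos": 84_082,
    "Soledad & Cia": 69_848,
    "El chispazo, la hoguera y las brasas": 108_877,
    "Un giro en la vida": 70_736,
    "Adiós": 2_016,
    "Vacaciones": 3_623,
}

total_words = sum(word_counts.values())
print(len(word_counts), "novels,", total_words, "words")  # prints "11 novels, 638099 words"
```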
The novels are available in Word format.
-
C-000053: Danish SpeechDat(II) FDB-4000
Telephone
The Danish SpeechDat(II) FDB-4000 comprises 4,000 Danish speakers (1,940 males, 2,060 females) recorded over the Danish fixed telephone network. This database is partitioned into 14 CDs. The first 13 CDs comprise 300 speaker sessions each; the 14th comprises 100 speaker sessions.
This speech database was validated by SPEX (the Netherlands) to assess its compliance with the SpeechDat format and content specifications.
Speech samples are stored as sequences of 8-bit 8 kHz A-law. Each prompted utterance is stored in a separate file. Each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.
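For readers unfamiliar with A-law, the following is a minimal sketch of how 8-bit A-law codes expand to 16-bit linear PCM, following the standard ITU-T G.711 expansion. It assumes the signal data is available as a sequence of raw A-law bytes; the function names are hypothetical, and nothing here is taken from the SpeechDat distribution itself.

```python
def alaw_to_linear(code):
    """Expand one 8-bit A-law code (ITU-T G.711) to a signed 16-bit PCM value."""
    code ^= 0x55                      # undo the alternate-bit inversion used on the wire
    mantissa = (code & 0x0F) << 4     # 4-bit quantisation step within the segment
    segment = (code & 0x70) >> 4      # 3-bit segment (exponent)
    if segment == 0:
        magnitude = mantissa + 8
    else:
        magnitude = (mantissa + 0x108) << (segment - 1)
    # After the inversion is undone, a set sign bit marks a positive sample
    return magnitude if code & 0x80 else -magnitude


def decode_alaw(data):
    """Decode a bytes object of A-law codes into a list of PCM sample values."""
    return [alaw_to_linear(b) for b in data]


print(decode_alaw(bytes([0xD5, 0x55, 0xAA, 0x2A])))  # prints [8, -8, 32256, -32256]
```

At 8 bits per sample and 8 kHz, one second of speech in this format occupies 8,000 bytes.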
Each speaker uttered the following items:
* 3 application words
* 1 sequence of 10 isolated digits
* 4 numbers (1 sheet number, 5/10 digits; 1 telephone number, 9/11 digits; 1 credit card number, 16 digits; 1 PIN code, 6 digits)
* 3 dates (1 spontaneous e.g. birthday, 1 word style prompted date, 1 relative and general date expression)
* 1 word spotting phrase using an embedded application word
* 1 isolated digit
* 3 spelled words (1 spontaneous e.g. own forename, 1 spelling of directory city name, 1 real word for coverage)
* 1 currency money amount
* 1 natural number
* 5 directory assistance names (1 spontaneous e.g. own forename, 1 city of school at 7 years, 1 of the most frequent cities out of a set of 500, 1 of the most frequent companies/agencies out of a set of 500 names, 1 "forename surname" out of a set of 500 names)
* 2 yes/no questions (1 predominantly "yes" question, 1 predominantly "no" question)
* 9 phonetically rich sentences
* 2 time phrases (1 spontaneous time of day, 1 word style time phrase)
* 4 phonetically rich words
The following age distribution has been obtained: 372 speakers are under 16, 1004 speakers are between 16 and 30, 1109 speakers are between 31 and 45, 901 speakers are between 46 and 60, and 614 speakers are over 60.
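The speaker figures quoted in this entry (gender split, CD partition and age distribution) can be checked against the stated total of 4,000. The sketch below is just that consistency check, with the numbers copied from the description; the variable names are illustrative.

```python
total_speakers = 4000

by_gender = {"male": 1940, "female": 2060}
sessions_per_cd = [300] * 13 + [100]  # 13 CDs of 300 sessions, 1 CD of 100
by_age = {"under 16": 372, "16-30": 1004, "31-45": 1109, "46-60": 901, "over 60": 614}

tallies = {
    "gender": sum(by_gender.values()),
    "CDs": sum(sessions_per_cd),
    "age": sum(by_age.values()),
}

for name, value in tallies.items():
    status = "ok" if value == total_speakers else "mismatch"
    print(f"{name}: {value} ({status})")
```

All three breakdowns sum to 4,000, so the figures in the description are mutually consistent.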
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
-
C-000061: EUROM1g German
Desktop/Microphone
EUROM1 is the first truly multilingual speech database produced in Europe. It provides equivalent corpora for each of the European languages covered: the same number of speakers, selected in the same way and recorded under the same conditions, with common file formats. Recordings were initially made in eight European countries: Italy, United Kingdom, Germany, Netherlands, Denmark, Sweden, Norway and France. Additional recordings were then completed in Greece, Spain and Portugal, thanks to the CEE Esprit Project SAM-A. The content consists of Numbers, Passages, Sentences and CVC, with more than sixty speakers per language.
- isVersionOf: C-000915: EUROM1f
- isVersionOf: C-001403: EUROM1e
-
C-000066: Finnish SpeechDat(II) FDB-1000
Telephone
The Finnish SpeechDat(II) FDB-1000 comprises 1,000 Finnish speakers (617 males, 383 females) recorded over the Finnish fixed telephone network. The FDB-1000 database is partitioned into 4 CDs: 3 CDs comprise 300 speaker sessions each, and the 4th comprises 100 speaker sessions. The speech databases made within the SpeechDat(II) project were validated by SPEX (the Netherlands) to assess their compliance with the SpeechDat format and content specifications.
Speech samples are stored as sequences of 8-bit 8 kHz A-law. Each prompted utterance is stored in a separate file. Each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.
Each speaker uttered the following items:
* 1 isolated digit
* 1 sequence of 10 isolated digits
* 4 numbers: 1 sheet number (5 digits), 1 telephone number (9-10 digits), 1 credit card number (16 digits), 1 PIN code (6 digits)
* 1 currency money amount
* 1 natural number
* 3 dates: 1 spontaneous date (birthdate), 1 prompted date, 1 relative or general date expression
* 2 time phrases: 1 time of day (spontaneous), 1 time phrase
* 3 spelled words: 1 spontaneous own forename, 1 city name, 1 phonetically rich word
* 5 directory assistance names: 1 spontaneous own forename, 1 spontaneous city of growing up, 1 frequent city name, 1 frequent company name, 1 common forename surname
* 2 yes/no questions: 1 predominantly "yes" question, 1 predominantly "no" question
* 3 application words
* 1 word spotting phrase using an embedded application word
* 4 phonetically rich words
* 9 phonetically rich sentences
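The item list above can be tallied to see how many prompted utterances a single session contains. The sketch below does this and, since each prompted utterance is stored in a separate file, derives an upper bound on the number of signal files. That bound assumes every one of the 1,000 speakers recorded every item, which the description does not state; the dictionary keys are shorthand labels, not names from the corpus.

```python
# Items per speaker, as listed in the catalogue entry
items_per_speaker = {
    "isolated digit": 1,
    "sequence of 10 isolated digits": 1,
    "numbers": 4,
    "currency money amount": 1,
    "natural number": 1,
    "dates": 3,
    "time phrases": 2,
    "spelled words": 3,
    "directory assistance names": 5,
    "yes/no questions": 2,
    "application words": 3,
    "word spotting phrase": 1,
    "phonetically rich words": 4,
    "phonetically rich sentences": 9,
}

items_per_session = sum(items_per_speaker.values())
speakers = 1000

# Upper bound only: assumes no missing or rejected utterances
max_signal_files = items_per_session * speakers
print(items_per_session, max_signal_files)  # prints "40 40000"
```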
The following age distribution has been obtained: 57 speakers are below 16 years old, 609 speakers are between 16 and 30, 223 speakers are between 31 and 45, 104 speakers are between 46 and 60, and 7 speakers are over 60.
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.