Language resource #: 3330
-
C-000372: VERBMOBIL II - VM CD 19.1 - VM19.1 (new edition)
Desktop/Microphone
Verbmobil is a long-term project of the German Federal Ministry of Education, Science, Research and Technology (BMBF, Projektträger DLR). Its aim is to give Germany a leading international position in language technology and its economic application in the next millennium by bringing together as many specialists from industry and science as possible. The long-term goal is the development of a mobile translation system for spontaneous speech in face-to-face situations. The following resources are spontaneous speech databases recorded in a dialogue task (appointment scheduling).
VERBMOBIL II - VM CD 19.1 - VM19.1 (new edition) consists of 1 CD-ROM with 200 dialogues (200 appointment schedulings, 2911 turns) in Japanese.
-
C-000373: VERBMOBIL II - VM CD 65.0 - VM65.0 (original edition)
Desktop/Microphone
Verbmobil is a long-term project of the German Federal Ministry of Education, Science, Research and Technology (BMBF, Projektträger DLR). Its aim is to give Germany a leading international position in language technology and its economic application in the next millennium by bringing together as many specialists from industry and science as possible. The long-term goal is the development of a mobile translation system for spontaneous speech in face-to-face situations. The following resources are spontaneous speech databases recorded in a dialogue task (appointment scheduling).
VERBMOBIL II - VM CD 65.0 - VM65.0 (original edition) consists of 1 CD-ROM with 13 WOZ (Wizard-of-Oz) dialogues designed to evoke emotions (mainly anger), with transliteration and emotion labeling, in German.
-
C-000374: VERBMOBIL II - VM CD 53.1 - VM53.1 (BAS edition)
Desktop/Microphone
Verbmobil is a long-term project of the German Federal Ministry of Education, Science, Research and Technology (BMBF, Projektträger DLR). Its aim is to give Germany a leading international position in language technology and its economic application in the next millennium by bringing together as many specialists from industry and science as possible. The long-term goal is the development of a mobile translation system for spontaneous speech in face-to-face situations. The following resources are spontaneous speech databases recorded in a dialogue task (appointment scheduling).
VERBMOBIL II - VM CD 53.1 - VM53.1 (BAS edition) consists of 1 CD-ROM with 16 spontaneous dialogues (16 close mic, 8 room mic, 8 phone line (GSM) recordings), 1771 turns, with transliteration (VM II format), in German.
- hasVersion: C-000375: VERBMOBIL II - VM CD 60.1 - VM60.1 (BAS edition)
- hasVersion: C-000376: VERBMOBIL II - VM CD 61.1 - VM61.1 (BAS edition)
- hasVersion: C-000369: VERBMOBIL II - VM CD 62.1 - VM62.1 (BAS edition)
- hasVersion: C-001576: VERBMOBIL II - VM CD 51.1 - VM51.1 (BAS edition)
- hasVersion: C-001577: VERBMOBIL II - VM CD 52.1 - VM52.1 (BAS edition)
- hasVersion: C-001578: VERBMOBIL II - VM CD 55.1 - VM55.1 (BAS edition)
- hasVersion: C-001579: VERBMOBIL II - VM CD 56.1 - VM56.1 (BAS edition)
- hasVersion: C-001580: VERBMOBIL II - VM CD 57.1 - VM57.1 (BAS edition)
- hasVersion: C-001581: VERBMOBIL II - VM CD 58.1 - VM58.1 (BAS edition)
- hasVersion: C-001582: VERBMOBIL II - VM CD 59.1 - VM59.1 (BAS edition)
-
C-000375: VERBMOBIL II - VM CD 60.1 - VM60.1 (BAS edition)
Desktop/Microphone
Verbmobil is a long-term project of the German Federal Ministry of Education, Science, Research and Technology (BMBF, Projektträger DLR). Its aim is to give Germany a leading international position in language technology and its economic application in the next millennium by bringing together as many specialists from industry and science as possible. The long-term goal is the development of a mobile translation system for spontaneous speech in face-to-face situations. The following resources are spontaneous speech databases recorded in a dialogue task (appointment scheduling).
VERBMOBIL II - VM CD 60.1 - VM60.1 (BAS edition) consists of 1 CD-ROM with 10 spontaneous dialogues (10 close mic, 0 room mic, 0 phone line (GSM) recordings), 501 turns, with transliteration (VM II format), in Japanese.
- hasVersion: C-000374: VERBMOBIL II - VM CD 53.1 - VM53.1 (BAS edition)
- hasVersion: C-000376: VERBMOBIL II - VM CD 61.1 - VM61.1 (BAS edition)
- hasVersion: C-000369: VERBMOBIL II - VM CD 62.1 - VM62.1 (BAS edition)
- hasVersion: C-001576: VERBMOBIL II - VM CD 51.1 - VM51.1 (BAS edition)
- hasVersion: C-001577: VERBMOBIL II - VM CD 52.1 - VM52.1 (BAS edition)
- hasVersion: C-001578: VERBMOBIL II - VM CD 55.1 - VM55.1 (BAS edition)
- hasVersion: C-001579: VERBMOBIL II - VM CD 56.1 - VM56.1 (BAS edition)
- hasVersion: C-001580: VERBMOBIL II - VM CD 57.1 - VM57.1 (BAS edition)
- hasVersion: C-001581: VERBMOBIL II - VM CD 58.1 - VM58.1 (BAS edition)
- hasVersion: C-001582: VERBMOBIL II - VM CD 59.1 - VM59.1 (BAS edition)
-
C-000376: VERBMOBIL II - VM CD 61.1 - VM61.1 (BAS edition)
Desktop/Microphone
Verbmobil is a long-term project of the German Federal Ministry of Education, Science, Research and Technology (BMBF, Projektträger DLR). Its aim is to give Germany a leading international position in language technology and its economic application in the next millennium by bringing together as many specialists from industry and science as possible. The long-term goal is the development of a mobile translation system for spontaneous speech in face-to-face situations. The following resources are spontaneous speech databases recorded in a dialogue task (appointment scheduling).
VERBMOBIL II - VM CD 61.1 - VM61.1 (BAS edition) consists of 1 CD-ROM with 19 spontaneous dialogues (19 close mic, 0 room mic, 0 phone line (GSM) recordings), 946 turns, with transliteration (VM II format), in Japanese.
- hasVersion: C-000374: VERBMOBIL II - VM CD 53.1 - VM53.1 (BAS edition)
- hasVersion: C-000375: VERBMOBIL II - VM CD 60.1 - VM60.1 (BAS edition)
- hasVersion: C-000369: VERBMOBIL II - VM CD 62.1 - VM62.1 (BAS edition)
- hasVersion: C-001576: VERBMOBIL II - VM CD 51.1 - VM51.1 (BAS edition)
- hasVersion: C-001577: VERBMOBIL II - VM CD 52.1 - VM52.1 (BAS edition)
- hasVersion: C-001578: VERBMOBIL II - VM CD 55.1 - VM55.1 (BAS edition)
- hasVersion: C-001579: VERBMOBIL II - VM CD 56.1 - VM56.1 (BAS edition)
- hasVersion: C-001580: VERBMOBIL II - VM CD 57.1 - VM57.1 (BAS edition)
- hasVersion: C-001581: VERBMOBIL II - VM CD 58.1 - VM58.1 (BAS edition)
- hasVersion: C-001582: VERBMOBIL II - VM CD 59.1 - VM59.1 (BAS edition)
-
C-000377: VERBMOBIL II - VM CD 64.0 - VM64.0 (original edition)
Desktop/Microphone
Verbmobil is a long-term project of the German Federal Ministry of Education, Science, Research and Technology (BMBF, Projektträger DLR). Its aim is to give Germany a leading international position in language technology and its economic application in the next millennium by bringing together as many specialists from industry and science as possible. The long-term goal is the development of a mobile translation system for spontaneous speech in face-to-face situations. The following resources are spontaneous speech databases recorded in a dialogue task (appointment scheduling).
VERBMOBIL II - VM CD 64.0 - VM64.0 (original edition) consists of 1 CD-ROM with 13 WOZ (Wizard-of-Oz) dialogues designed to evoke emotions (mainly anger), with transliteration and emotion labeling, in German.
-
C-000380: French SpeechDat-Car
Desktop/Microphone
The French SpeechDat-Car comprises the recordings of 313 French speakers from 6 different regions (158 males, 155 females), recorded over the GSM telephone network and in a car. This database is partitioned into 16 DVDs. The speech databases made within the SpeechDat-Car project were validated by SPEX, the Netherlands, to assess their compliance with the SpeechDat-Car format and content specifications.
The speech data files are in two formats. Four of the microphones were recorded on a computer in the boot of the car; these signals are stored as uncompressed 16-bit samples at 16 kHz. The fifth microphone was connected to the GSM phone and recorded on a remote machine; these signals are stored as 8-bit A-law samples at 8 kHz. Each signal file is accompanied by an ASCII SAM label file containing the relevant descriptive information.
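As an illustration of the GSM-channel format, the following is a minimal sketch of expanding 8-bit A-law bytes to 16-bit linear samples, following the standard ITU-T G.711 A-law expansion. It assumes the signal files are headerless streams of A-law bytes, which the description above does not state explicitly; the function names `alaw_to_linear` and `decode_alaw` are hypothetical and not part of any SpeechDat-Car tooling.

```python
def alaw_to_linear(a_val: int) -> int:
    """Expand one 8-bit A-law code to a signed 16-bit linear sample (G.711)."""
    a_val ^= 0x55                      # undo the even-bit inversion used by A-law
    t = (a_val & 0x0F) << 4            # 4-bit mantissa, shifted into position
    seg = (a_val & 0x70) >> 4          # 3-bit segment (exponent)
    if seg == 0:
        t += 8                         # linear segment: add half a step
    elif seg == 1:
        t += 0x108
    else:
        t = (t + 0x108) << (seg - 1)   # higher segments double in step size
    # After the XOR, a set top bit marks a positive sample.
    return t if a_val & 0x80 else -t


def decode_alaw(data: bytes) -> list[int]:
    """Decode a byte string of A-law samples (assumed headerless)."""
    return [alaw_to_linear(b) for b in data]


if __name__ == "__main__":
    # Smallest and largest magnitudes in both polarities.
    print(decode_alaw(bytes([0xD5, 0x55, 0xAA, 0x2A])))
```

The in-car channels need no such expansion, since they are already stored as uncompressed 16-bit samples.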
Each speaker uttered the following items:
- 2 voice activation keywords
- 1 sequence of 10 isolated digits
- 7 connected digits : 1 sheet number (5+ digits), 1 spontaneous telephone number, 3 read telephone numbers, 1 credit card number (14-16 digits), 1 PIN code (6 digits)
- 3 dates : 1 spontaneous date (e.g. birthday), 1 prompted date, 1 relative or general date expression
- 2 word spotting phrases using an application word (embedded)
- 1 question (extra item)
- 4 isolated digits
- 7 spelled words : 1 spontaneous (own forename or surname), 1 spelling of directory city name, 4 real word/name, 1 artificial name for coverage
- 1 money amount
- 1 email address (extra item)
- 1 natural number
- 7 directory assistance names : 1 spontaneous (own forename or surname), 1 city of birth / growing up (spontaneous), 2 most frequent cities, 2 most frequent company/agency, 1 "forename surname"
- 9 phonetically rich sentences
- 2 time phrases : 1 time of day (spontaneous), 1 time phrase (word style)
- 4 phonetically rich words
- 67 application words: 13 mobile phone application words, 22 IVR function keywords, 32 car products keywords
- 2 additional language dependent keywords
- 1 additional language dependent keyword (extra item)
- Prompts for spontaneous speech
The following age distribution has been obtained: 208 speakers are between 16 and 30, 78 speakers are between 31 and 45, 25 speakers are between 46 and 60, and 2 speakers are over 60. A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
-
C-000381: Danish SpeechDat-Car - GSM recordings - GSM recordings only
Desktop/Microphone
The Danish SpeechDat-Car contains the recordings of 300 Danish speakers from 5 different regions (162 males, 138 females), recorded over the GSM telephone network and in a car. This database is partitioned into 15 DVDs (53 GB), plus 1 CD-ROM for material such as non-signal files and documentation. The speech data files are in two formats. Four of the microphones were recorded on a computer in the boot of the car; these signals are stored as uncompressed 16-bit samples at 16 kHz. The fifth microphone was connected to the cell phone and recorded on a remote machine; these signals are stored as 8-bit A-law samples at 8 kHz. Each signal file is accompanied by an ASCII SAM label file containing the relevant descriptive information.
This speech database was validated by SPEX (the Netherlands) to assess its compliance with the SpeechDat-Car format and content specifications.
Each speaker uttered the following items:
* 2 voice activation keywords
* 1 sequence of 10 isolated digits
* 7 connected digits (1 sheet number of 5 digits, 1 spontaneous telephone number, 3 read telephone numbers, 1 credit card number of 14/16 digits, 1 PIN code of 6 digits)
* 3 dates (1 spontaneous date e.g. birthday, 1 prompted date, 1 relative or general date expression)
* 2 word spotting phrases using an embedded application word
* 4 isolated digits
* 7 spelled words (1 spontaneous e.g. own forename or surname, 1 directory city name, 4 real word/name, 1 artificial name for coverage)
* 1 money amount
* 1 natural number
* 7 directory assistance names (1 spontaneous e.g. own forename or surname, 1 city of birth/growing up, 2 most frequent cities, 2 most frequent company/agency, 1 "forename surname")
* 9 phonetically rich sentences
* 2 time phrases (1 spontaneous time of day, 1 word style time phrase)
* 4 phonetically rich words
* 67 application words (13 mobile phone application words, 22 IVR function keywords, 32 car products keywords)
* 2 additional language dependent keywords
* Prompts for spontaneous speech
* 2 additional keywords from a list of 10
The following age distribution has been obtained: 84 speakers are between 18 and 30, 99 speakers are between 31 and 45, 98 speakers are between 46 and 60, and 19 speakers are over 60.
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
-
C-000382: Danish SpeechDat-Car - In-car recordings
Desktop/Microphone
The Danish SpeechDat-Car contains the recordings of 300 Danish speakers from 5 different regions (162 males, 138 females), recorded over the GSM telephone network and in a car. This database is partitioned into 15 DVDs (53 GB), plus 1 CD-ROM for material such as non-signal files and documentation. The speech data files are in two formats. Four of the microphones were recorded on a computer in the boot of the car; these signals are stored as uncompressed 16-bit samples at 16 kHz. The fifth microphone was connected to the cell phone and recorded on a remote machine; these signals are stored as 8-bit A-law samples at 8 kHz. Each signal file is accompanied by an ASCII SAM label file containing the relevant descriptive information.
This speech database was validated by SPEX (the Netherlands) to assess its compliance with the SpeechDat-Car format and content specifications.
Each speaker uttered the following items:
* 2 voice activation keywords
* 1 sequence of 10 isolated digits
* 7 connected digits (1 sheet number of 5 digits, 1 spontaneous telephone number, 3 read telephone numbers, 1 credit card number of 14/16 digits, 1 PIN code of 6 digits)
* 3 dates (1 spontaneous date e.g. birthday, 1 prompted date, 1 relative or general date expression)
* 2 word spotting phrases using an embedded application word
* 4 isolated digits
* 7 spelled words (1 spontaneous e.g. own forename or surname, 1 directory city name, 4 real word/name, 1 artificial name for coverage)
* 1 money amount
* 1 natural number
* 7 directory assistance names (1 spontaneous e.g. own forename or surname, 1 city of birth/growing up, 2 most frequent cities, 2 most frequent company/agency, 1 "forename surname")
* 9 phonetically rich sentences
* 2 time phrases (1 spontaneous time of day, 1 word style time phrase)
* 4 phonetically rich words
* 67 application words (13 mobile phone application words, 22 IVR function keywords, 32 car products keywords)
* 2 additional language dependent keywords
* Prompts for spontaneous speech
* 2 additional keywords from a list of 10
The following age distribution has been obtained: 84 speakers are between 18 and 30, 99 speakers are between 31 and 45, 98 speakers are between 46 and 60, and 19 speakers are over 60.
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
-
C-000383: Finnish SpeechDat-Car
Desktop/Microphone
The Finnish SpeechDat-Car contains the recordings of 302 Finnish speakers from 3 major dialectal areas (with 13 sub-areas) (151 males, 151 females), recorded over the GSM telephone network, and in a car. This database is partitioned into 142 CDs (DVDs are also available).
The speech data files are in two formats. Four of the 5 microphones were recorded on a computer in the boot of the car; these signals are stored as uncompressed 16-bit samples at 16 kHz. The fifth microphone was connected to the cell phone and recorded on a remote machine; these signals are stored as 8-bit A-law samples at 8 kHz. Each signal file is accompanied by an ASCII SAM label file containing the relevant descriptive information.
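The two formats imply different data rates, which can be used to estimate the duration of a recording from its file size. The sketch below assumes headerless signal files (an assumption, not stated in the description above); the channel labels `"in_car"` and `"gsm"` and the function name `duration_seconds` are hypothetical.

```python
# Bytes per second implied by the two signal formats.
IN_CAR_BYTES_PER_SECOND = 16_000 * 2  # 16 kHz, 16-bit (2-byte) samples
GSM_BYTES_PER_SECOND = 8_000 * 1      # 8 kHz, 8-bit A-law samples


def duration_seconds(size_bytes: int, channel: str) -> float:
    """Estimate the duration of a headerless signal file from its size.

    `channel` is a hypothetical label: "in_car" or "gsm".
    """
    rates = {"in_car": IN_CAR_BYTES_PER_SECOND, "gsm": GSM_BYTES_PER_SECOND}
    return size_bytes / rates[channel]


if __name__ == "__main__":
    # The same number of bytes lasts four times longer on the GSM channel.
    print(duration_seconds(64_000, "in_car"), duration_seconds(64_000, "gsm"))
```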
This speech database was validated by SPEX (the Netherlands) to assess its compliance with the SpeechDat-Car format and content specifications.
Each speaker uttered the following items:
* 2 voice activation keywords
* 1 sequence of 10 isolated digits
* 7 connected digits (1 sheet number of 5 digits, 1 spontaneous telephone number, 3 read telephone numbers, 1 credit card number of 14/16 digits, 1 PIN code of 6 digits)
* 3 dates (1 spontaneous date e.g. birthday, 1 prompted date, 1 relative or general date expression)
* 2 word spotting phrases using an embedded application word
* 4 isolated digits
* 7 spelled words (1 spontaneous e.g. own forename or surname, 1 directory city name, 4 real word/name, 1 artificial name for coverage)
* 1 money amount
* 1 natural number
* 7 directory assistance names (1 spontaneous e.g. own forename or surname, 1 city of birth/growing up, 2 most frequent cities, 2 most frequent company/agency, 1 "forename surname")
* 9 phonetically rich sentences
* 2 time phrases (1 spontaneous time of day, 1 word style time phrase)
* 4 phonetically rich words
* 67 application words (13 mobile phone application words, 22 IVR function keywords, 32 car products keywords)
* 2 additional language dependent keywords
* 10 spontaneous situations
* 6 names + extensions
The following age distribution has been obtained: 138 speakers are between 16 and 30, 89 speakers are between 31 and 45, and 75 speakers are between 46 and 60. No speakers are over 60.
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
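The speaker figures quoted for the SpeechDat-Car databases can be cross-checked: the gender split and the age distribution should each add up to the stated number of speakers. The following is a small sketch of that check using only the numbers given in the entries above; the dictionary layout and the function name `inconsistent` are illustrative choices, not part of the catalogue.

```python
# Speaker counts as quoted in the French, Danish and Finnish entries.
SPEAKERS = {
    "French": {"total": 313, "gender": (158, 155), "age": (208, 78, 25, 2)},
    "Danish": {"total": 300, "gender": (162, 138), "age": (84, 99, 98, 19)},
    "Finnish": {"total": 302, "gender": (151, 151), "age": (138, 89, 75, 0)},
}


def inconsistent(entries: dict) -> list[str]:
    """Return the names of entries whose breakdowns do not sum to the total."""
    return [
        name
        for name, e in entries.items()
        if sum(e["gender"]) != e["total"] or sum(e["age"]) != e["total"]
    ]


if __name__ == "__main__":
    # An empty list means every breakdown matches its stated total.
    print(inconsistent(SPEAKERS))
```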