Language Resource Search - SHACHI: Language Resource Metadata Database

Registered language resources: 3,330. Showing items 881-890 of 2,023.

C-001508: Phonetically Balanced Words (4)
Desktop/Microphone
Large acoustic corpus in Korean produced by KAIST KORTERM. 70 native Korean speakers (male and female) each read 32 cardinal numbers and 9 one-syllable determinatives four times. Two announcers read the same items only twice. Information about the speakers, such as their size and level of education, is provided. The recordings took place in a soundproof room. The data are stored as 8-bit A-law speech files with a 16 kHz sampling rate. The standard in use is NIST.
C-001509: Phonetically Balanced Words (5)
Desktop/Microphone
Large acoustic corpus in Korean produced by KAIST KORTERM. 70 native Korean speakers (male and female) each read 35 cardinal numbers, each composed of 4 single numbers, four times. Two announcers read the same items only twice. Information about the speakers, such as their size and level of education, is provided. The recordings took place in a soundproof room. The data are stored as 8-bit A-law speech files with a 16 kHz sampling rate. The standard in use is NIST.
C-001510: Phonetically Balanced Words (3)
Desktop/Microphone
Large acoustic corpus in Korean produced by KAIST KORTERM. Two announcers and 70 native speakers (male and female) each read one paragraph twice. Information about the speakers, such as their size and level of education, is provided. The recordings took place in a soundproof room. The data are stored as 8-bit A-law speech files with a 16 kHz sampling rate. The standard in use is NIST.
C-001511: Phonetically Rich Words
Telephone
Large acoustic corpus in Korean produced by KAIST KORTERM. 500 native speakers were recorded (250 male, 250 female). They uttered 32 single cardinal numbers, 1,620 cardinal numbers composed of 4 single numbers, and 3,813 phonetically rich words. The recordings took place in natural environments, by telephone (wired, wireless and mobile phones). The data are stored as 8-bit A-law speech files with a 16 kHz sampling rate. The standard in use is NIST.
C-001512: Qualified POS Tagged Corpus
Written Corpora
Monolingual Korean corpus in .txt format, produced by KAIST KORTERM, containing 1,020,000 eojeols (Korean terms). The corpus is morphologically analyzed and POS tagged, and was corrected three times by specialists.
C-001513: RVG-J (Regional Variants of German J)
Desktop/Microphone
This corpus contains 21,691 recordings, made in quiet living-room acoustics, of 182 adolescents (aged 13-20) living in the German state of Bavaria.
The content is:
- RVG prompts; the prompts are identical to the prompted texts of the RVG1 project (see ELRA-S0058) (including one short monologue of spontaneous speech).
- digit strings
- credit card numbers
- date expressions
- spellings
- proper names
- time expressions
- spontaneous responses to questions
Features:
- 120 Prompts per speaker
- total: 15 GB, 100 h
- two different recording environments
- 2 microphones recorded in parallel: headset and collar-attached microphone
- Formats and distribution: SpeechDat Exchange Format
- Transcription: SpeechDa
C-001514: Russian Speech Database
Desktop/Microphone
The STC Russian speech database was recorded in 1996-1998. The main purpose of the database is to investigate individual speaker variability and to validate speaker recognition algorithms. The database was recorded through a 16-bit Vibra-16 Creative Labs sound card with an 11,025 Hz sampling rate.
The database contains Russian read speech from 89 different speakers (54 male, 35 female), including 70 speakers with 15 sessions or more, 10 speakers with 10 sessions or more, and 9 speakers with fewer than 10 sessions. The speakers were recorded in Saint Petersburg and are aged 18 to 62. All are native speakers.
The corpus consists of 5 sentences. Each speaker read each sentence carefully but fluently 15 times on different dates over a period of 1-3 months. The corpus contains a total of 6,889 utterances in 2 volumes, with a total size of 700 MB of uncompressed data. The signal of each utterance is stored as a separate file (approx. 126 KB). The total data size for one speaker is approximately 9,500 KB. The average utterance duration is about 5 sec.
A file gives information about the speakers (speaker's age and gender). The orthographic and phonetic transcriptions of the corpus are given in separate files, which contain the prompted sentences and their transcription in IPA. The signal files are raw files without any header: 16 bits per sample, linear, 11,025 Hz sampling frequency.
The recording conditions were as follows:
Microphone: dynamic omnidirectional high-quality microphone, distance to mouth 5-10 cm
Environment: office room
Sampling rate: 11,025 Hz
Resolution: 16 Bit
Sound board: Creative Labs Vibra-16
Means of delivery: CD-ROM
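Because the signal files are described as headerless raw files (16-bit linear, 11,025 Hz), a reader has to supply the format parameters itself. The following is a minimal sketch of how such a file could be decoded and its duration estimated. The byte order is not stated in this entry, so little-endian is an assumption here, as is reading "KB" as 1,024 bytes; the function names are hypothetical.

```python
import array
import sys

SAMPLE_RATE_HZ = 11_025  # sampling rate given in the catalog entry
BYTES_PER_SAMPLE = 2     # 16-bit linear PCM


def read_raw_pcm(data: bytes, little_endian: bool = True) -> array.array:
    """Decode headerless 16-bit linear PCM bytes into signed integers.

    little_endian=True is an assumption; the entry does not state byte order.
    """
    usable = len(data) - (len(data) % BYTES_PER_SAMPLE)  # ignore a trailing odd byte
    samples = array.array("h")  # signed 16-bit integers
    samples.frombytes(data[:usable])
    # array decodes in native byte order, so swap when the file order differs.
    if little_endian != (sys.byteorder == "little"):
        samples.byteswap()
    return samples


def duration_seconds(num_bytes: int) -> float:
    """Duration implied by a raw file of num_bytes at 16 bit, 11,025 Hz, mono."""
    return num_bytes / (BYTES_PER_SAMPLE * SAMPLE_RATE_HZ)


# A file of about 126 KB (taken as 126 * 1024 bytes) lasts a little under 6 s.
print(round(duration_seconds(126 * 1024), 2))  # 5.85
```

Under these assumptions the stated file size of about 126 KB corresponds to slightly less than 6 seconds of audio, the same order of magnitude as the stated average of about 5 sec. Likewise, 5 sentences read 15 times gives 75 files per speaker, and 75 x 126 KB is about 9,450 KB, close to the stated 9,500 KB per speaker.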
C-001515: SALA Spanish Chilean Database
Telephone
The SALA Spanish Chilean Database comprises 1,024 Chilean speakers (477 males, 547 females) recorded over the Chilean fixed telephone network. This database is partitioned into 6 CD-ROMs. The speech databases made within the SALA project were validated by SPEX, the Netherlands, to assess their compliance with the SALA format and content specifications.

The speech samples are stored as uncompressed sequences of 8-bit, 8 kHz A-law, according to the specifications of SALA. Each prompt utterance is stored in a separate file and has an accompanying ASCII SAM label file.
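To work with 8-bit A-law data, each byte is usually expanded to a 16-bit linear sample first. The sketch below shows the standard ITU-T G.711 A-law expansion in pure Python. It assumes the files contain nothing but G.711 A-law bytes at 8 kHz, one byte per sample; the SALA specification itself was not consulted, and the function names are hypothetical.

```python
SAMPLE_RATE_HZ = 8_000  # sampling rate given in the catalog entry


def alaw_to_linear(code: int) -> int:
    """Expand one 8-bit A-law code to a signed 16-bit linear sample (G.711)."""
    code ^= 0x55                    # undo the alternate-bit inversion of G.711
    magnitude = (code & 0x0F) << 4  # 4-bit mantissa
    segment = (code & 0x70) >> 4    # 3-bit segment number
    if segment == 0:
        magnitude += 8              # linear segment near zero
    else:
        magnitude = (magnitude + 0x108) << (segment - 1)
    # After the inversion is undone, a set top bit marks a positive sample.
    return magnitude if code & 0x80 else -magnitude


def decode_alaw(data: bytes) -> list[int]:
    """Expand a whole A-law byte string to linear samples."""
    return [alaw_to_linear(byte) for byte in data]


def duration_seconds(num_bytes: int) -> float:
    """One byte per sample, so duration is the byte count over the sample rate."""
    return num_bytes / SAMPLE_RATE_HZ


print(decode_alaw(bytes([0xD5, 0x55, 0xAA, 0x2A])))  # [8, -8, 32256, -32256]
```

With one byte per sample at 8 kHz, each second of speech occupies 8,000 bytes, so an utterance file of 40,000 bytes would hold 5 seconds of audio.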

Each speaker uttered the following items:
- 6 application words;
- 1 sequence of 10 isolated digits;
- 4 connected digits: 1 sheet number (6 digits), 1 telephone number (9-11 digits), 1 credit card number (14-16 digits), 1 PIN code (6 digits);
- 3 dates: 1 spontaneous date (e.g. birthday), 1 prompted date (word style), 1 relative and general date expression;
- 1 spotting phrase using an application word (embedded);
- 1 isolated digit;
- 3 spelled-out words (letter sequences): 1 spelling of surname; 1 spelling of directory assistance city name; 1 real/artificial name for coverage;
- 1 currency money amount;
- 1 natural number;
- 5 directory assistance names: 1 surname (out of 500); 1 city of birth / growing up (spontaneous); 1 most frequent city (out of 500); 1 most frequent company/agency (out of 500); 1 "forename surname" (set of 150);
- 2 questions, including "fuzzy" yes/no: 1 predominantly "yes" question, 1 predominantly "no" question;
- 9 phonetically rich sentences;
- 2 time phrases: 1 time of day (spontaneous), 1 time phrase (word style);
- 4 phonetically rich words.

The following age distribution has been obtained: 2 speakers are under 16 years old, 288 speakers are between 16 and 30, 481 speakers are between 31 and 45, 239 speakers are between 46 and 60, and 14 speakers are over 60.

A phonetic lexicon with canonical transcriptions in SAMPA is also provided.
C-001516: SALA Spanish Colombian Database
Telephone
The SALA Spanish Colombian database contains the recordings of 1,000 Colombian speakers (475 males, 525 females) recorded over the Colombian fixed telephone network. Six speakers repeated the same prompt sheet in different calls. This database is partitioned into 4 CDs, which comprise 300 speaker sessions each (except for CD 4, with 100 speaker sessions).

Speech samples are stored as uncompressed sequences of 8-bit, 8 kHz A-law, according to the specifications of SALA. Each prompt utterance is stored in a separate file and has an accompanying ASCII SAM label file.

This speech database was validated by SPEX (the Netherlands) to assess its compliance with the SALA format and content specifications.

Each speaker uttered the following items:

* 6 application words
* 1 sequence of 10 isolated digits
* 4 connected digits (1 sheet number of 6 digits, 1 telephone number of 9-11 digits, 1 credit card number of 14-16 digits, 1 PIN code of 6 digits)
* 3 dates (1 spontaneous date e.g. birthday, 1 word style prompted date, 1 relative and general date expression)
* 1 spotting phrase using an embedded application word
* 1 isolated digit
* 3 spelled words (1 surname, 1 directory assistance city name, 1 real/artificial name for coverage)
* 1 currency money amount
* 1 natural number
* 5 directory assistance names (1 surname out of a set of 500, 1 city of birth/growing up, 1 most frequent city out of a set of 500, 1 most frequent company/agency out of a set of 500, 1 "forename surname" out of a set of 150)
* 2 yes/no questions (1 predominantly "yes" question, 1 predominantly "no" question)
* 9 phonetically rich sentences
* 2 time phrases (1 spontaneous time of day, 1 word style time phrase)
* 4 phonetically rich words

The following age distribution has been obtained: 11 speakers are under 16, 486 speakers are between 16 and 30, 305 speakers are between 31 and 45, 163 speakers are between 46 and 60, and 35 speakers are over 60.

A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
C-001517: SIEMENS 100 - SI100
Desktop/Microphone
The corpus contains read speech from 101 different speakers. Each speaker read approximately 100 sentences from a German newspaper corpus drawn from the Süddeutsche Zeitung (SZ), consisting of two subcorpora: the SZ subcorpus (544 sentences from newspaper articles) and the CeBit subcorpus (483 sentences from newspaper articles about CeBit 1995). Each subcorpus is divided into 5 parts of approximately 100 utterances each. With some exceptions, every speaker read only one part of one subcorpus, resulting in a total of approximately 10,100 recorded utterances (7 CD-ROMs).
