言語資源検索 - SHACHI: Language Resource Metadata Database

言語資源の登録件数: 3330件 2023 件中 1921 - 1930 件目

C-004962: SALA II US English database (2000 speakers)
The SALA II US English database collected in the United States was recorded within the scope of the SALA II project. It contains the recordings of ca. 2,000 US English speakers (equally balanced between males and females, including some speakers with Hispanic accents) recorded over the United States mobile telephone network.

The following acoustic conditions were selected as representative of a mobile user's environment (some speakers were recorded in several environments):
- Passenger in moving car, railway, bus, etc.
- Public place
- Stationary pedestrian by road side
- Home/office environment
- Passenger in moving car using a hands-free kit

The speech files are stored as sequences of 8-bit, 8kHz Mu-law speech files and are not compressed, according to the specifications of SALA II. Each prompt utterance is stored within a separate file and has an accompanying ASCII SAM label file.

This speech database was validated by SPEX (the Netherlands) to assess its compliance with the SALA II format and content specifications.

Each speaker uttered the following items:
- 6 application words (out of a set of 30)
- 1 sequence of 10 isolated digits
- 4 connected digits (1 sheet number -5+ digits, 1 telephone number –9/11 digits, 1 credit card number –14/16 digits, 1 PIN code -6 digits)
- 3 dates (1 spontaneous date e.g. birthday, 1 word style prompted date, 1 relative and general date expression)
- 1 spotting phrase using an embedded application word
- 2 isolated digits
- 3 spelled words (1 surname, 1 directory assistance city name, 1 real/artificial name for coverage)
- 1 currency money amount
- 1 natural number
- 5 directory assistance names (1 spontaneous, e.g. own surname, 1 city of birth/growing up, 1 most frequent city out of a set of 500, 1 most frequent company/agency out of a set of 500, 1 “forename surname” out of a set of 150 )
- 2 yes/no questions (1 predominantly “yes” question, 1 predominantly “no” question, including fuzzy questions)
- 9 phonetically rich sentences
- 2 time phrases (1 spontaneous time of day, 1 word style time phrase)
- 4 phonetically rich words

A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
C-004963: Buckeye Corpus
The Buckeye Corpus of conversational speech contains high-quality recordings from 40 speakers in Columbus OH conversing freely with an interviewer. The speech has been orthographically transcribed and phonetically labeled. The audio and text files, together with time-aligned phonetic labels, are stored in a format for use with speech analysis software (Xwaves and Wavesurfer). Software for searching the transcription files is currently being written. The corpus is FREE for noncommercial uses.
C-004964: Annotated Speech Corpora for 3 East Indian Languages
All the informants of the corpora are professional voice over artist. The speech is recorded in a speech studio environment and digitized at a sampling rate of 22,050 Hz with an accuracy of 16 bits/sample in PCM wave format. The annotation has been done both at text level and speech level. At text level Parts of Speech (POS), Phrase and Clause have been annotated. Text files are also phonetically transcribed in Internal Phonetic Alphabet (IPA). In case of speech, phonemes, syllables and breath pause have been annotated. The total size of the speech corpora is about 8.5GB. Majority of this Corpus is for Bangla Language (5.12 GB). Only standard dialect of a particular language is included in this corpora.
The content of the corpora has been designed in a way that it can help various aspects of speech research such as Speech Synthesis, Speech Recognition, Speaker Recognition etc.
C-004965: RML Emotion Database
The RML emotion database contains 720 audiovisual emotional expression samples that were collected at Ryerson Multimedia Lab. Six basic human emotions are expressed: Anger, Disgust, Fear, Happiness, Sadness, Surprise. A digital video camera was used to record the samples in a quiet and bright environment, with a simple background. Our experimental subjects were provided with a list of emotional sentences and were directed to express their emotions as naturally as possible by recalling the emotional happening, which they had experienced in their lives. A total number of ten different sentences were provided for each emotional class.
C-004966: Surrey Audio-Visual Expressed Emotion (SAVEE) Database
Surrey Audio-Visual Expressed Emotion (SAVEE) database has been recorded as a pre-requisite for the development of an automatic emotion recognition system. The database consists of recordings from 4 male actors in 7 different emotions, 480 British English utterances in total. The sentences were chosen from the standard TIMIT corpus and phonetically-balanced for each emotion. The data were recorded in a visual media lab with high quality audio-visual equipment, processed and labeled. To check the quality of performance, the recordings were evaluated by 10 subjects under audio, visual and audio-visual conditions. Classification systems were built using standard features and classifiers for each of the audio, visual and audio-visual modalities, and speaker-independent recognition rates of 61%, 65% and 84% achieved respectively.
C-004967: 台湾国語多言語話しことばコーパス
本コーパスは東京外国語大学大学院の21世紀COE「言語運用を基盤とする言語情報学拠点」の研究成果として公開。コーパスは今後もさらに構築されます。

台湾国語話ことばコーパスの調査は，台湾(中華民国)台北県淡水鎮淡江大学において行われた。インフォーマントとして音声の吹き込みを行ったのは淡江大学外国語学部日本語学科の学部生・大学院生54名(女性39名男性15名)である。二人一組で約1時間ほど自由に会話をしてもらい，それを録音した。録音はICレコーダを用い，16bit/ 44.1KHzというオーディオCDと同じフォーマットで，非圧縮形式で録音を行った。全ての音声データはハードディスクに保存されている。全録音時間は約33時間である。そのうち約22時間分(45万字以上)の録音について，威立活動顧問有限公司(Willy Event Consultans)によって文字転写が行われた。
- hasVersion: C-001319: カナダ・バイリンガル話ことばコーパス
- hasVersion: C-001320: フランス語（エックス）多言語話し言葉コーパス
- hasVersion: C-001321: フランス語（パリ）多言語話しことばコーパス
- hasVersion: C-001322: マレーシア語多言語話しことばコーパス
- hasVersion: C-001323: スペイン語多言語話しことばコーパス2004年度版
- hasVersion: C-001324: トルコ語多言語話しことばコーパス
- hasVersion: C-004968: スペイン語多言語話しことばコーパス2006年度版
C-004968: スペイン語多言語話しことばコーパス2006年度版
スペイン語話しことばコーパス2006年度版は40個の対話から構成されています。総語数は50,000語を超え、それぞれの対話は異なる主題に焦点があてられています。
- hasVersion: C-001319: カナダ・バイリンガル話ことばコーパス
- hasVersion: C-001320: フランス語（エックス）多言語話し言葉コーパス
- hasVersion: C-001321: フランス語（パリ）多言語話しことばコーパス
- hasVersion: C-001322: マレーシア語多言語話しことばコーパス
- hasVersion: C-001323: スペイン語多言語話しことばコーパス2004年度版
- hasVersion: C-001324: トルコ語多言語話しことばコーパス
- hasVersion: C-004967: 台湾国語多言語話しことばコーパス
C-004971: アイヌ語口承文芸コーパス―音声・グロスつき―
木村きみさん (1900-1988，沙流川上流域のペナコリ出身) がアイヌ語で語った物語10編 (ウエペケㇾ (散文説話) 8編，カムイユカㇻ (神謡) 2編) 約3時間分の音声に，日本語と英語による訳とグロスや注解を付けた初めてのアイヌ口承文芸デジタル集成。
C-004972: 「日本の消滅危機言語・方言」データベース
このデータベースは，日本の消滅危機言語・方言の音声を公開するものです。奄美・沖縄のことば，八丈島のことばをはじめとする，日本各地の消滅危機言語・方言の単語の発音や自然談話の発音が収録されています。文字化テキスト，共通語訳もついています。
C-004975: Accented English GlobalPhone
The Accented English part of the GlobalPhone resources contains 63 recording sessions of Bulgarian, Chinese, German, and Indian native speakers reading 37 English sentences each, produced in GlobalPhone-style, i.e. 16kHz PCM encoded audio recordings of utterance-segmented read speech from the newspaper domain.

SHACHI - Language Resource Metadata Database