Registered language resources: 3330
Showing items 1051-1060 of 2023
-
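C-003260: RIKEN Word Processor Operation Dialogue Speech Corpus (理研ワープロ操作対話音声コーパス)
Contains two types of simulated dialogue. (1) Document creation request dialogues: a word processor expert creates a document on a computer while listening to the user's requests (user ⇔ secretary ⇔ operator ⇔ expert dialogue), and then explains their own work while watching a video recording of the document creation screen (expert monologue). (2) Question-answering dialogues: the user creates a document on their own while asking the expert how to operate the word processor (user ⇔ expert dialogue). Transcripts with morphological tags are included.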
-
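C-003261: Acoustical Society of Japan Newspaper Article Read Speech Corpus (日本音響学会新聞記事読み上げ音声コーパス)
A speech corpus of read Mainichi Shimbun newspaper article sentences, intended for research on Japanese large-vocabulary continuous speech recognition. It consists of DVDs containing recordings of Mainichi Shimbun articles and the ATR 503 phonetically balanced sentences read aloud by 306 speakers (153 male and 153 female), together with the corresponding text. All utterances are in Japanese.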
- isReferencedBy: C-003262: S-JNAS : Large-scale Database for Speech Recognition of Elderly People
- references: ATR 503 phonetically balanced sentences (ATR音素バランス503文)
- references: C-001601: CD-Mainichi Shimbun Data Collection
- references: C-001602: CD-ROM Mainichi Shimbun '92 Data Collection
- references: C-001599: CD-Mainichi Shimbun '93 Data Collection
- references: C-001603: CD-ROM Mainichi Shimbun '94 Data Collection
-
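C-003262: Elderly Read Speech Corpus of Newspaper Articles (新聞記事読み上げ高齢者音声コーパス)
A speech corpus of newspaper article sentences and phonetically balanced sentences read aloud by elderly speakers, intended for research on Japanese large-vocabulary continuous speech recognition. The speakers read Mainichi Shimbun articles, ATR phonetically balanced sentences, and task sentences for information retrieval (recipes, medical consultation, etc.). All utterances are in Japanese.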
-
C-003263: ATR Chinese Hotel Reservation Dialogue
The ATR Chinese Hotel Reservation Dialogue is the Chinese part of a parallel speech corpus built in a multilingual project that collected hotel reservation dialogues in Japanese, English, and Chinese.
-
C-003264: Singapore Primary School Chinese Language Text
12 volumes of manually tagged (lemma, POS, syntax) Chinese language text used in Singapore schools, covering levels from Primary 1 to Primary 6.
-
C-003265: CSTSC-Flight Corpus
CSTSC-Flight is a domain-specific corpus of real-life telephone conversations in the flight enquiry and reservation domain. All the dialogues are fully spontaneous, since the customers were not aware of being monitored and recorded. Transcripts are included for only about half of the speech data.
-
C-003266: CUCorpora
CUCorpora is a large-scale collection of Cantonese spoken-language corpora, made up of several sub-corpora designed for different application domains. The sub-corpora are: (1) 1800 Cantonese syllables with pitch-marking (CUSYL), (2) multi-syllable short phrases covering the most common Cantonese syllables (CUWORD), (3) phonetically rich read Cantonese sentences (CUSENT), (4) Cantonese digit strings ranging in length from a single digit to 16 digits (CUDIGIT), and (5) Cantonese command words simulating a navigation control scenario (CUCMD). All sub-corpora are manually transcribed.
- hasPart: C-003267: CUSYL (Version 1.0)
- hasPart: C-003268: CUWORD (Version 1.0)
- hasPart: C-003269: CUSENT (Version 1.0)
- hasPart: C-003271: CUDIGIT (Version 1.0)
- hasPart: C-003272: CUCMD (Version 1.0)
- hasVersion: C-003279: CUCall
-
C-003267: CUSYL (Version 1.0)
CUSYL is a part of CUCorpora, a large-scale collection of Cantonese spoken-language corpora. It is a collection of 1800 Cantonese syllables with pitch-marking, covering all valid syllables as well as common lazy and colloquial pronunciations. The corpus also includes manual transcripts. The package comes with CUWORD (Version 1.0).
- isPartOf: C-003266: CUCorpora
- hasVersion: C-003268: CUWORD (Version 1.0)
- hasVersion: C-003269: CUSENT (Version 1.0)
- hasVersion: C-003271: CUDIGIT (Version 1.0)
- hasVersion: C-003272: CUCMD (Version 1.0)
- hasVersion: C-003274: CUCall Cantonese Words (Version 1.0)
-
C-003268: CUWORD (Version 1.0)
CUWORD is a part of CUCorpora, a large-scale collection of Cantonese spoken-language corpora. It is a collection of 2500 multi-syllable short phrases covering the most common Cantonese syllables. The corpus also includes a manually verified phonemic transcription for each utterance. The package comes with CUSYL (Version 1.0).
- isPartOf: C-003266: CUCorpora
- hasVersion: C-003267: CUSYL (Version 1.0)
- hasVersion: C-003269: CUSENT (Version 1.0)
- hasVersion: C-003271: CUDIGIT (Version 1.0)
- hasVersion: C-003272: CUCMD (Version 1.0)
- hasVersion: C-003274: CUCall Cantonese Words (Version 1.0)
-
C-003269: CUSENT (Version 1.0)
CUSENT is a part of CUCorpora, a large-scale collection of Cantonese spoken-language corpora. It is a large collection of spoken Cantonese sentences designed to be phonetically rich. The corpus also includes manually verified phonemic transcriptions.
- isPartOf: C-003266: CUCorpora
- hasVersion: C-003267: CUSYL (Version 1.0)
- hasVersion: C-003268: CUWORD (Version 1.0)
- hasVersion: C-003271: CUDIGIT (Version 1.0)
- hasVersion: C-003272: CUCMD (Version 1.0)
- hasVersion: C-003273: CUCall Cantonese Sentences (Version 1.0)
- isReferencedBy: C-003273: CUCall Cantonese Sentences (Version 1.0)