言語資源の登録件数: 3330件
2023 件中 1421 - 1430 件目
-
C-004012: List of about 300 Pairs of Morphologically-related Wa Words
List of about 300 Pairs of Morphologically-related Wa Words, with Chinese glosses.
- references: D-004009: Wa Dictionary Database
-
C-004013: Samples of spoken Wa
MP3 Audio materials of the Tale of the Two Kings, Friends Forever, The Peaceable Kingdom (Haktiex Yien yawk), "Pet La Pa Ang Kwe Rhawm Keut" (The Mindless Rabbit), Dialogues 25-38 from Wayu huihua keben 佤语会话课本), and New bilingual Chinese-Wa primary-school language text Lāi Loux (2003)
-
C-004014: Chinese Web 5-gram Corpus
IntroductionThis data set contains Chinese word n-grams and their observed frequency counts. The length of the n-grams ranges from unigrams (single words) to five-grams. http://www.chineseldc.org/EN/doc/CLDC-LAC-2008-001/intro.htm
- isPartOf: C-003107: Web English N-gram Data
-
C-004016: 863 program in 2007 SSMT machine translation evaluation data
SSMT2007 statistics from the third seminar on machine translation machine translation evaluation.
SSMT2007 include Chinese-English, English-Chinese translation of the two directions of machine testing corpus, the chapter types, from the information field. SSMT2007 Chinese and English words with the direction of alignment test corpus, to provide after-word Chinese-English sentence right, from the field of information.In addition, the measure contains the outline report on the results of evaluation and assessment software. -
C-004017: The contemporary chinese general balanced corpus of National Language Committee(Segmentation lexicon)
Chinese language teaching and research, information processing, etc.
-
C-004018: The contemporary chinese general balanced corpus of National Language Committee(Syntactic Treebank)
Chinese language teaching and research, information processing, etc.
- hasVersion: C-004017: The contemporary chinese general balanced corpus of National Language Committee(Segmentation lexicon)
- hasVersion: C-004019: The contemporary chinese general balanced corpus of National Language Committee(Segmentation and part-of-speech annotated)
- hasVersion: C-004020: The contemporary chinese general balanced corpus of National Language Committee(Raw)
-
C-004019: The contemporary chinese general balanced corpus of National Language Committee(Segmentation and part-of-speech annotated)
Chinese language teaching and research, information processing, etc.
- hasVersion: C-004017: The contemporary chinese general balanced corpus of National Language Committee(Segmentation lexicon)
- hasVersion: C-004018: The contemporary chinese general balanced corpus of National Language Committee(Syntactic Treebank)
- hasVersion: C-004020: The contemporary chinese general balanced corpus of National Language Committee(Raw)
-
C-004020: The contemporary chinese general balanced corpus of National Language Committee(Raw)
The contemporary chinese general balanced corpus of National Language Committee(Raw). Chinese language teaching and research, information processing, etc.
- hasVersion: C-004017: The contemporary chinese general balanced corpus of National Language Committee(Segmentation lexicon)
- hasVersion: C-004018: The contemporary chinese general balanced corpus of National Language Committee(Syntactic Treebank)
- hasVersion: C-004019: The contemporary chinese general balanced corpus of National Language Committee(Segmentation and part-of-speech annotated)
-
C-004021: Chinese-English/Chinese-Japanese parallel corpora
The main objective of this project were to build Chinese-English / Chinese-Japanese parallel corpora and to provide fundamental language resources and evaluation corpora for research on Machine Translation and other language information processing technologies. So far, 200000 Chinese-English, 20000 Chinese-Japanese sentence-aligned, and 10000 Chinese-English lexically aligned and verified sentence pairs have been achieved.
-
C-004023: Audio recordings and streams
A collection of audio recordings. Suttas in English, Pali chanting, and Dhamma talks.