Language resource #: 3330
Results 1321 - 1330 of 2023
-
N-003787: Hanzi Normative Glyphs
The orthographic standard differs according to periods and geographical regions. This database selects the manuscripts of Buddhist and historical texts and records various examples of Chinese glyphs.
-
C-003788: Middle Korean Morpheme Database
The database collects the usages of Chinese readings and morphemes based on the analysis of the documents such as Rime dictionaries or Phonetic Glosses of Dharani between 15-18 centuries in Korea. The image samples from the original texts are partially available.
-
C-003790: Large scale blog corpus
Among 5,300,000 blogs at 28 domestic companies of blog business, 600,000,000 articles have been retrieved since January, 2007.
-
C-003793: Japanese-English patent parallel corpus
Large-scale parallel corpus of Japanese-English corresponding data from American patent collection and Japanese patent collection.
-
C-003796: The database of a hundred names of the places
Sound database of 100 names of places. 12 males each pronounced the words twice.
-
C-003798: Chinese-Japanese Bilingual Corpus
Novels, essays, biographies, political criticisms / white paper, law-affiliated documents / treaty documents, poetry, are collected and made into Chinese-Japanese Bilingual Corpus.
-
C-003803: Hypermedia Corpus of Spoken Japanese
It is bundled with digital video and audio data (movies) as well as full texts. The unique features of our hypermedia corpus will be as follows:
1. the intonation, pause and some other information can be represented in their original forms by digital sounds, thereby making it unnecessary to assign specific linguistic notations or 'esoteric' symbols
2. non-verbal information is available in the form of multi-angle digital movies, providing the speaker/listener's facial expressions, noddings, and so on
3. any movie and sound data is randomly accessible by respective code number and/or one or more key words in addition to the traditional text search thanks to new digital video-on-demand (VOD) technology and hypertext description languages such as HTML/XML.
The feature 1 will enable novice users of language databases, especially language teachers, to use them as their teaching materials since they are not required to learn anything about linguistic notations and other complicated assumptions. The features 2 and 3 will extend the range of conversation analysis to deal with extra-linguistic factors like gestures and even atmospheric conditions that have virtually been excluded from the subjects of studies. -
C-003804: 羅生門
-
C-003805: The Small Catechism of Martin Luther
-
C-003806: 奥の細道