Language Resource Search - SHACHI: Language Resource Metadata Database

Language resource #: 3330 Results 1551 - 1560 of 2023

C-004184: Vienna-Oxford International Corpus of English 1.1
VOICE is an on-line accessible computer-readable corpus of English spoken by non-native English speakers in different contexts. It comprises 1 million words of transcribed spoken ELF (English as a lingua franca) from professional, educational and leisure domains and various speech event types. The xml versions of VOICE is also available.
- hasFormat: C-004185: Vienna-Oxford International Corpus of English (version 1.1 XML)
C-004185: Vienna-Oxford International Corpus of English (version 1.1 XML)
This is the xml version of VOICE Online, the computer-readable corpus of English spoken by non-native English speakers in different contexts. It comprises 1 million words of transcribed spoken ELF (English as a lingua franca) from professional, educational and leisure domains and various speech event types.
- hasFormat: C-004184: Vienna-Oxford International Corpus of English 1.1
C-004186: Yahoo-based Contrastive Corpus of Questions and Answers
YCCQA is a contrastive corpus of English, French, German and Spanish, containing question-answer interactions between internet users, produced under almost identical circumstances. The corpus uses questions and answers submitted by users of the Yahoo Answers website. The languages and styles in the corpus illustrating the casual writing style of internet postings.
C-004187: The Penn-Helsinki Parsed Corpus of Modern British English
The PPCMBE corpus is part of an ongoing larger project at the University of Pennsylvania and the University of York to produce syntactically annotated corpora for all stages of the history of English, consisting of around one million words.
- hasVersion: C-004188: The Penn-Helsinki Parsed Corpus of Middle English, second edition
- hasVersion: C-004189: The Penn-Helsinki Parsed Corpus of Early Modern English
- isPartOf: C-004190: The Penn Corpora of Historical English
C-004188: The Penn-Helsinki Parsed Corpus of Middle English, second edition
The PPCME2 corpus is part of an ongoing larger project at the University of Pennsylvania and the University of York to produce syntactically annotated corpora for all stages of the history of English. The PPCME2 text samples are based largely on the Middle English section of the Diachronic Part of the Helsinki Corpus of English Texts (available from ICAME), with certain additions and deletions.
- hasVersion: C-004187: The Penn-Helsinki Parsed Corpus of Modern British English
- hasVersion: C-004189: The Penn-Helsinki Parsed Corpus of Early Modern English
- isPartOf: C-004190: The Penn Corpora of Historical English
- references: C-000811: The Helsinki Corpus of English Texts: Diachronic Part
C-004189: The Penn-Helsinki Parsed Corpus of Early Modern English
The PPCEME corpus is part of an ongoing larger project at the University of Pennsylvania and the University of York to produce syntactically annotated corpora for all stages of the history of English, consisting of over 1.7 million words.
- hasVersion: C-004188: The Penn-Helsinki Parsed Corpus of Middle English, second edition
- hasVersion: C-004187: The Penn-Helsinki Parsed Corpus of Modern British English
- isPartOf: C-004190: The Penn Corpora of Historical English
- references: C-000811: The Helsinki Corpus of English Texts: Diachronic Part
C-004190: The Penn Corpora of Historical English
The Penn Corpora of Historical English is a collection of three subcorpora; the Penn-Helsinki Parsed Corpus of Middle English, second edition (PPCME2), the Penn-Helsinki Parsed Corpus of Early Modern English (PPCEME), and the Penn Parsed Corpus of Modern British English (PPCMBE). It is a project at the University of Pennsylvania and the University of York to produce syntactically annotated corpora for all stages of the history of English. All of the annotation has been carefully checked by expert human annotators for accuracy and consistency.
C-004191: The Small Corpus of Political Speeches
Developed as part of Corpus Methodology courses at the Unit of English at the Department of Modern Languages, University of Helsinki, The corpus includes full-length speeches delivered by elected politicians and other civic leaders. Useful for the study of speech structure, the use of rhetorical devices, and the grammatical features of texts of the written-to-be-spoken type. All texts in the corpus were downloaded from online speech repositories, with the name of the source repository retained inside mark-up.
C-004192: Corpus of Contemporary American English
The COCA corpus is the largest freely-available corpus of American English, containing more than 450 million words of text and is equally divided among spoken, fiction, popular magazines, newspapers, and academic texts. The users of COCA can search for exact words or phrases, wildcards, lemmas, part of speech, or any combinations of these. Surrounding words (collocates) within a ten-word window can also be searched.
- hasVersion: C-003498: TIME CORPUS
- hasVersion: C-004193: Corpus of Historical American English
- hasVersion: C-004194: Corpus of American Soap Operas
- hasVersion: C-003501: Corpus del Español
- hasVersion: Corpus do Português
C-004193: Corpus of Historical American English
The COHA corpus is the largest structured corpus of historical American English, containing more than 400 million words of text of American English from 1810 to 2009. The users can see how words, phrases and grammatical constructions have increased or decreased in frequency, how words have changed meaning over time, and how stylistic changes have taken place in the language.
- hasVersion: C-003498: TIME CORPUS
- hasVersion: C-004192: Corpus of Contemporary American English
- hasVersion: C-004194: Corpus of American Soap Operas
- hasVersion: C-003501: Corpus del Español
- hasVersion: Corpus do Português

SHACHI - Language Resource Metadata Database