1. Corpus
Corpus is a collection of texts that has been assembled for the purposes of language study. Modern corpora are stored electronically and consist of many millions of words of texts, both written and spoken. They range from academic texts through newspaper articles to casual conversation and include American, British, and Australian.
코퍼스(corpus, 복수형은 corpora)는 글(written) 또는 말 (spoken) 텍스트를 모아 놓은 것. 우리말로 ‘말뭉치’ ‘말모둠’으로 번역. 온라인상 또는 앱을 다운로드해놓고, 컴퓨터에서 처리할 수 있는 형태의, 전자화된 텍스트. written, spoken, academic texts, 신문기사, 인터뷰, 연설문, 평상시의 대화 등 다양한 언어자료 포함.
(1) a collection of written or spoken material stored on a computer and used to find out how language is used:
(2) A collection of authentic language text
2. Concordance
Corpus information is typically presented in the form of “concordances”. A concordance displays the results of word research as individual lines of text with the targeted word or words aligned in the center.
용어색인, 용어목록, 용례목록
단어가 쓰인 문맥(단어의 앞 뒤)을 보여주는 색인
(1) a book or document that is an alphabeticallist of the words used in a book or a writer's work, with information about where the words can be found and in which sentences; An alphabetical index of the principal words in a book or the works of an author with their immediate contexts and an account of the meaning (옛날 원래 의미)
(2) an index produced by computer or machine, alphabetically listing every word in a text.
A concordance is a list of the words in a text or group of texts, with information about where in the text each word occurs and how often it occurs. The sentences each word occurs in are often given.
(3) Concordances are frequently used in linguistics, when studying a text.
For example:
- comparing different usages of the same word
- analysing keywords
- analysing word frequencies
- finding and analysing phrases and idioms
- creating indexes and word lists (also useful for publishing)
3. Concordancing
Concordancing techniques are widely used in national text corpora such as American National Corpus, British National Corpus, and Corpus of Contemporary American English available on-line.
4. Concordancer
Concordancing을 해주는 프로그램.
A concordancer is a computer program that automatically constructs a concordance; A concordancer is a piece of software, either installed on a computer or accessed through a website, which can be used to search, access and analyse language from a corpus.
==========
For example, here is a concordance for ‘wield power’.
<보충>
1. What is a concordancer?
A concordancer is a piece of software, either installed on a computer or accessed through a website, which can be used to search, access and analyse language from a corpus. They can be particularly useful in exploring the relationships between words and can give us very accurate information about the way language is authentically used.
A concordancer is a computer program that automatically constructs a concordance.
Concordancers are also used in corpus linguistics to retrieve alphabetically or otherwise sorted lists of linguistic data from the corpus in question, which the corpus linguist then analyzes.
A number of concordancers have been published notably Oxford Concordance Program (OCP), a concordancer first released in 1981 by Oxford University Computing Services claims to be used in over 200 organisations worldwide.
2. What is a concordance?
A concordance is a list of the words in a text or group of texts, with information about where in the text each word occurs and how often it occurs. The sentences each word occurs in are often given.
3. 유명한 corpora websites
(1) The Corpus of Contemporary American English (COCA) is a 1.1 billion word corpus of American English and is one of the most widely used corpora used.
(2) The British National Corpus (BNC).
첫댓글 Speaking part를 공부 중에 'corpus-based data on spoken language = The growth of spoken corpora 라고 제가 적어놨더라구요. (tbp발췌)corpus는 단수 corpora는 복수인건 알겠는데 와닿지 않아서 혹시! 하고 검색해보니 역시나! 있네요! 자료들을 보다 보면 정말 최곱니다. 문제는 제가 시험전까지 교수님의 알찬 자료를 머리에 다 완전히 넣을 수 있을지; 그게 의문입니다. 하하하;)
주옥같은 이 자료들! 시험전에 다보고 갈 수 있기를 바라며:) 정말 감사합니다.
네 정말 좋은 자료들 많지요. 대학 강의를 오래해왔기 때문에 임용기출의 범위이면서 기본 개념이지만 중요한 개념들이 뭔지를 누구보다 잘 알고 있지요~ 차분히 궁금한 내용부터 틈틈히 보세요.
교수님~그럼 어떤 단어의 쓰임이 궁금하면 corpus에 들어가서 concordancer를 이용해 concordancing을 해서 concordance를 찾으면 되는 건가요?
네, 어떤 단어를 concordancing하고 싶을 때, corpora websites인 (1) The Corpus of Contemporary American English (COCA)이나 (2) The British National Corpus (BNC), 이런 웹사이트에 들어가면 이미 concordancer프로그램이 깔려있어서, search에 원하는 단어 검색하면 특정단어와 관련된 여러가지 정보를 보여주는데, 이런 것들이 concordance겠지요.
아님 아예 예를 들면, Antconk같은 앱을 깔면, 이 안에 corpus도 갖고있고, concordancer tool을 제공하기 때문에 쉽게 concordancing할 수도 있습니다.
@Dio Jin 교수 감사합니다~
@올리씨 Sure~