MULTILINGUAL CORPORA

Based on the following, but no reading due before class

Lawson, Ann (2001) “Collecting, aligning, and analyzing parallel corpora”. In Small Corpus Studies and ELT: Theory and practice, ed. Mohsen Ghadessy, et al. John Benjamins. 279-310.

Borin, Lars (2002) “…and never the twain shall meet?” In Parallel corpora, parallel worlds, ed. Lars Borin. Rodopi. 1-25.


http://opus.lingfil.uu.se/ (Open Subtitles)

Write 4-5 sentence overview of Open Subtitles, how searches went, etc. Print it out; bring it to class, and this will be the quiz for Tuesday.



(Note: some questions and page numbers below refer just to the Lawson article)

Intro: challenge of translation

1. Why are corpora so useful for translation? (from overview on p284)

2. Are large corpora always better than small corpora (for translation studies)? Give some examples.

3. What are "comparable" corpora?  What are their strengths and weaknesses?

4. What are "parallel" corpora?  Give one example of each of the three types of parallel corpora at the bottom of p284.

5. Summarize the benefits of parallel corpora, discussed on p286

6. How can (parallel) corpora be used effectively for language teaching?

7. What are the challenges in using (parallel) corpora for language teaching?

8. What are some tools that can be used to analyze parallel corpora?

9. Discuss the role of the following in the creation of parallel corpora: design, collection of texts, copyright, encoding, annotation (most of this reviews material we discussed earlier this semester, when we talked about corpus design in general)

10. What are some of the approaches to parallel corpus alignment?


TRANSLATION PARALLEL CORPORA