Final exam
Winter 2008

Please limit each of the following answers to one paragraph each. PLEASE, PLEASE, please...


  1. In one sentence each, discuss which corpus (or other electronic source) of English you would use for each of the following tasks, and why:
    1. historical changes from the 1300s-1600s
    2. changes from the 1960s-1990s
    3. collocations for a given word
    4. data from learners of English
    5. the evolution of the meaning of a given word over the past 400 years
  2. Imagine that someone has asked you to create a 10 million word corpus of a language of your choice, or a particular aspect of English.  What are 5-6 design criteria that might be important, and how would you address each of these?
  3. What are 5-6 of the most important principles governing the use of concordances?
  4. When would each of the following be most helpful:
    1. WordCruncher (the version we’ve been using)
    2. WordSmith
    3. relational databases
  5. What are the uses of each of the following:
    1. proportions
    2. chi-square
    3. z-scores
    4. factor analysis
  6. Briefly discuss Biber’s “multi-dimensional” approach to register variation
  7. What are 5-6 of the major challenges and issues facing the designers or historical corpora and/or challenges in using these corpora to investigate language change
  8. What are the major uses of parallel corpora?
  9. How can corpora most effectively be used in the classroom?
  10. What are some general principles governing the use of data from the Web?

  1. Comment on the idea that “corpora have made language analysis more simple, as well as more complex”
  2. Give some examples where corpora provide (even native speakers) with insights that otherwise might not be available