STRUCTURED CORPORA

Main idea: Use the British National Corpus to see what types of data one can get from a corpus

  1. What are the most common verbs ([VVI]) overall in the corpus?
  2. What are the five adjectives ([AJ0]) with highest frequency in SERMONS (S_sermon) vs. SPOKEN in general) (note: check MIN FREQ and set to 1 in both registers)
  3. In which register (spoken, fiction, newspaper, academic) is each of the following words most common (note: use charts): glistening, whom, yesterday, think
  4. What are the five most common adjectives within five words to the left or right of cliff? (note: use "Surrounding Words", select ADJ.ALL and 5 LEFT and 5 RIGHT)

  5. In which of the four macro registers (SPOKEN, FICTION, NEWS, ACADEMIC) is each of the following most common: passives [VB*] [VVN], progressives [VB*] [VVG], and perfects [VH*] [VVN].