Table 1. Composition of 20 million word Modern Spanish corpus used in the
Routledge Frequency Dictionary of Spanish
 

 

# words
(million W)

Spain

# words
(million W)

Latin America

Spoken

1.00

España Oral1

2.00

Habla Culta (ten countries)

 

0.35

Habla Culta (Madrid, Sevilla)

 

 

   3.35

1.35

 

2.00

 

Transcripts/
Plays

1.00

Transcripts/Interviews (congresses, press conferences, other)

1.00

Transcripts/Interviews (congresses, press conferences, other)

 

0.27

Interviews in the newspaper ABC

 

 

 

0.40

Plays

0.73

Plays

   3.40

1.67

 

1.73

 

Literature

0.06

Novels (BV2)

1.60

Novels (BV2)

 

0.00

Short stories (BV2)

0.87

Short stories (BV2)

 

0.19

Three novels (BYU3)

1.11

Twelve novels (BYU3)

 

2.17

Mostly novels, from LEXESP4

0.18

Four novels from Argentina5

 

 

 

0.20

Three novels from Chile6

    6.38

2.42

 

3.96

 

Texts

1.05

Newspaper ABC

3.00

Newspapers from six different countries

 

0.15

Essays in LEXESP4

0.07

Cartas (“letters”) from Argentina5

 

2.00

Encarta encyclopedia

0.30

Humanistic texts (e.g. philosophy, history from Argentina5)

 

 

 

0.30

Humanistic texts (e.g. philosophy, history from Chile6)

    6.87

3.20

 

3.67

 

Total

8.64

 

11.36

 

Sources:

1. Corpus oral de referencia de la lengua española contemporánea (http://elvira.lllf.uam.es/docs_es/corpus/ corpus.html)
2. The Biblioteca Virtual (http://www.cervantesvirtual.com)
3. Fifteen recent novels, acquired in electronic form from the Humanities Research Center, Brigham Young University
4. Léxico informatizado del español (http://www.edicionsub.com/coleccion.asp ?coleccion=90)
5. From the Corpus lingüístico de referencia de la lengua española en argentina (http://www.lllf.uam.es/
~fmarcos/informes/corpus/coarginl.html)
6. From the Corpus lingüístico de referencia de la lengua española en chile (http://www.lllf.uam.es/~fmarcos/informes/corpus/cochile.html)