Mark Davies
Professor, Corpus Linguistics
Brigham Young University

Back to Home Page

EDUCATION


PUBLICATIONS


Click on year in parentheses to download the PDF file (for 2004 and earlier). The password to open the files is: byu_ling
  1. (Forthcoming, 2008) “Relational databases as a robust architecture for the analysis of word frequency”. In AHRC ICT Methods Network: Expert Seminar on Linguistics: Word Frequency and Keyword Extraction, ed. Dawn Archer. Ashgate.

  2. (Forthcoming, 2008) "Research on historical pragmatics with Biblia Medieval (an aligned parallel corpus of medieval Spanish). In Claus Pusch, et al (eds), Romance Corpus Linguistics III: Corpora and Pragmatics. Guntar Naar. (Co-authored with Andrés Enrique-Arias).

  3. (2008) "Spanish and Portuguese Corpus Linguistics". Studies in Hispanic and Lusophone Linguistics. 1:149-86.

  4. (2007) A Frequency Dictionary of Modern Portuguese: Core Vocabulary for Learners.  Routledge. (Co-authored with Ana Maria Raposo Preto-Bay)

  5. (2007)Pointing Out Frequent Phrasal Verbs: A Corpus-Based Analysis”. TESOL Quarterly 41:339-59. (Co-authored with Dee Gardner)

  6. (2007) "Semantically-based queries with a joint BNC/WordNet database". In Corpus Linguistics twenty-five years on, ed. Roberta Facchinetti. Amsterdam: Rodopi. 149-167.

  7. (2006) "Towards the first comprehensive survey of register variation in Spanish".  In Corpus Linguistics Beyond the Word: Corpus Research from Phrase to Discourse, ed. Eileen Fitzpatrick. Rodopi. 73-86.

  8. (2006) “Vocabulary Coverage in Spanish Textbooks: How Representative is It?” In Selected Proceedings from the Conference on the Acquisition of Spanish and Portuguese as First and Second Languages, ed. Jacqueline Toribio. Cascadilla. 132-43. (Co-authored with Timothy L. Face). 132-43.

  9. (2006) "Spoken and written register variation in Spanish: A Multi-dimensional Analysis." Corpora 1:1-37. (Co-authored with Doug Biber, James Jones, and Nicole Tracy-Ventura).

  10. (2005) A Frequency Dictionary of Modern Spanish: Core Vocabulary for Learners.  Routledge.

  11. (2005) "The advantage of using relational databases for large corpora: speed, advanced queries, and unlimited annotation".  International Journal of Corpus Linguistics 10: 301-28.

  12. (2005) "On diachronic shifts with Spanish se: preliminary evidence from large electronic corpora." In Claus Pusch, et al (eds), Romance Corpus Linguistics II: Corpora and Diachronic Linguistics. Guntar Naar. 431-42.

  13. (2005) "Vocabulary Range and Text Coverage: Insights from the Forthcoming Routledge Frequency Dictionary of Spanish". In David Eddington, (ed), Selected Proceedings from the 7th Hispanic Linguistics Symposium. 106-15.

  14. (2005) "Advanced research on syntactic and semantic change with the Corpus del Español". In Claus Pusch, et al (eds), Romance Corpus Linguistics II: Corpora and Diachronic Linguistics. Guntar Naar. 203-14. Reprinted in: Teubert, Wolfgang & Ramesh Krishnamurthy (eds.). 2007. Corpus Linguistics. Critical Concepts in Linguistics (6 vols.). London: Routledge. 337-48 (Volume 5).

  15. (2004) El uso del Corpus del Español y otros corpus para investigar la variación actual y los cambios  históricos. Tokyo: Univ. Sophia.

  16. (2004) Review of Léxico Hispanoamericano (Peter Boyd-Bowman, et al). La Coronica: A Journal of Medieval Spanish Literature and Language 33:259-64.

  17. (2004) "Student use of large, annotated corpora to analyze syntactic variation". In Guy Aston, et al (eds). Corpora and Language Learners. Philadelphia: John Benjamins. 259-69.

  18. (2004) Review of "Computer Learner Corpora, Second Language Acquisition and Foreign Language Teaching (Sylvaine Granger, et al). Modern Language Journal.

  19. (2004) "Student use of large corpora to investigate language change". In Thomas Upton, et al (eds).  Applied Corpus Linguistics: A Multidimensional Perspective. Amsterdam: Rodopi. 207-22.

  20. (2003)  "Diachronic Shifts and Register Variation with the "Lexical Subject of Infinitive" Construction. (Para yo hacerlo)". In Silvina Montrul and Francisco Ordóñez, Linguistic Theory and Language Development in Hispanic Languages. Somerville, MA: Cascadilla Press. 13-29. 

  21. (2003)  "Annotation without lexicons: an alternative to the standard bootstrapping approach".  In Paul Rayson, et al. Proceedings from Corpus Linguistics 2003 (Lancaster, England, March 2003). 174-83.

  22. (2002) "Un corpus anotado de 100.000.000 palabras del español histórico y moderno". SEPLN 2002 (Sociedad Española para el Procesamiento del Lenguaje Natural). (Valladolid).  21-27.

  23. (2002) "'Esto es ligero de fazer: Object to Subject Raising in Medieval and Early Modern Spanish".  In James F. Lee, et al, Structure, Meaning, and Acquisition of Spanish. Somerville, MA: Cascadilla Press.  19-31.

  24. (2001) “Review of Construcciones causativas en el español medieval by Milagros Alfonso Vega”. Revista Canadiense de Estudios Hispánicos 25:329-30.

  25. (2001) "Creating and using multi-million word corpora from web-based newspapers".  In Corpus Linguistics in North America, eds. Rita C. Simpson and John M. Swales. Ann Arbor: U Michigan P.  58-75.

  26. (2000)  "Using multi-million word corpora of historical and dialectal Spanish texts to teach advanced courses in Spanish linguistics".  In Rethinking Language Pedagogy from a Corpus Perspective, eds. Lou Burnard and Tony McEnery.  Frankfurt am Main; New York: P. Lang. 173-85.

  27. (2000"Syntactic Diffusion in Spanish and Portuguese Infinitival Complements”.  In New Approaches to Old Problems: Issues in Romance Historical Linguistics, eds.Steven Dworkin and Dieter Wanner.  Amsterdam; Philadelphia: John Benjamins. 109-27.

  28. (1999)  "The Historical Development of Subject Raising in Portuguese: A Corpus-Based Approach". Neuphilologische Mitteilungen 100:95-110.

  29. (1999)  "A Computer Corpus-Based Study of Subject Raising in Modern Portuguese". Lingvisticae Investigationes 21:379-400.

  30. (1998)  "The Evolution of Spanish Clitic Climbing: A Corpus-Based Approach." Studia Neophilologica 69:251-63.

  31. (1997)  "A Corpus-Based Approach to Diachronic Clitic Climbing in Portuguese." Hispanic Journal 17: 93-111.

  32. (1997) "Using Large Computer-Based Corpora as a Philological Tool: An Analysis of Four Medieval Spanish Bibles." Dactylus 16: 70-92.

  33. (1997)  "The History of Subject Raising in Spanish". Bulletin of Hispanic Studies (Liverpool) 74: 399-411.

  34. (1997)  "A Corpus-Based Analysis of Subject Raising in Modern Spanish." Hispanic Linguistics 9: 33-63.

  35. (1996)  "The Diachronic Interplay of Finite and Nonfinite Verbal Complements in Spanish and Portuguese." Bulletin of Hispanic Studies (Glasgow) 73:137-58.

  36. (1996) "The Diachronic Evolution of the Causative Construction in Portuguese." Journal of Hispanic Philology 17:261-92.

  37. (1995)  "The Evolution of Causative Constructions in Spanish and Portuguese." In Current Research in Romance Linguistics, ed. John Amastae, et al. Philadelphia: John Benjamins, 1995. 105-122.

  38. (1995)  "Omnipage and WordCruncher: Tools for Creating and Searching Digitalized Text Corpora." La Corónica 23:111-115.

  39. (1995)  "The Evolution of the Spanish Causative Construction." Hispanic Review 63:57-77.

  40. (1995) "Analyzing Syntactic Variation with Computer-Based Corpora: The Case of Modern Spanish Clitic Climbing". Hispania 78:370-380.

  41. (1994)  "Parameters, Passives, and Parsing: Explaining Diachronic Shifts in Spanish and Portuguese". In Variation and Linguistic Theory, ed. K. Beals, et al. Chicago: CLS. Vol 2. 46-60.

  42. (1992)  "A Tentative Bibliography of Historical Spanish Syntax." Hispanic Linguistics 5:279- 351.

CONFERENCE PRESENTATIONS


  1. (2007) "Investigating Recent Linguistic Shifts with a New 100+ Million Word Corpus of American English from the 1900s. SHEL 5 (Studies in the History of the English Language". Univ. Georgia.

  2. (2007) "From the Corpus del español to the Corpus do portugues (and back again): Evolving architectures for historical corpora". Colloquium on Ibero-Romance Historical Corpora. Univ. Balearic Islands, Spain. (Invited speaker)

  3. (2007) "A 100+ Million Word Corpus of American Magazines, 1900-1999".  ICAME (International Computer Archive of Medieval and Modern English). Stratford-upon-Avon.

  4. (2006) "A new, 37 million word, Web-based corpus of historical English". ICAME (International Computer Archive of Medieval and Modern English). Univ. of Helsinki.

  5. (2006) "Competing architectures for large historical corpora".  Workshop on Historical Text Mining (AHRC ICT Methods workshop). Univ. of Lancaster (England).

  6. (2006) "Towards a 250 Million Word Corpus of Historical English". Bringing Text Alive: The Future of Scholarship, Pedagogy, and Electronic Publication (Text Creation Partnership). U Michigan.

  7. (2006) "Resolving Trade Name Legal Disputes through Corpus Research". American Association of Applied Corpus Linguistics. Northern Arizona U.

  8. (2006) "Incorporating “meaning-based” searches into corpus architectures and interfaces". American Association of Applied Corpus Linguistics. Northern Arizona U.

  9. (2006) "Size, speed, and annotation with historical corpora". Digital Historical Corpora - Architecture, Annotation, and Retrieval. Dagstuhl Int'l Conference and Research Center for Computer Science (#06491). Dec 2006.

  10. (2005) “Vocabulary Coverage in Spanish Textbooks: How Representative is It?” Conference on the Acquisition of Spanish and Portuguese as First and Second Languages. Penn State Univ.

  11. (2005) “Relational databases as a robust architecture for the analysis of word frequency”. AHRC ICT Methods Network: Expert Seminar on Linguistics: Word Frequency and Keyword Extraction. Univ. of Lancaster, England. (Invited speaker)

  12. (2005) “A new interface for examining use of synonyms (and other related words ) in the BNC.” Corpus Linguistics 2005. Univ. of Birmingham, England.

  13. (2004) "The frequency and distribution of [se] constructions in Spanish: a corpus and learner-based approach."  7th Conference on the Acquisition of Spanish and Portuguese as First and Second Languages. Univ. of Minnesota. (With Timothy L. Face)

  14. (2004) "Incorporating register variation into BNC queries: a relational database approach."  Sixth International Conference on Teaching and Language Corpora. Granada, Spain.

  15. (2004) "El uso del Corpus del Español y otros corpus para investigar la variación actual y los cambios históricos."  Series of workshops presented at Sophia University (Tokyo, Japan).

  16. (2004) "Creating and Using Corpora to Investigate Language Change and Variation."  Department of Linguistics, Sophia University (Tokyo, Japan).

  17. (2004) "El diseño y uso de los corpus grandes para investigar el cambio lingüístico y la variación actual." Kobe University, Japan.

  18. (2004) "A joint BNC/WordNet database: the best of both worlds". The Fifth North American Symposium on Corpus Linguistics. Montclair State, NJ.

  19. (2004) "A multi-dimensional analysis of register variation in Spanish.". The Fifth North American Symposium on Corpus Linguistics. Montclair State, NJ.

  20. (2004) "A match made in corpus heaven: the BNC and WordNet in relational database form." 25th Conference of the International Computer Archive of Modern and Medieval English. Verona, Italy.

  21. (2004) "The impact of phrasal forms in corpus-based vocabulary studies".  AAAL 2004 (American Association of Applied Linguistics). Portland, OR.

  22. (2003) "How much vocabulary is enough?: Insights from recent corpus-based studies".  6th Conference on the Acquisition of Spanish and Portuguese as First and Second Languages. U New Mexico.

  23. (2003) "A multidimensional analysis of register variation in Spanish".  7th Hispanic Linguistics Symposium. U New Mexico.

  24. (2003) "Advanced research on syntactic and semantic change with the 100 million word, fully-annotated Corpus del Español.".  2nd Freiburg Workshop on Romance Corpus Linguistics. U Freiburg, Germany. September 2003.

  25. (2003) "On the frequency, use, and omission of se: Evidence from the 100 million word Corpus del Español".  2nd Freiburg Workshop on Romance Corpus Linguistics. U Freiburg, Germany. September 2003.

  26. (2003) "Annotation without lexicons".  Corpus Linguistics 2003. Lancaster University, UK. March 2003.

  27. (2003) "Relational n-gram databases as a basis for unlimited annotation on very large corpora". Workshop on the Shallow Processing of Large Corpora.  Lancaster University, UK. March 2003.

  28. (2002) "Using Relational Databases to Create Highly Searchable and Very Large Corpora". The Fourth North American Symposium on Corpus Linguistics. IUPUI, Indianapolis, IN.

  29. (2002) "Student use of a 100 million word, fully annotated corpus of Spanish to model language variation and change". Fifth International Conference on Teaching and Language Corpora. Bertinoro, Italy.

  30. (2002) " Un corpus anotado de 100.000.000 palabras del español histórico y moderno". SEPLN 2002 (Sociedad Española para el Procesamiento del Lenguaje Natural). (Univ. de Valladolid, Spain).

  31. (2002) "Modeling Syntactic Change with the Fully Annotated, 100 Million Word 'Corpus del Español': Suppresion of se with Causative Verbs". Sixth Hispanic Linguistics Symposium. Univ. of Iowa.

  32. (2002) "A Searchable, Fully-Annotated, 100 Million Word Corpus of Historical Spanish Texts”. Kentucky Foreign Language Conference. Univ. of Kentucky.

  33. (2002) "A 100 Million Word Corpus of Historical and Modern Spanish, Searchable by Grammatical Category, Lemma, and Related Words".  XXX Romance Linguistics Symposium. Cambridge, England.

  34. (2002) "How to Make Large Corpora both Fast and Highly Annotated". 5th Annual CLUK (Computational Linguistics in the UK) Research Colloquium. Leeds, England. 

  35. (2001) "Multimillion Word Online Corpora as a Tool for Language Learning".  The Seventh Sloan-C International Conference on Online Learning: Emerging Standards of Excellence in Asynchronous Learning Networks. Orlando, FL.

  36. (2001) "Virtually Unlimited Annotation on Very Large Corpora".  IRCS Workshop on Linguistic Databases.  Philadelphia, PA.

  37. (2001) "Dialectal Variation and Diachronic Shifts with the "Preposition + Subject + Infinitive" Construction (para yo hacerlo)".  Fifth Hispanic Linguistics Symposium. Univ. Illinois.

  38. (2001) "Large Historical Corpora on the Web: Helping Students to Model Linguistic Change". The Third North American Symposium on Corpus Linguistics and Language Teaching. Boston, MA.

  39. (2000) "Diachronic Shifts in Spanish Raising Constructions".  4th Hispanic Linguistics Symposium, Indiana University.

  40. (2000) "Using Large Computer-Based Corpora as a Philological Tool:An Analysis of Five Medieval Spanish Bibles". 35th International Congress on Medieval Studies, Univ. Western Michigan, May 2000.

  41. (2000) "Using Large Computer-Based Parallel Texts to Study Lexical (and Other) Changes from Old Spanish to Modern Spanish". Kentucky Foreign Language Conference, April 2000.

  42. (1999)  "Diachronic shifts in the interpretation of Spanish infinitival complements". Conference on Spanish Semantics and Pragmatics. Ohio State Univ.

  43. (1999)  "Languages as dialects and dialects as languages: explaining parallel syntactic shifts in Spanish and Portuguese". International Conference on Historical Linguistics (ICHL XIV). Univ. of British Columbia.

  44. (1999)  "Creating Multimillion Word Corpora from Web-based Newspapers". North American Symposium on Corpora in Linguistics and Language Teaching. Univ. of Michigan.

  45. (1999)  "Modeling syntactic change: evidence from computer-based studies of infinitival complements in Spanish and Portuguese". Linguistic Symposium on Romance Languages (LSRL 29). Univ. of Michigan.

  46. (1998)  "Using multimillion word corpora of historical and dialectal Spanish texts to teach 'Advanced Spanish Syntax'".  Teaching and Language Corpora 1998 conference. Oxford University, England.

  47. (1998)  "A Corpus-Based Analysis of Subject Raising in Historical and Modern Spanish". Univ. Texas-Austin Colloquium on Romance Linguistics. Austin, TX

  48. (1997)  "Using Large Computer-Based Corpora to Investigate Language Variation and Change". Deseret Language and Linguistics Symposium. Provo, UT.

  49. (1996)  "The Use of Large Computer-Based Corpus in Research and Teaching". Colloquium on Spanish Linguistics. Roanoke, VA.

  50. (1995)  "The Diachronic Evolution of Portuguese Clitic Climbing". Annual Meeting of the Modern Language Association (MLA), Chicago, IL.

  51. (1995)  "Exploring Foreign Language Resources on the Internet". A series of presentations and workshops given to Illinois K-12 teachers; organized under the auspices of the Illinois Council on the Teaching of Foreign Languages (ICTFL).

  52. (1994)  "A Corpus-Based Approach to Modern Spanish Clitic Climbing." American Association of the Teachers of Spanish and Portuguese Annual Meeting (AATSP). Philadelphia, PA.

  53. (1994)  "Parameters, Passives, and Parsing: Explaining Diachronic Shifts in Spanish and Portuguese". Parasession at the Chicago Linguistics Society (CLS). Chicago, IL.

  54. (1992)  "Explaining Diachronic Shifts in Spanish and Portuguese Causative Constructions". Linguistic Symposium on Romance Languages (LSRL XXII). El Paso, TX.

  55. (1991)  "A Diachronic Look at Infinitival Complements in Spanish and Portuguese." Modern Language Association (MLA). San Francisco, CA.

  56. (1991)  "Parameters in Diachronic Spanish and Portuguese Causative Constructions." South Central Modern Language Association (SCMLA). Dallas, TX.

  57. (1991)  "Parameters in the Development of Diachronic Infinitival Complements in Spanish and Portuguese." Language Association of the Southwest (LASSO). Austin, TX.

  58. (1991)  "Towards a Unified Account of Diachronic Spanish Clitic Placement." Linguistic Symposium on Romance Languages (LSRL XXI). Santa Barbara, CA.

  59. (1990)  "Functional-Typological Explanations for Diachronic Shifts in Spanish Clitic Placement." "Explanation in Historical Linguistics" conference. Univ. Wisconsin-Milwaukee

CONSULTANCIES (LEGAL)


2004-05.  Expert witness for Wilmer Cutler Pickering Hale and Dorr LLP (Washington DC) in a case involving the generic status of a product name in Latin America

2008.  Expert witness for Wilmer Cutler Pickering Hale and Dorr LLP (Washington DC) in a case involving the generic status of a product name in Latin America (different lawsuit)

TEACHING EXPERIENCE


Brigham Young University (2003-present)

Graduate Courses (Illinois State University)

Undergraduate Courses (Illinois State University)

Graduate Course (Visiting Professor, Brigham Young University, Summer 1995)

  • Spanish Morphosyntax (625, Graduate Seminar)

HONORS AND AWARDS


Teaching-related

  • 2002. One of two mentors for the Distance Learning Training Program, to help train other professors at ISU to teach classes via the Internet.

  • 2001. Supplemental Travel Grant from the Center for the Advancement of Teaching (ISU) to present the paper "Multimillion Word Online Corpora as a Tool for Language Learning" at the Seventh Sloan-C International Conference on Online Learning: Emerging Standards of Excellence in Asynchronous Learning Networks. Orlando, FL.

  • 2001. One of three mentors for the Distance Learning Training Program, to help train other professors at ISU to teach classes via the Internet.

  • 2000. Grant from the ISU "Extended University" to develop online course: "Variation in Spanish Syntax".

  • 1999. Grant from the ISU "Extended University" to develop online course: "History of the Spanish Language".

  • 1998. Supplemental Travel Grant from the Center for the Advancement of Teaching (ISU) to present paper at Oxford University, England: ""Using multimillion word corpora of historical and dialectal Spanish texts to teach 'Advanced Spanish Syntax'"."

  • 1997. Grant from the Provost's Office at ISU to develop online course: "Foreign Language Resources on the Internet".

  • 1997. Teaching Initiative Award, Illinois State University (one of seven awarded at the university)

  • 1997. "Faculty Fellow" award from the Center for the Advancement of Teaching, Illinois State University, to develop an interactive, Internet-based, database-driven website for foreign language learners.

  • 1996. "Best of Illinois" award from the 1200+ member Illinois Council on the Teaching of Foreign Languages (ICTFL)

  • 1996. Office of Instructional Technology Grant to develop a course to be taught entirely by means of the Internet (one of eight awarded at the university)

  • 1995. Invited to Brigham Young University as Visiting Professor to teach graduate level seminar in Spanish syntax.

Research-related

External

Internal (ISU)

  • 2001. Summer Faculty Fellowship

  • 2000. University Research Grant

  • 1998. University Research Grant

  • 1997. University Research Grant

  • 1996. University Research Grant

  • 1996. Research Initiative Award (one of seven awarded at the university)

  • 1995. University Research Grant

SERVICE


Brigham Young University (2003-present)

  • Advisor for English Language and Linguistics majors and minors

  • Travel Committee

Illinois State University (1992-2003)

  • Chair, Technology Committee, Foreign Language Dept. (1994-96, 1999-present)

  • Graduate Advisor, Spanish Section, ISU (1996-97)

  • Spanish Section Coordinator, ISU (1997-99)

  • Representative, Institutional Review Board, ISU (1993-95)

  • Member, Linguistics Committee, Foreign Language Dept., ISU. (1993-present)

  • Member, Technology Committee, Illinois Council on the Teaching of Foreign Languages (ICTFL) (1996-present)