Day 2: Analyzing Vocabulary
Exploring Lexical Richness – Diversity and Sophistication
Overview
Day 2 focuses on analyzing vocabulary in corpus linguistics, introducing concepts of lexical richness, particularly diversity and sophistication.
Key Concepts
- Lexical Richness (text internal vs external measures)
- Lexical Diversity (Type-Token Ratio, MTLD)
- Lexical Sophistication (frequency, concreteness, phonological neighbors)
- Lexical profiling
- Frequency Lists and Zipf law
Preparation
Before Day 2:
- Read:
- Durrant Ch. 3
- Skim:
- Durrant Ch. 4 (Ignore R codes if you are not familiar)
- Eguchi, M., & Kyle, K. (2020). Continuing to Explore the Multidimensional Nature of Lexical Sophistication. The Modern Language Journal, 104(2), 381–400.
- Watch:
- Laurence Anthony’s intro to AntConc
Schedule
| Time | Activity |
|---|---|
| 10:30-12:00 | Session 4: Analyzing vocabulary (1) — Conceptual overview |
| 12:00-13:00 | Lunch |
| 13:00-14:30 | Session 5: Frequency Analysis and Lexical Profiling |
| 14:30-14:40 | Break |
| 14:40-16:10 | Session 6: Computing Lexical Measures |
| 16:10-17:00 | Office Hour (You can ask questions.) |
Assignments
- Due 8/6 (Wed): Corpus Lab Assignment 2
- Complete lexical analysis exercises using AntConc and web applications