Day 2: Analyzing Vocabulary

Exploring Lexical Richness – Diversity and Sophistication

Author

Masaki EGUCHI, Ph.D.

Modified

August 4, 2025

Overview

Day 2 focuses on analyzing vocabulary in corpus linguistics, introducing concepts of lexical richness, particularly diversity and sophistication.

Key Concepts

  • Lexical Richness (text internal vs external measures)
  • Lexical Diversity (Type-Token Ratio, MTLD)
  • Lexical Sophistication (frequency, concreteness, phonological neighbors)
  • Lexical profiling
  • Frequency Lists and Zipf law

Preparation

Before Day 2:

Schedule

Time Activity
10:30-12:00 Session 4: Analyzing vocabulary (1) — Conceptual overview
12:00-13:00 Lunch
13:00-14:30 Session 5: Frequency Analysis and Lexical Profiling
14:30-14:40 Break
14:40-16:10 Session 6: Computing Lexical Measures
16:10-17:00 Office Hour (You can ask questions.)

Assignments

Reflection