Home > Coverage
 Coverage Calculator v.1.2         Research coming...
  The percent of corpus words covered by a word list   New 2020: Elim.-Propers-from-Corpus option
This program calculates how many times the words on a list appear in a corpus. A list of the 2000 most common word families is often said to 'cover' up to 80% of the individual words in a general corpus of English. || How proper nouns in the corpus will be treated in this calculation is a checkbox option.|| Headword lists can be expanded into family/lemma lists here || List coverage in texts can be calculated here (Demo 7). || Known max of this routine mid-2020: 13k wds in list x 2.5m wds in corpus; with proper elimination 13k wds list x 1.2m wds corpus

COVERAGE RESEARCH: >   1. Nation (2006) 2. Laufer Ravenhorst (2010) 3. Schmitt Jiang Grabe (2011) 4. Schmitt Cobb et al (2015)  

DEMO
LISTS

AWL Heads

AWL Fams

BNC 1k Fams

BNC 1k Lems

NGSL Lems
[1k]   [1-2k]

BN-Coca Fams
[1k] [2k] [3k]
[1-2k] [1-3k]

NFL7
Nuclear
Fams@7%
[1k] [2k] [3k]
[1-3k]   [?]

(1) Click or paste LIST/name

 

  (2) Choose Corpus/Collection

 

(3) ELIM. Propers 
Max 1m

   

(4) Click
   

(5) See result