Home > Coverage (update 12 Nov 2025)
 Coverage Calculator v.2.3 +FRENCH       
    The percentage of words in a corpus that are the same as the words in a list
This program calculates how many times the words on a list appear in a corpus. A list of the 2,000 most common word families is often said to 'cover' up to 80% of the individual words in a general corpus of English - i.e., 80% of the words in the corpus will be words from that list. || Treatment of proper nouns is a checkbox option.|| Headword lists can be expanded into family/lemma lists here || List coverage in texts can be calculated here (Demo 7). || Known max of this routine mid-2020: 13k wds in list x 2.5m wds in corpus

RESEARCH: >   1. Nation (2006)   2. Laufer Ravenhorst (2010)   3. Schmitt Jiang Grabe (2011)   4. Schmitt Cobb et al (2015)     5. Cobb Laufer (2021)  


DEMO LISTS

AWL Heads | Fams

BNC-1k Fams | Lems

NGSL Lems 1k | 1-2k

BN-Coca Fams
1k | 2k | 3k
1-2k | 1-3k

Nuclear (Eng 1-3k)   [?]
nfl-0 nfl-1 nfl-2
nfl-3 nfl-4 nfl-5 nfl-6

French Nuclear
Listes de fréquence nucléaire françaises
LFNF-x (>x% of family)

fr_lfnf-0
fr_lfnf-1
fr_lfnf-2
fr_lfnf-3
fr_lfnf-4
fr_lfnf-5
fr_lfnf-6
fr_lfnf-7
fr_lfnf-8
fr_lfnf-9
fr_lfnf-10
(1) Click or paste LIST/name

 

  (2) Choose Corpus
      (Fr at bottom)

 

(3) Handle
Propers 
Subtract
from corpus
Add
to list

(4) Click
   

(5) See result