Home > Freq > Nuclear input
  Nuclear List Builder v.1
    Reduce BNC-Coca family lists to just the forms present in a corpus
The families of the BNC-Coca frequency lists are large, corpus based, and complete, in order to classify every word of any text in Vocabprofile. But there is no reason for learners to know all possible forms of every word. Also, different text types (general, scientific, medical) employ different family members. This program crosses a BNC-Coca frequency list (one k-level at a time) against the frequencies of BNC-Coca members in a chosen small (1-2 million word) corpus.

(1) Get List

(2) Choose Cross-Corpus

(3) View output as...

Complete frequencies & percentages
(To choose cutoff)

Reduced list
With these cutoffs
(Click to choose) 

(4) Choose
Exclude members
less than

of family

or fewer than instances

(5) Click


(6) Get Result