    Reduce BNC-Coca family lists to just the forms present in a corpus
The families of the BNC-Coca frequency lists are large, corpus-based, and complete, so that Vocabprofile can classify every word of any text. But learners have no need to know every possible form of every word, and different text types (general, scientific, medical) employ different family members. This program crosses a BNC-Coca frequency list (one k-level at a time) against the frequencies of that list's family members in a chosen small (1-2 million word) corpus, keeping only the forms that actually occur there.
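
The crossing step can be illustrated with a minimal Python sketch. It is not the program's actual code. It assumes that a k-level list has already been loaded into a dictionary mapping each headword to its member forms, that the corpus is available as a plain string, and that a family with no surviving members is dropped altogether. The names `count_tokens`, `reduce_families`, and `min_count` are hypothetical, as is the tiny sample data.

```python
import re
from collections import Counter


def count_tokens(corpus_text):
    """Count lower-cased word tokens in the corpus text (simple regex tokenizer)."""
    return Counter(re.findall(r"[a-z]+(?:'[a-z]+)?", corpus_text.lower()))


def reduce_families(families, corpus_counts, min_count=1):
    """Keep only the family members that occur at least min_count times.

    families:      dict of headword -> list of member forms (assumed layout)
    corpus_counts: Counter of token frequencies in the chosen corpus
    Returns a dict of headword -> list of (form, frequency) pairs.
    Families with no attested member are left out (an assumption).
    """
    reduced = {}
    for headword, members in families.items():
        kept = [
            (form, corpus_counts[form.lower()])
            for form in members
            if corpus_counts[form.lower()] >= min_count
        ]
        if kept:
            reduced[headword] = kept
    return reduced


# Hypothetical sample data, for illustration only.
families = {
    "accept": ["accept", "acceptable", "acceptability",
               "accepted", "accepting", "accepts"],
    "abandon": ["abandon", "abandoned"],
}
corpus = ("The patient accepted the treatment. "
          "Doctors accept acceptable risks and accepted them.")

reduced = reduce_families(families, count_tokens(corpus))
for headword, forms in reduced.items():
    print(headword, forms)
```

With this sample, the family "accept" is reduced to the three forms found in the corpus and "abandon" disappears. Raising `min_count` would give a stricter list that ignores forms attested only once or twice.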

