Home > Keywords > English (BNC SPOKEN 10 million as reference)

KeyWords Extractor v. 2   NEW 8 SEPT 2012 : (1) FAMILIZED OUTPUT AND (2) 10x BIGGER BASE CORPUS
This program determines the defining lexis in a specialized corpus, by comparing frequency per word to frequency in a reference corpus (Spoken BNC 10-million; calculated on a per-million basis).

Input mode A: Type or paste smaller text (<50,000 words) below and click Submit_window


5000+ Wd Samples: Dracula | Love Story | Mutiny - Bounty | Jungle Book | Speckled Band |      

Exceptions: Words to eliminate from analysis (e.g. proper nouns). [Type or Dbl-click in textarea]             PROPER BLOCKER
  And/or all mid-sentence caps  *

Input mode B: Upload larger text files (To max 10 MB, 1.5 million words, depending on traffic, processor, etc)
1. ...on own drive; 2. Opt for PROPER BLOCKER *   3. Make ref. corpus BNC-Med (1.4 million) * and then   3.  

Developed for CNA-Q August 2007 (Last mod. 2012 March)