Vocabulary Profilers break texts down by word frequencies in the language at large, as opposed to in the text itself.
Most of the English Vocabprofilers on this site are based on Laufer and Nation's Lexical Frequency Profiler. They divide the words of a text either into four categories (first thousand level, second thousand level, academic words, and the remainder or 'off-list') or into the 20 BNC-based levels plus off-list. [Since this was written, several more frameworks have taken the field; see VP-Compleat.] VP is used for many research and teaching purposes, such as matching texts to learners via the Levels Test.
Laufer & Nation's original 4-way sorter
250-word cuts for finer analysis
current development version
on 1 interface
New! text_lex_compare output
Typical format
Integral text: buck did not read the newspapers or he would have known that trouble was brewing not only for himself but for every tide water dog strong of muscle and with warm long hair from puget sound to san diego
1k types: [families 27 : types 29 : tokens 31 ] and_ buck_ but_ did_ dog_ every_ for_ from_ have_ he_ himself_ known_ long_ newspapers_ not_ of_ only_ or_ read_ sound_ strong_ that_ the_ to_ trouble_ was_ water_ with_ would_
2k types: [3:3:3] hair_ tide_ warm_
OFF types: [ ?:5:5 ] brewing_ diego_ muscle_ puget_ san_
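The type and token counts in the sample above can be reproduced with a minimal sketch like the one below. It is an illustration only, not the site's actual code: the band lists `K1` and `K2` are toy sets copied from the sample output rather than the full first and second thousand lists, the function name `profile` is hypothetical, and family counts are left out because they require word-family lists that are not shown here.

```python
from collections import Counter

# Toy band lists copied from the sample output above (an assumption for
# illustration); a real profiler would load full frequency lists instead.
K1 = set(
    "and buck but did dog every for from have he himself known long "
    "newspapers not of only or read sound strong that the to trouble "
    "was water with would".split()
)
K2 = {"hair", "tide", "warm"}


def profile(text):
    """Sort each token into a band and count occurrences per word."""
    bands = {"1k": Counter(), "2k": Counter(), "OFF": Counter()}
    for token in text.lower().split():
        if token in K1:
            bands["1k"][token] += 1
        elif token in K2:
            bands["2k"][token] += 1
        else:
            # Anything not found on a list falls through to off-list.
            bands["OFF"][token] += 1
    return bands


TEXT = (
    "buck did not read the newspapers or he would have known that trouble "
    "was brewing not only for himself but for every tide water dog strong "
    "of muscle and with warm long hair from puget sound to san diego"
)

result = profile(TEXT)
for band, counts in result.items():
    types = len(counts)            # distinct word forms in the band
    tokens = sum(counts.values())  # running words in the band
    print(f"{band} types: [{types}:{tokens}]", " ".join(sorted(counts)))
```

The 1k band has more tokens than types because "not" and "for" each occur twice in the passage; every other word occurs once.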