FreqList Builders now have their own area at ../freq. 1. Tag Stripper
Removes HTML tags.2. Corpus BuilderJoin up to 25 files - to about half a million words. *NEW!3. Sentence ExtractorFile to sentences.4. Random number generatorMoved 5 March 08 to ../rand collection5. The Compleat Stripper
Eight kinds of text clean-up for input to other routines.
Notes
Some of these routines require TEXT files as their input. A text file is a simple file that contains no codes for emphasis, font sizes, etc. To transform a Word file into a text file, simply SAVE it AS text. You will not thereby lose the original file, but create an additional text file (identifiable by the .txt extension). Most of these routines take their file inputs from a menu that accesses the hard drive; they have not been adapted for copy-paste text entry. They have not been tested for French. For complex jobs, combine routines (e.g., first strip tags of html file, save as text file, then build list or extract sentences).