Home > Text Tools > Tag Stripper
HTML (etc) Tag Stripper

This program applies the following REGEX (Regular Expression) to the indicated file.
  $file =~ s/<.*?>//g;
The effect is to substitute all < and > tags, plus everything between tags (.*?),
with nothing (//), throughout the file (globally).
Notes:
(File is usually HTML but not necessarily).
(Tag-stripping would be useful e.g. when building a corpus of web pages.)

  Do this: [1] Hard Disk for HTML file + [2]

  + [3] Save resulting stripped file back to own disk as *.txt.


Stay tuned for more text processing tools...

T Cobb - UQAM