Actions

Remove Wikipedia references from HTML

From OPOSSEM

Revision as of 09:34, 12 July 2011 by Philip Schrodt (talk | contribs) (Created page with "<!-- add any hidden notes here --> The following code will remove all of the Wikipedia references from an HTML file downloaded from FireFox 1. Use the "File/Save Page As..." to...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)


The following code will remove all of the Wikipedia references from an HTML file downloaded from FireFox

1. Use the "File/Save Page As..." to save the page to a file myfile.html

2. In the Terminal, run

perl -npe 's/<a href=\"http:\/\/en\.wikipedia\.org\/wiki.+?>(.+?)<\/a>/$1/g' myfile.html > mynicefile.html

3. Use "File/Open File" to open the new file, which you can then print.