Httrack

Mirror a website on your hard drive:

Copy all the html files from your local mirror of a website to a single directory:

find . -name "*.html" -exec cp {} /home/teresita/Desktop/html \;

Convert all the html files in a directory to text:

for i in *.html; do lynx --dump "$i" > "${i%%.*}.txt";done

About Linuxgal

Need a spiritual home? Consider joining us at Mary Queen of the Universe Latter-day Buddhislamic Free Will Christian UFO Synagogue of Vishnu
This entry was posted in Uncategorized. Bookmark the permalink.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s