MLUG: Re: [MLUG] text extract from web (not vanilla extract btw... ;)
Re: [MLUG] text extract from web (not vanilla extract btw... ;)
Email address obfuscation in effect -- please click here to turn it off.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
On Thu, 21 Sep 2006, George Robb wrote:

I'm close very close...
crack open a terminal and issue the command links www.google.com (text based web browsing is my friend)


how can I "grep" that page for text?  e.g. links www.google.com | grep Lucky >> results.txt
or is there an easier way?


I'm not clear on exactly what you are trying to do. If you want to grab the text without hyperlinks this is a good way:

lynx -dump -nolist www.google.com | grep Lucky >> results.txt

Note that lynx has an -width= option:

lynx -dump -nolist -width=140 www.google.com | grep Lucky >> results.txt

(for example).

Mike

_______________________________________________
members mailing list
EMAIL:PROTECTED
http://mlug.missouri.edu/mailman/listinfo/members