[ale] Extraction of address and pages
Keith R. Watson
keith.watson at gtri.gatech.edu
Thu Nov 4 15:01:34 EST 2004
At 14:49 11/4/2004 -0500, you wrote:
>I'm trying to get http://addr:port/page
>
>from:
>
>GET http://www.google.com/ HTTP/1.1
>
>this sucks as it is too greedy. Anyone have a suggestion.
>$m =~ m/http:\/\/(.+)\/\s+/;
>
>Thanks,
>Chris
Chris,
$m =~ m/http:\/\/(.+?)\/\s+/;
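. means any character except a newline
+ means one or more times
+ is greedy by default, so .+ matches as far along the line as it can (up to the next newline) and only gives characters back until the rest of the pattern can match
the ? after the + tells it to not be greedy, so it matches as few characters as possible and stops at the first / followed by whitespace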
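
To see the difference side by side, here is a minimal sketch. The input line with a second URL on it is made up for illustration (the difference only shows when a later "/ " appears on the same line), and the final \S+ pattern is an alternative that is not from the original question, added on the assumption that the whole http://addr:port/page string is what is wanted.

```perl
use strict;
use warnings;

# Hypothetical input: a second "/ " later on the line makes the difference visible.
my $m = 'GET http://www.google.com/ HTTP/1.1 (see http://www.example.com/ )';

# Greedy: .+ runs to the end of the line, then backs up to the LAST "/" followed by whitespace.
my ($greedy) = $m =~ m/http:\/\/(.+)\/\s+/;

# Non-greedy: .+? grows one character at a time and stops at the FIRST "/" followed by whitespace.
my ($lazy) = $m =~ m/http:\/\/(.+?)\/\s+/;

# Alternative (an assumption, not from the thread): take the whole URL as a run of non-whitespace.
my ($url) = 'GET http://www.google.com:8080/page HTTP/1.1' =~ m{(http://\S+)};

print "greedy: $greedy\n";   # www.google.com/ HTTP/1.1 (see http://www.example.com
print "lazy:   $lazy\n";     # www.google.com
print "url:    $url\n";      # http://www.google.com:8080/page
```

Using m{...} instead of m/.../ in the last pattern avoids having to escape every slash.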
keith
-------------
Keith R. Watson GTRI/ISD
Systems Support Specialist III Georgia Tech Research Institute
keith.watson at gtri.gatech.edu Atlanta, GA 30332-0816
404-894-0836