freeware




Re: Advice on webpage scraper


From w4tch3r 05/02/2006 02:20



> I have a friend looking for a webpage scraper application to extract

Do you really need a dedicated application for the job?

How about just saving the web page to a local copy and then using tools
to parse the source code? If the info is well delimited (e.g. with
quotes, semicolons or commas), then an editor that supports regular
expressions will strip them all out nicely (e.g. Regex Power!,
http://www.ware4u.de/regexpower/download.html).
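
For anyone who would rather script the regular-expression step than do it
in an editor, here is a minimal Python sketch of the same idea. The sample
data, the quote-delimited layout and the name extract_records are all
made up for illustration; a real saved page will also contain quoted HTML
attributes, so the pattern would need narrowing for actual use.

```python
import re

# Hypothetical sample standing in for the source of a saved web page.
page_source = '''
<td>"Alice Smith";"12 High Street";"Leeds"</td>
<td>"Bob Jones";"4 Mill Lane";"York"</td>
'''

# Capture whatever sits between a pair of double quotes.
QUOTED_FIELD = re.compile(r'"([^"]*)"')

def extract_records(source):
    """Return one list of quoted fields per line that contains any."""
    records = []
    for line in source.splitlines():
        fields = QUOTED_FIELD.findall(line)
        if fields:
            records.append(fields)
    return records

records = extract_records(page_source)
for record in records:
    print(", ".join(record))
```

To use it on a real file, read the saved page into a string first (e.g.
with open(...).read()) and pass that in instead of page_source.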

If the data is not delimited nicely but there is some other pattern, e.g.
CRLF around the names or addresses, or some tags on the fields (e.g.
"NAME:"), then you could try an editor that supports macros, like
Notepad++ (http://notepad-plus.sourceforge.net/uk/site.htm). For example,
run a macro that deletes all but the names, then reload the file and use
a macro that deletes all but the first address line, and so on.
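
The "delete all but the names" pass can also be sketched in a few lines of
Python instead of an editor macro. This is only a rough illustration: the
tag names, the sample listing and the function fields_with_tag are
assumptions, not anything taken from the page in question.

```python
import re

# Hypothetical listing where each field starts with a tag such as "NAME:".
listing = """\
NAME: Alice Smith
ADDRESS: 12 High Street
Leeds
NAME: Bob Jones
ADDRESS: 4 Mill Lane
York
"""

def fields_with_tag(text, tag):
    """Keep only the lines that start with 'TAG:' and return their values."""
    # [ \t]* skips spaces after the colon without running onto the next line.
    pattern = re.compile(r"^" + re.escape(tag) + r":[ \t]*(.*)$", re.MULTILINE)
    # strip() also drops a trailing CR if the file uses CRLF line endings.
    return [value.strip() for value in pattern.findall(text)]

# One pass per field, like rerunning a macro on a fresh copy of the file.
names = fields_with_tag(listing, "NAME")
addresses = fields_with_tag(listing, "ADDRESS")
print(names)
print(addresses)
```

Each call works on the untouched text, so there is no need to reload the
file between passes as you would with a destructive macro.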

W4tch3r



Advice on webpage scraper Veign
|  Re: Advice on webpage scraper w4tch3r
  Re: Advice on webpage scraper Al Klein
    Re: Advice on webpage scraper Veign
      Re: Advice on webpage scraper Al Klein
        Re: Advice on webpage scraper Veign
  Re: Advice on webpage scraper Ned Goudy
    Re: Advice on webpage scraper Veign
      Re: Advice on webpage scraper Ned Goudy
        Re: Advice on webpage scraper Veign
 