Extracting street address, phone number, and email from a page
Posted: Wed Jul 10, 2013 11:19 pm
For the USA, is it possible to extract the street address, city, state, zip,(possibly phone and email) without knowing where this text exist on a webpage and return the extracted information in a text file? For example, have a spider read a text file of URLs.
http://www.kennedy-center.org
http://www.nbm.org
and return in a delimited text file the original URL and the address/phone from the page:
http://www.kennedy-center.org, 2700 F Street, NW Washington, DC 20566, 800-444-1324, 202-467-4600
http://www.nbm.org, 401 F Street NW, Washington, D.C. 20001, 202.272.2448
Thanks,
Gilbert
http://www.kennedy-center.org
http://www.nbm.org
and return in a delimited text file the original URL and the address/phone from the page:
http://www.kennedy-center.org, 2700 F Street, NW Washington, DC 20566, 800-444-1324, 202-467-4600
http://www.nbm.org, 401 F Street NW, Washington, D.C. 20001, 202.272.2448
Thanks,
Gilbert