Extract data from HTML that looks like this

BeownReclise is a programmable web spider. Scan a web site and retrieve from it the information you need. For example, you could scan a real estate web site, collect all of the agent addresses, phone numbers and emails, and place this data into a tab-delimited database file, then import that file into Excel.
steve1040
Posts: 5
Joined: Wed Jul 25, 2012 2:31 pm

Extract data from HTML that looks like this

Post by steve1040 » Fri Apr 05, 2013 6:21 am

I have a website I'm trying to collect data from.
I need help grabbing the data I want.

Thanks

I need the following exported to a pipe-delimited text file.
From the HTML below I need:
Field0 = 1010943
Field1 = EHS Hospital Admin Operating Summary Report
Field2 = ADT Statistics
Field3 = As Needed
Field4 = Executive - Clinical; Executive - Financial
Field5 = ADT; BedTime
Field6 = Yes
Field7 = Determine your organization's health by reviewing census statistics and financial data for a selected date and month-to-date.

Repeat for all records.
There are 20 records on each page, and 179 pages in total.



The HTML looks like this:
<td class="paddedLinkCell" width="19%">
<a title="EHS Hospital Admin Operating Summary Report" href="/Data/Reports/Details/1010943">
EHS Hospital Admin Operating Summary Report</a>
</td>
<td class="paddedCell" width="16%">
ADT Statistics
</td>
<td class="paddedCell" width="15%">
As Needed
</td>
<td class="paddedCell" width="16%">
Executive - Clinical; Executive - Financial
</td>
<td class="paddedCell" width="16%">
ADT; BedTime
</td>
<td class="paddedCell" width="9%">
Yes
</td>

</tr>
<tr class="oddRow">
<td colspan="7" style="color: gray; padding-left: 20px">
Determine your organization's health by reviewing census statistics and financial data for a selected date and month-to-date.
</td>
</tr>

<tr class="evenRow">


<td class="paddedLinkCellImage" width="9%">
<div class="image-wrapper">
<a rel="lightbox" title="Administrative Operating Summary Report Template" href="/Data/ImageHandler.ashx?size=80&filename=U:\Images\2010 RELEASE\1000001To1100000\1015298.png">
<img src="/DataHandbook/ImageHandler.ashx?filename=U:\Images\2010 RELEASE\1000001To1100000\1015298.png" alt=""
style="border-width: 0px; width: 80px ;" />
<div class="usw-customer-icon-small usw-slg121-small superimposed" title="Epic"></div>
</a>
</div>
</td>


<td class="paddedLinkCell" width="19%">
<a title="Administrative Operating Summary Report Template" href="/Data/Reports/Details/1010945">
Administrative Operating Summary Report Template</a>
</td>
<td class="paddedCell" width="16%">
ADT Statistics
</td>
<td class="paddedCell" width="15%">
Go-Live Critical
</td>
<td class="paddedCell" width="16%">
Admissions Director; Executive - Clinical; Executive - Financial; Registration Supervisor
</td>
<td class="paddedCell" width="16%">
ADT; BedTime; Resolute Hospital Billing
</td>
<td class="paddedCell" width="9%">
No
</td>

</tr>
<tr class="evenRow">
<td colspan="7" style="color: gray; padding-left: 20px">
Review your organization's financial health and utilization rates using census statistics, financial data, appointment data, surgical case data, and emergency department data for a selected date, month-to-date, and year-to-date.
</td>
</tr>
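
For readers who land on this thread, the field mapping described above can be sketched in plain Python. This is not a BeownReclise script and not something Support provided. It is only a minimal illustration using the standard library, and it assumes every record follows the pattern in the sample: a link to /Data/Reports/Details/<id>, then five "paddedCell" cells, then one colspan="7" description cell. The names ReportRowParser, to_pipe_delimited and SAMPLE are made up for this example. Fetching the 179 pages is left out because the real URL is not given in the post.

```python
import csv
import io
from html.parser import HTMLParser


class ReportRowParser(HTMLParser):
    """Collects one 8-field record per report details link (hypothetical helper)."""

    DETAILS_PREFIX = "/Data/Reports/Details/"

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.records = []      # finished records, each a list of 8 fields
        self._current = None   # record currently being built, or None
        self._buffer = None    # text chunks of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        href = attrs.get("href") or ""
        if tag == "a" and href.startswith(self.DETAILS_PREFIX):
            # A details link starts a record: Field0 = id, Field1 = title.
            report_id = href[len(self.DETAILS_PREFIX):]
            self._current = [report_id, attrs.get("title") or ""]
        elif tag == "td" and self._current is not None:
            # Field2..Field6 are "paddedCell" cells; Field7 is the colspan=7 cell.
            if attrs.get("class") == "paddedCell" or attrs.get("colspan") == "7":
                self._buffer = []

    def handle_data(self, data):
        if self._buffer is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "td" and self._buffer is not None:
            # Collapse newlines and runs of spaces inside the cell text.
            self._current.append(" ".join("".join(self._buffer).split()))
            self._buffer = None
            if len(self._current) == 8:
                self.records.append(self._current)
                self._current = None


def to_pipe_delimited(html):
    """Return the records found in `html` as pipe-delimited lines."""
    parser = ReportRowParser()
    parser.feed(html)
    parser.close()
    out = io.StringIO()
    # csv quotes a field only if it happens to contain "|" or a double quote.
    csv.writer(out, delimiter="|", lineterminator="\n").writerows(parser.records)
    return out.getvalue()


# First record from the post, trimmed to the cells that matter.
SAMPLE = """
<td class="paddedLinkCell" width="19%">
<a title="EHS Hospital Admin Operating Summary Report" href="/Data/Reports/Details/1010943">
EHS Hospital Admin Operating Summary Report</a>
</td>
<td class="paddedCell" width="16%">
ADT Statistics
</td>
<td class="paddedCell" width="15%">
As Needed
</td>
<td class="paddedCell" width="16%">
Executive - Clinical; Executive - Financial
</td>
<td class="paddedCell" width="16%">
ADT; BedTime
</td>
<td class="paddedCell" width="9%">
Yes
</td>
</tr>
<tr class="oddRow">
<td colspan="7" style="color: gray; padding-left: 20px">
Determine your organization's health by reviewing census statistics and financial data for a selected date and month-to-date.
</td>
</tr>
"""

if __name__ == "__main__":
    print(to_pipe_delimited(SAMPLE), end="")
```

Run against the sample, this prints a single line with the eight fields in the order Field0 to Field7, separated by "|". The thumbnail cell ("paddedLinkCellImage") is skipped because its link does not point at /Data/Reports/Details/. If the real pages deviate from the pattern assumed here, the matching rules would need adjusting.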

steve1040
Posts: 5
Joined: Wed Jul 25, 2012 2:31 pm

Re: Extract data from HTML that looks like this

Post by steve1040 » Mon Apr 08, 2013 4:18 pm

Anyone here?
Is this forum still live?

Support
Site Admin
Posts: 1848
Joined: Sun Oct 02, 2011 10:49 am

Re: Extract data from HTML that looks like this

Post by Support » Mon Apr 08, 2013 6:40 pm

You are basically asking us to make you a free script using the data you provided. We can't do that with this data alone: we need the actual URL, and you need to purchase a request to have a script made. If it were something simple, no problem, but this is not.
Your support team.
http://SoftByteLabs.com
