getting a vaule from an html tag to embed in a scan structure

BlackWidow scans websites (it's a site ripper). It can download an entire website, or download portions of a site.
Post Reply
andyn321
Posts: 3
Joined: Thu Aug 29, 2013 11:31 am

getting a vaule from an html tag to embed in a scan structure

Post by andyn321 »

is it possible to get a date string from an html tag and then use that date to structure the images extracted ?

e.g. the html contains multiple divs each something like:

Code: Select all

<article class="post">
<h3 class="date">August 22, 2013</h3>
<ul>
<li class="contributor_gallery" id="contributor_gallery_4118">
<div class="thumb"><a href="/en-gb/contributor_galleries/4118-tarnly"><img alt="9" src="http://www.website.com/assets/images/thumbs/0090/3208/9.jpg?1377056381"></a></div>
<div class="title"><a href="/en-gb/contributor_galleries/4118-tarnly">Tarnly</a></div>
<div class="title-bg"></div>
</li>
<li class="contributor_gallery" id="contributor_gallery_4120">
<div class="thumb"><a href="/en-gb/contributor_galleries/4120-veera"><img alt="3" src="http://www.website.com/assets/images/thumbs/0090/2872/3.jpg?1377056341"></a></div>
<div class="title"><a href="/en-gb/contributor_galleries/4120-veera">Veera</a></div>
<div class="title-bg"></div>
</li>
</ul>
</article>
Could the date string in each <h3 class="date"> tag be used to structure the images ?

I presume this would require a script rather than a simple extract ?

User avatar
Support
Site Admin
Posts: 1892
Joined: Sun Oct 02, 2011 10:49 am

Re: getting a vaule from an html tag to embed in a scan structure

Post by Support »

Actually, no, not with BlackWidow. With our BrownRecluse, you can do anything you want, it's entirely programmable.
Your support team.
http://SoftByteLabs.com

Post Reply