Filters Help

BlackWidow scans websites (it's a site ripper). It can download an entire website, or download portions of a site.
Post Reply
freeindeed
Posts: 4
Joined: Tue Sep 18, 2012 7:43 pm

Filters Help

Post by freeindeed » Tue Sep 18, 2012 7:50 pm

I am trying to download the repair manual pages for my car, a 2007 Prius. The site https://techinfo.toyota.com/t3Portal/do ... 8014529305 has user id: freeindeed and password: football. Many link errors occur when I scan entire site and if I stay confined to this url, I get nothing at all. Are you really that good?

User avatar
Support
Site Admin
Posts: 1830
Joined: Sun Oct 02, 2011 10:49 am

Re: Filters Help

Post by Support » Tue Sep 18, 2012 8:05 pm

When I login, it takes me to a page with no links in it, just a notice "When disconnecting the cable from the negative...". What are you trying to download? I mean, where are the links?
Your support team.
http://SoftByteLabs.com

freeindeed
Posts: 4
Joined: Tue Sep 18, 2012 7:43 pm

Re: Filters Help

Post by freeindeed » Wed Sep 19, 2012 12:59 pm

Thanks for the help!
Using Black Widow browser I enter:
https://techinfo.toyota.com/techInfoPor ... ota.com%2F
Then login, then paste in URL bar:

https://techinfo.toyota.com/t3Portal/re ... context=ti
Document tree at left lists RM (Repair Manual) pages displayed at right. Saving links at left produces HTML link not readable in my browser. Printing from link top right produces printer output or converted pdf file. Another link would be tree of SB, and other classes of documents for my car.

Otherwise, return to:
https://techinfo.toyota.com/t3Portal/ap ... air_search
Select from pull down menus Model=Prius and Year=2007 and click Search. Tabs below Search have decreasing value toward right but all pertain to my car. Selecting RM and saving one of the documents from context menu produces unreadable link in html file.
The next step requires logging into techinfo from IE9 because clicking the link for any of these documents opens a second window in IE which requires login but kicks out Black Widow login, as server detects and restricts to one concurrent login. All further processing by Black Widow produces java errors until program is shut down, reopened, and logged in.
After closing Black Widow, open IE and paste the above link. Select Model and Year as above Search. Select any tab such as RM.
Clicking on any of these document titles opens a second window in IE with dialog box to Save or Save As produces a pdf file. Selecting OPEN displays the PDF with URL such as (depending on which document):
https://techinfo.toyota.com/t3Portal/re ... context=ti

All documents appear to be on server as pdf files controlled by java script.

Can you write a script which saves them in Black Widow?
Thank you

PS- Form will not allow me to upload attachment: "Extension not allowed" whether .txt .bw6 or no extension. I wanted to include full text of this post because in preview the links are truncated. Hopefully you receive the full links.

User avatar
Support
Site Admin
Posts: 1830
Joined: Sun Oct 02, 2011 10:49 am

Re: Filters Help

Post by Support » Wed Sep 19, 2012 7:43 pm

I fixed the board to accept txt and bw6 uploads.

No matter what I do, I can't get it to the pdf, and the login thing is so sensitive!. And because it's using https, I'm not able to use the BW NetSpy! I'm very good at this, but for thsi site, it's too much jsp stuff for BW to handle.
Your support team.
http://SoftByteLabs.com

freeindeed
Posts: 4
Joined: Tue Sep 18, 2012 7:43 pm

Re: Filters Help

Post by freeindeed » Wed Sep 19, 2012 7:53 pm

At least you like a good challenge!!
Is BR better suited?

User avatar
Support
Site Admin
Posts: 1830
Joined: Sun Oct 02, 2011 10:49 am

Re: Filters Help

Post by Support » Wed Sep 19, 2012 8:32 pm

Yes I do :mrgreen:

I'll give it a try in BR see if I can get it to work.
Your support team.
http://SoftByteLabs.com

User avatar
Support
Site Admin
Posts: 1830
Joined: Sun Oct 02, 2011 10:49 am

Re: Filters Help

Post by Support » Thu Sep 20, 2012 11:49 am

I'm not having any luck with BR either! Is there a way to access the pages in non-secure http as oppose to secured https?
Your support team.
http://SoftByteLabs.com

freeindeed
Posts: 4
Joined: Tue Sep 18, 2012 7:43 pm

Re: Filters Help

Post by freeindeed » Thu Sep 20, 2012 6:02 pm

No access except through https://

Did you try using IE to this URL:
https://techinfo.toyota.com/t3Portal/ap ... air_search
Then selecting from pull down menus Model=Prius and Year=2007 and click Search. Select any tab such as RM.
Clicking on any of these document titles opens a second window in IE with dialog box to Save or Save As produces a pdf file?
I do not understand the java code used by their document server, but I do see a link such as:
https://techinfo.toyota.com/t3Portal/re ... context=ti
which produces the pdf file. I have saved these individually, but it is time consuming. A script would be so valuable to me.
I got this link by clicking OPEN in the second IE window above.
If BR or BW could sniff out these links, it could save each as a pdf file.

Is the scripting language for BR or BW documented on your website? I couldn't find it.
Thanks for the good fight. I know you are frustrated when beaten!!

User avatar
Support
Site Admin
Posts: 1830
Joined: Sun Oct 02, 2011 10:49 am

Re: Filters Help

Post by Support » Thu Sep 20, 2012 8:44 pm

Yes I so try that too, but somehow, it's not working. I have BR set to login via the built it browser, I can get the page and the links on those pages, then i can get the 2nd page where the libk to the PDF is, but it's not a link, it's some kind of redirect or something because it's not returning a pdf.

Here is a manual of the scripting language used in BR, the same apply to BW.
Attachments
BrownRecluseHelp.zip
(851.21 KiB) Downloaded 736 times
Your support team.
http://SoftByteLabs.com

Post Reply