Regarding Downnload images from the site
Hi
I want to download images from flipkart.com to my folder and i made a scraper script.
Also i does not want to download a single image , i want to download all the images which is running with paging and for this i made a script which is running fine to get the record from the site but in the case of downloading image it will create problem.even no any single image is being download.
I have made download script as follows
session.log(currentURL);
session.downloadFile(currentURL ,"C:/Users/nutsuser/Desktop/images/"+ session.getVariable("IMAGE_SECOND") +".jpg", 5);
and the pattern for this is as follows
url is this
http://www.flipkart.com/richard-scarry-best-story-book-book-0307165485?ref=c4b3c63b-fdfa-46f2-ae04-c66ce521c463
and i am using this tag for making data pattern
the main pattern is
and the sub pattern is this
img/~@IAMGE_SECOND@~.jpg"
When i run the script the in the log screen message are shown as follows.
Storing this value in a session variable.
URL_REFERENCE=ref=33359f11-3b34-4c5a-953d-55282cc3320d
Storing this value in a session variable.
BOOK_NAME_URL=The Good Earth
serach result: Processing scripts after a pattern application.
Processing script: "Scrap deatil page"
Scraping file: "Detail page"
Detail page: Preliminary URL: http://www.flipkart.com/~#PRODUCT#~
Detail page: Using strict mode.
Detail page: Resolved URL: http://www.flipkart.com/good-earth-pearl-s-buck-book-1416500189
Detail page: Sending request.
Detail page: Processing scripts before all pattern applications.
Detail page: Applying extractor pattern: Untitled Extractor Pattern
Detail page: Extracting data for pattern "Untitled Extractor Pattern"
Detail page: The following data elements were found:
Untitled Extractor Pattern--DataRecord 0:
DATARECORD=ge" content="http://img3.fkcdn.com/img/186/9781416500186.jpg"
IMAGE_FIRST=img3
IAMGE_SECOND=186/9781416500186
Detail page: Processing scripts after a pattern application.
Processing script: "Download script"
http://null.fkcdn.com/img/null.jpg
ERROR: Failed to save the file: C:\Users\nutsuser\Desktop\images\null.jpg. The error message was: null.fkcdn.com.
Detail page: Processing scripts once if pattern matches.
Detail page: Processing scripts after all pattern applications.
detail Link--DataRecord 2:
PRODUCT=itil-v3-foundation-handbook-ashley-book-0113311974
Storing this value in a session variable.
PAGING=4
Plese check this so that i could found where i was wrong.Also give me the solution for download the images from this site.
I remember when I made you a
I remember when I made you a demo of this scrape.
I would need to see your script, as there is a NULL in the image path that is the problem. Nevertheless, I see there is a typo in your IMAGE_SECOND token name that might be related.
Pankaj, Make sure you are
Pankaj,
Make sure you are saving the following two variables as session variables if you are referencing them as session variables in your download script.
IMAGE_FIRST
IAMGE_SECOND
As an alternative, you don't need to save them as session variables if instead you want to refer to them by dataRecord.get(), instead.
For consistency it is recommended that you correct the spelling of your second variable.
IAMGE_SECOND should be IMAGE_SECOND
-Scott