login to flickr
Hello
I want to log in to Flickr in order to scrape some pictures, but the login always fails.
http://www.flickr.com
Has anybody done this before?
So I had the idea that it would be good to be able to log in manually and then continue.
When it's time to log in, I want screen-scraper to open a browser, I log in manually, and then screen-scraper should continue with me logged in.
Is that possible?
Regards
Ben
I don't have a way for screen-scraper to interact with a browser. What you'd need to do is make a script that has the credentials, or a script that reads in a file that has the credentials. See tutorial #2 for some instructions.
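As a rough illustration of the "script that reads in a file that has the credentials" idea, here is a minimal sketch in plain Python. It is not screen-scraper's own scripting API; the `KEY=VALUE` file layout and the `read_credentials` name are assumptions made up for this example.

```python
from pathlib import Path


def read_credentials(path):
    """Parse KEY=VALUE lines into a dict (hypothetical file layout).

    Blank lines, lines starting with '#', and lines without '=' are skipped.
    Only the first '=' splits the line, so values may contain '='.
    """
    creds = {}
    for raw in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if not sep:
            continue  # not a KEY=VALUE line
        creds[key.strip()] = value.strip()
    return creds
```

The resulting values would then be handed to whatever sends the login POST, so the credentials stay out of the scraping logic itself.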
tutorial #2 is not helpful
What do you mean by writing a script that has the credentials?
In tutorial #2, login works through the parameters tab, where you enter the credentials to POST.
But that does not work at Flickr.
So what can you do?
I ran a quick proxy session and used the Yahoo login. The credentials are sent in the POST parameters of the request to https://login.yahoo.com/config/login_verify2.
Thanks, I did the same, but after that you are not logged in; at least I am not.
Are you?
Please try
http://www.flickr.com/groups/geology/members/
and see whether you get the page or are referred to the login again.
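One way to make that check repeatable is to look at the URL you end up on after all redirects. The sketch below is only an illustration in plain Python: the host and path it treats as "login" are taken from the URLs mentioned in this thread, and the function name is made up.

```python
from urllib.parse import urlparse

# Assumption: landing on either of these means the session is not logged in.
LOGIN_HOSTS = {"login.yahoo.com"}
LOGIN_PATH_PREFIXES = ("/signin",)


def looks_like_login_page(final_url):
    """Return True if the final URL (after redirects) points at a login page."""
    parts = urlparse(final_url)
    if parts.hostname in LOGIN_HOSTS:
        return True
    return parts.path.startswith(LOGIN_PATH_PREFIXES)
```

With Python's `urllib.request`, the final URL of a response is available from `response.geturl()`, so the result of requesting the members page can be passed straight into this function.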
nebben,
After proxying the site I notice a number of potential transactions that you'll likely need to turn into scrapeable files. These are the base URLs (minus any post parameters) of those I think you'll want to account for.
1. http://www.flickr.com/
2. http://www.flickr.com/signin/
3. https://login.yahoo.com/config/login
4. https://login.yahoo.com/config/login
5. https://login.yahoo.com/config/validate?.src=flickrsignin&.pc=8190&.scrumb=8ayBQgx3bUT&.pd=c%3DJvVF95K62e6PzdPu7MBv2V8-&.intl=us&.done=http%3A%2F%2Fwww.flickr.com%2Fsignin%2Fyahoo%2F%3Fredir%3Dhttp%253A%252F%252Fwww.flickr.com%252F
6. http://www.flickr.com/signin/yahoo/?redir=http%3A%2F%2Fwww.flickr.com%2F&.data=<long string here>&.ys=Jwn3dgSzsZrBWXqZiFxcVl.Jmf4-
7. http://www.flickr.com/cookie_check.gne?pass=http%3A%2F%2Fwww.flickr.com%2F&fail=register_cookies.gne
8. http://www.flickr.com/
Of the above, you may only need to pass the correct data in the POST or GET for the non-302 items. screen-scraper follows all 302 redirects automatically (including the data sent in the POST or GET, because that data is generated on the server and not on the client).
#5 (the https://login.yahoo.com/config/validate request) could be the one that you need to pay the closest attention to.
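To see what that validate request actually carries, it can help to split its query string into individual parameters. This is just a sketch using Python's standard library on the URL captured above; the values shown there come from one proxy session and will differ on every login, so treat them as placeholders rather than something to hard-code.

```python
from urllib.parse import urlsplit, parse_qs

# URL copied from the proxy session above; token values are session-specific.
VALIDATE_URL = (
    "https://login.yahoo.com/config/validate?.src=flickrsignin&.pc=8190"
    "&.scrumb=8ayBQgx3bUT&.pd=c%3DJvVF95K62e6PzdPu7MBv2V8-&.intl=us"
    "&.done=http%3A%2F%2Fwww.flickr.com%2Fsignin%2Fyahoo%2F%3Fredir%3D"
    "http%253A%252F%252Fwww.flickr.com%252F"
)


def query_parameters(url):
    """Return the URL's query string as a dict of decoded name -> value."""
    pairs = parse_qs(urlsplit(url).query, keep_blank_values=True)
    # Each name appears once in this URL, so keep only the first value.
    return {name: values[0] for name, values in pairs.items()}


if __name__ == "__main__":
    for name, value in query_parameters(VALIDATE_URL).items():
        print(name, "=", value)
```

Parameters such as `.scrumb` and `.pd` look like server-issued tokens, so if this request has to be built by hand they would most likely need to be extracted from an earlier response.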
Hope this helps,
Scott