Can't find how to get link to Product Pages
I am trying to scrpae this site: http://www.rygor.usedtrucks.mercedes-benz.co.uk/used
The product page itself looks easy enough to scrape, I just cannot get the URL of each product?
An XML file is produced that has the data to be able to construct the URL, but I do not know how to replace the blank spaces in one of the fields with a hyphen to enable me to build the url.
Can someone tell me how to do that or offer an alternative solution?
Thanks
Jason
Sorry I am not that advanced...
I don't understand your instructions I'm afraid could you please give me a few pointers to where I will get this information?
It may be that the scrapable file I am generating isn't right as I can't see the information (or I don't know how to get it)?
Thanks
Jason
Not as advanced as all that...
Hi
Thanks for your response
All I can see is this after window.location:
newcontent += '
I know the components of the seo url, but I cannot put it together as one of the fields has spaces in so I would need to insert a hyphen into it - this is where I am struggling.
I can get the info from a scrapable file:
{"fuel":"Diesel","vid":"19025291","colour":"White Cab Blue Body","model":"Axor","town":"","vin":"STOCK1824K","mileage":"1","variant":"1824k Skip Loader","weight":"4x2 Rigid","valve":"6 Cylinder","regyear":"2014","bodystyle":"Specialist","dealername":"Rygor Commercials Ltd","make":"Mercedes-Benz","trim":"","transmission":"Manual","make_id":"17","phone":"0844 411 9127","length":"N/A","approved":"0","mot":"","dgroupname":"Rygor Commercials Ltd","image_count":"6","price":"69,995","dealer":"5632","chassis":" Day Cab"}
This is how it is made up:
Var SEOurl = GetSEOurl('/used/'+vehicle['regyear']+'/'+vehicle['make']+'/'+vehicle['model']+'/'+vehicle['regyear']+'-'+vehicle['make']+'-'+vehicle['model']+'-'+vehicle['metacolour']+'-'+vehicle['town']+'-for-sale-'+vehicle['vin']);
Am I missing something very obvious here?
Thanks
Jason
The first item is a JSON
The first item is a JSON object, and they are "name":"value", so you scrape out each of those, and create the URL based on the JavaScript function you've pasted.
It's using JavaScript, and on
It's using JavaScript, and on each row there is a DIV tag with "onclick='window.location" and a relative URL.