Allocine.fr (TV Shows) scraper - Printable Version
+- Kodi Community Forum (https://forum.kodi.tv)
+-- Forum: Development (https://forum.kodi.tv/forumdisplay.php?fid=32)
+--- Forum: Scrapers (https://forum.kodi.tv/forumdisplay.php?fid=60)
+--- Thread: Allocine.fr (TV Shows) scraper (/showthread.php?tid=25409)
Allocine.fr (TV Shows) scraper - The_Dogg - 2007-03-23

I'm working on a TV show scraper for allocine.fr. I'm down to the episode list, but I have a small problem: I use the scrap.exe tool to test it, and when the tool gets the links for the episode list, an "&" sign gets lost. Let me show you:

Code:
</status><premiered>

This is the output of the scrap.exe tool. You can see that in the <details> tag the URLs are OK:

Code:
<url>http://www.allocine.fr/series/episodes_gen_csaison=1511&cserie=513.html</url>

but the episode list URL has lost its "&":

Code:
Episodelist URL 1:http://www.allocine.fr/series/episodes_gen_csaison=1511cserie=513.html

And here is the code from the scraper.xml:

Code:
<RegExp input="$$8" output="<episodeguide>\1</episodeguide>" dest="5+">

I tried replacing the &amp; with only &, and I tried putting it twice (&amp;&amp; and &&); the & sign never shows up. But when I replace the &amp; with &quot;, the " sign appears where I need it. Only the & doesn't seem to work.

Any help would be appreciated.

The_Dogg

- The_Dogg - 2007-03-23

After a little more research I found the way to make the missing & show up. I had to put

Code:
&amp;amp;

so the resulting scraper code is:

Code:
<RegExp input="$$8" output="<episodeguide>\1</episodeguide>" dest="5+">

- spiff - 2007-03-23

The reason for this is: you are in an XML document, and you return XML. Each time the XML is parsed you need another &amp;, or the & will be stripped because a bare & is not a valid XML character.
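
To illustrate spiff's point about one level of escaping per parse, here is a minimal Python sketch. It does not use the scraper engine or scrap.exe. It relies only on Python's standard XML parser, and `build_scraper_xml` and `parse_twice` are hypothetical helper names invented for this example. The assumption is that the scraper file is parsed once, and the XML it returns is parsed a second time.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

URL = "http://www.allocine.fr/series/episodes_gen_csaison=1511&cserie=513.html"


def build_scraper_xml(url_text):
    """Hypothetical stand-in for a scraper.xml rule: the output attribute holds XML."""
    inner = "<url>" + url_text + "</url>"  # the XML the scraper returns
    # Escape once more so that inner can sit inside an attribute of the outer XML.
    return '<RegExp output="' + escape(inner, {'"': "&quot;"}) + '"/>'


def parse_twice(scraper_xml):
    """Parse the outer document, then parse the XML held in its output attribute."""
    returned = ET.fromstring(scraper_xml).get("output")  # first parse
    try:
        return ET.fromstring(returned).text  # second parse
    except ET.ParseError:
        return None  # a bare & is not well-formed for a strict parser


# URL escaped before wrapping: &amp; in the returned XML, &amp;amp; in the file.
escaped_for_both_parses = build_scraper_xml(escape(URL))
# URL wrapped as-is: only &amp; in the file, so a bare & reaches the second parse.
escaped_for_one_parse = build_scraper_xml(URL)

print(escaped_for_both_parses)
print(parse_twice(escaped_for_both_parses))
print(parse_twice(escaped_for_one_parse))
```

Python's parser is strict and rejects the bare & outright, which is why the sketch returns None in the second case. According to spiff's reply, the scraper's parser strips the character instead, which matches the URL with the missing & in the scrap.exe output.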