2010-07-24, 05:17
I am trying to optimize the IMDb scraper so that it uses the year from the filename, if there is one.
First I try to get the year. Then I try to get all titles, and if I got a year, it should match only the titles with that year. After that I rematch the previous result and add the tags for the result list.
It all works well as long as I disable repeat="yes" in the third regex, which doesn't make much sense. As soon as I enable repeat I get all titles, and it behaves as if my second regex did not exist. Help please, I am stuck :/
Part of GetSearchResults:
[HTML]
<RegExp input="$$1" output="\1" dest="4+">
<!-- first regex - get the year -->
<RegExp input="$$2" output="\1" dest="6">
<expression clear="yes">%20\(([0-9]{4})\)</expression>
</RegExp>
<!-- second - try to match only titles with the correct year - using this regex doesn't change anything:
(><a href="/title/[t0-9]*/[^>]*>[^<]*</a> *\({0,1}$$6\){0,1})
-->
<expression repeat="yes" noclean="1">(><a href="/title.*$$6)</expression>
</RegExp>
<RegExp input="$$4" output="<entity><title>\2</title><year>\3</year><url>http://www.imdb.de/title/\1/#$$6</url><id>\1</id></entity>" dest="3+">
<!-- third regex - match the final titles and create nice result items -->
<expression repeat="yes" noclean="1,2">><a href="/title/([t0-9]*)/[^>]*>([^<]*)</a> *\(([0-9]*)</expression>
</RegExp>[/HTML]
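To make the intended behaviour concrete, here is a rough Python sketch of the three steps outside the scraper engine. It is not scraper XML and says nothing about how the engine evaluates nested RegExp elements. The function names, the sample URL, and the assumption that `$$2` holds the search URL are mine, for illustration only.

```python
import re

# Step 1 pattern: the year as it appears in the URL-encoded query, e.g. "%20(1999)".
YEAR_RE = re.compile(r"%20\(([0-9]{4})\)")

# Step 3 pattern, copied from the third regex: id, title, and year of one result link.
TITLE_RE = re.compile(r'><a href="/title/([t0-9]*)/[^>]*>([^<]*)</a> *\(([0-9]*)')


def extract_year(url):
    """Return the four-digit year from the URL, or None if there is none."""
    match = YEAR_RE.search(url)
    return match.group(1) if match else None


def search_results(html, url):
    """Return every title link, keeping only the wanted year when one is known."""
    wanted_year = extract_year(url)
    results = []
    for match in TITLE_RE.finditer(html):
        imdb_id, title, year = match.groups()
        # Step 2: filter each match by year instead of using one combined pattern.
        if wanted_year is not None and year != wanted_year:
            continue
        results.append({"id": imdb_id, "title": title, "year": year})
    return results
```

With a year in the URL, only the entries from that year should come back. Without one, every title link is returned unchanged.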