2010-06-04, 17:16
Hello,
I've got serious problems on getting the "Altersfreigabe"/Certification used for my scraper. In the US-IMDB we have every "Altersfreigabe"/Certification with a link (href) and the scraper can use the repeat function. German is only:
No problem to determine the correct <div> from USA
... up to wan:GP with all the "Altersfreigabe"/Certification inside.
But how to separate them for a nice human readable view?
Next problem:
can be deliverd as well, meaning: any combinations with italic info and only one country is given back by database.
So I've got several problems:
- how to divide the countrys in a nice readable form?
- how to match the different types given by the website?
Regards,
Eisbahn
I've got serious problems on getting the "Altersfreigabe"/Certification used for my scraper. In the US-IMDB we have every "Altersfreigabe"/Certification with a link (href) and the scraper can use the repeat function. German is only:
Code:
<div class="info">
<h5>Altersfreigabe:</h5>
<div class="info-content">
USA:PG-13 <i>(certificate #45663)</i> | Südkorea:12 | UK:12A | Norwegen:11 | Irland:12A | Schweden:11 | Singapur:PG | Dänemark:11 | Brasilien:12 | Finnland:K-13 | Schweiz:12 <i>(canton of Vaud)</i> | Schweiz:12 <i>(canton of Geneva)</i> | Niederlande:12 | Philippinen:PG-13 <i>(MTRCB)</i> | Australien:M | Portugal:M/6 <i>(Qualidade)</i> | Kanada:14A <i>(British Columbia/Manitoba)</i> | Kanada:G <i>(Quebec)</i> | Kanada:PG <i>(Alberta/Ontario)</i> | Deutschland:12 | Neuseeland:M | Island:10 | Hong Kong:IIA | Taiwan:GP <i>(original rating)</i> | Argentinien:13 | Peru:14 | Japan:G | Mexiko:B | Taiwan:GP
</div>
</div>
![Tongue Tongue](https://forum.kodi.tv/images/smilies/razz.png)
Code:
<RegExp input="$$1" output="<certification>\2 \4</certification>" dest="5+">
<expression repeat="yes"><h5>Altersfreigabe:</h5>\n<div class="info-content">\n([^\n]*)?</expression>
</RegExp>
But how to separate them for a nice human readable view?
Next problem:
Code:
<div class="info">
<h5>Altersfreigabe:</h5>
<div class="info-content">
Deutschland:0 <i>(free)</i>
</div>
</div>
Code:
<div class="info">
<h5>Altersfreigabe:</h5>
<div class="info-content">
Deutschland:12 | USA:PG-13 <i>(certificate #45663)</i>
</div>
</div>
So I've got several problems:
- how to divide the countrys in a nice readable form?
- how to match the different types given by the website?
Regards,
Eisbahn