2010-06-04, 13:42
Hello,
trying to develop my first scraper for german DVDs and got a little problem...
I don't get the runtime parsed :=(
The source of the HTML is something like
What I want to get is the runtime (in german Länge), here it is 93 Min.
The corresponding scraper statement is
As result I only get <runtime></runtime>. Of course I would like to get something like <runtime>93 Min</runtime>. Where can I define how my result looks like, or better: which parts of my RegEx should be taken as result?
Any help? Regards,
Eisbahn
trying to develop my first scraper for german DVDs and got a little problem...
I don't get the runtime parsed :=(
The source of the HTML is something like
Code:
[...]
<a class="tn15more inline" href="/title/tt0195234/releaseinfo#akas" onClick="(new Image()).src='/rg/title-tease/akas/images/b.gif?link=/title/tt0195234/releaseinfo#akas';">Mehr ansehen</a> »
</div>
</div>
<div class="info">
<h5>Länge:</h5>
<div class="info-content">
93 Min
</div>
</div>
<div class="info">
<h5>Land:</h5>
<div class="info-content">
UK
</div>
[...]
The corresponding scraper statement is
Code:
<RegExp input="$$1" output="<runtime>\1</runtime>" dest="5+">
<expression trim="1"><h5>L&#xE4;nge:</h5>\n<div class="info-content">\n[0-9]* Min</expression>
</RegExp>
Any help? Regards,
Eisbahn