(2012-06-06, 10:41)Martijn Wrote: (2012-06-06, 09:38)john.doe Wrote: (2012-06-06, 09:33)olympia Wrote: This is not possible to achieve on scraper level.
Yes, this is a pitty, but I beleive the community at the end can create enough push on RT to correct these issues.
Aww that's bad news. Can't the scraper python code do an independent http request to RT and do the parsing steps I described? If that was possible, it *would* be possible to super accurately find the movie even though there is no IMDB TT id.
Why don't you just make sure RT is fixed?
I hate the idea of creating some work around instead of fixing the main issue. Sure it takes some time however that way you have a way more accurate result for now and in the future
Wow, you make it sound so easy. "Just fix RottenTomatoes". Yeah, let's all fix the nearly 1 million movies listed at RT, going through them one by one and ensuring that they all have an IMDb link. Geeze. Sounds so easy when you put it like that.
Call me when you are done with that.
In the meantime, it is a GREAT idea to use the method I described; you would get 100% accuracy in the matching if done the way I mentioned.
* Grab English title and Year at IMDb
* If that IMDb ID does not exist at RT, do a RT search for English title + Year
* Select top 5 actors ("top billed", ranked by fame at IMDb) from IMDb's cast list and use that to ensure that they are all also found at RT, thus guaranteeing that you did indeed find a 100% accurate match for the exact movie
* Use the RT info we now have access to.
* ACTUALLY SOLVE THE ISSUE AND CREATE A USABLE SCRAPER INSTEAD OF DREAMING ABOUT MANUALLY EDITING THE METADATA OF NEARLY A MILLION MOVIES AT RT.
The last point is the most important one.
![Rofl Rofl](https://forum.kodi.tv/images/smilies/rotfl.png)
In an ideal world, we could fix RT and link every single movie to its IMDb ID, but in reality it's just too much work. So by doing this very accurate workaround, we'd have an accurate scraper either way. In fact the scraper could log messages suggesting that the person contributes the IMDb <-> RT connection it has found for each movie where no such connection existed.