Do "reverse scrapers" exist?
#1
If I have understood correctly, currently one has to rely on TMDB to find the information of a given movie. However TMDB has significantly less titles than IMDB has. I know that I have quite a few (specially documentary) movies in my collection that are in IMDB but not in TMDB. I know this is true in some cases because I entered that information into IMDB myself, and now have it scraped back to the my Kodi database (hosted on a local SQL server).

Has anyone devised a "reverse scraper" to extract movie or television series information from the local Kodi database and push it over to a database website (eg. TMDB or THETVDB) over their API? Are database website API's comprehensive enough to be able to do this?
Reply
#2
I have thought about this myself in the past.  It is a FANTASTIC Idea, and wish it could be implemented in some way.  I am always filling in information that is missing, and always think it is a shame that it could not be  shared. I often create images for movies such as the DiskArt.png's.  I used to upload them to Fanart.tv, but they are overly zealous at denying images.  I had a VERY high resolution disklart that I created, denied due to LITERALLY 2 stray pixels.  I gave up and just create them foir myself.  I always wished they could be reverse scrapeed, as I feel an image with 2 stray pixels is better than nothing Wink

The problem is maintaing the accuracy and integrity of the information.  It would be so easy for someone to put the wrong description, year and all the other information.  To verify the information as it was "reverse scraped"  would take a full time staff.
------------------------------------------
Windows 11 Pro / Intel i9 13900k with an RTX 4070/ 64 gigs RAM/ MCE Remote /45 Tb storage / via HDMI to a 80" OLED TV / with just over 7,400 movies

Have you entered your 5 Movies? - Support themoviedb.org as THE best open source movie information site.
Reply
#3
On TMDB, you can read data with the API, but not write it. Movies and series data can only be updated using the website.
Reply
#4
(2024-12-12, 01:28)JohnWPB Wrote: The problem is maintaing the accuracy and integrity of the information. It would be so easy for someone to put the wrong description, year and all the other information.  To verify the information as it was "reverse scraped" would take a full time staff.
Then again, any information entered into the website (which is a lot easier) should have similar stewardship.
Reply
#5
Sad 
(2024-12-12, 09:41)jkirsch Wrote: On TMDB, you can read data with the API, but not write it. Movies and series data can only be updated using the website.
Such a shame. Though, a program that reads your local database and compares the information with the website API might be useful, if it could then prompt you to update the information on the website.
Reply

Logout Mark Read Team Forum Stats Members Help
Do "reverse scrapers" exist?0