508013 Website PHP Scraping Script Modification

I had a PHP/MySQL script created for me that does the following:

1. Takes a large list of every movie that's released

2. Runs a search for the title on [url removed, login to view]

example: [url removed, login to view]

- The script takes the first result under 'Popular Titles'

- If there are no Popular Title results, it simply takes the first link that it can find.

3. On the IMDB result page, it scrapes the rating out of 10, and the Metascore out of 100.

4. Next the script searches for the title at [url removed, login to view] and goes to the first result.

It scrapes the 'average rating' out of 10 from this page.

This method works fairly well for the most part, but there are a few ways I would like to improve the script.



- In Step #2, IF there are no 'Popular Title' results, instead of taking the first link that it finds on the page, I would like it to take the first result under 'Titles (Exact Matches)'. If neither of those exist, well, the script can just skip the movie.

- When going to the result page on IMDB, the script would make note of the IMDB ID#. This can be found right in the URL. For example:

[url removed, login to view] <-- the ID# is 0103064

- Instead of searching on RottenTomatoes, it would use the RottenTomatoes JSON API interface which can find a movie based on the IMDB ID#. This should be more accurate than a blind search on RottenTomatoes. To find the movie, we just use the following URL:

[url removed, login to view]

And the result we're looking for is the one under "links": {"alternate": - in this case it is [url removed, login to view]

The script would then proceed to the RottenTomatoes page like it does now, scraping the score.

Beceriler: Her şey Kabul, MySQL, PHP, Kabuk Betiği, Web Scraping, Web Sitesi Yönetimi

Daha fazlasını gör: www rottentomatoes com, www imdb com, www imdb, web scraping api, true results, skip searching, scraping the web, scraping com, rottentomatoes com, q find, imdb com, find php id of a website, We Scraping , url scraping, php json , panther, Movie Script, json script, JSON PHP, imdb rating api, average php mysql, php list json, php json list, web api json php, website scraping php

İşveren Hakkında:
( 19 değerlendirme ) Calgary, Canada

Proje NO: #2253940



No problem :)

1 gün içinde %selectedBids___i_sum_sub_4%%project_currencyDetails_sign_sub_5% USD
(1 Yorum)