The main technical skill for this project is web scraping. For this project I am wanting 12 years (years 2001/02 to 2012/13) of all UEFA Champions League football matches player data scraped off of a website.
The data I want is at the following website: [url removed, login to view]
Underneath the tab history there are the years 1955/56 to 2011/12. I will want the information described below for the years 2001/02 to 2011/12, and the current season which is not under the history tab, but is one the website stated above.
To get to the information, you need to click on the needed year under the history tab. For this example I will click on the year 2001/02 (http://www.uefa.com/uefachampionsleague/season=2001/[url removed, login to view]). Then click on the tab entitled “Matches” (http://www.uefa.com/uefachampionsleague/season=2001/matches/[url removed, login to view]). Then click on the tab entitled “All Matches” (http://www.uefa.com/uefachampionsleague/season=2001/matches/all/[url removed, login to view]). For the given season (for this example 2001/02) all of the matches played that year are listed. I will want information from each of the matches for each of the 12 years noted above. To get to the information for a given match you must click on the score between the two teams’ name (example: 0-1, 0-1, or 4-0). For this example (which is also the example data in the excel spreadsheet provided), I will click on the third match on the list of Levski vs Željezničar by clicking on the 4-0 between the two teams’ name (http://www.uefa.com/uefachampionsleague/season=2001/matches/round=1542/match=67818/[url removed, login to view]). Half of the information that will need to be scraped will be on this page. In the attached excel spreadsheet the data for column “round” through column “opposing teams red cards” (column A through column BG) can be found on this page. The rest of the information can be found by clicking on the hyperlink “more” under match statistics on the right side of the page (http://www.uefa.com/uefachampionsleague/season=2001/matches/round=1542/match=67818/postmatch/statistics/[url removed, login to view]). In the attached excel spreadsheet the data for column “goals scored (GS)” through column “red cards (RC)” (column BH through column BP) can be found on this page. The information needed from each of these pages are the column headings of the excel spreadsheet provided.
For the year 2012/13 data instead of choosing a year under the history tab, click on the “Matches” tab on the website at the top of this page (http://www.uefa.com/uefachampionsleague/season=2013/matches/[url removed, login to view]). Then click on the all matches tab (http://www.uefa.com/uefachampionsleague/season=2013/matches/all/[url removed, login to view]). Then you will be at the list of all games for the 2012/13 season.
As can be noted in the excel spreadsheet the data is to be organized at the player level with each row of data being for an individual player in the match. Each match should create 36 rows of information for each of the 36 players that are part of the match (18 individuals from each team). Since for each of the 12 years there are roughly 200 matches with 36 players in each match there should be roughly 86,400 rows of data.
I want to have this project finish in the next 3-4 days.
I will also want the data either in an excel spreadsheet or a csv file.
14 freelancers are bidding on average $218 for this job
Seasoned web scraper. I worked on many similar projects, I have big experience in data mining projects. I can finish this task in short time, with the best quality.
Hi, I can do the web scraping. I've done similar tasks before for ESPN and [url removed, login to view] I can deliver you without problems in 3-4 days as you need. Please let me know any doubt/comment. Thanks, David