I have a project that involves scraping for NCAA men's college basketball from statsheet: [url removed, login to view]
In particular, I need game statistics for all teams and all games for the years 2001-2012. Furthermore, I'd like these to be integrated with the RPI ranking data for each game date ([url removed, login to view]).
So I want a single csv file for each full season, with each row representing an individual game. Columns would be the home/away teams' identifiers, the ranking info for each team prior to that date's game (from the RPI tables for the relevant date), as well as previous wins/losses for the year, each team's conference, etc. Also, each row would include the in-game statistics for that particular game (shots attempted, points scored, turnovers, etc). These would have to be scraped from each team's page individually. (For instance, at: [url removed, login to view]).
If you decide to take this project, I will give more info on the statistics I want.