Chapter 3: Predicting Sports Winners with Decision Trees
More on pandas
http://pandas.pydata.org/pandas-docs/stable/tutorials.html
The pandas library is a great package—anything you normally write to do data loading is probably already implemented in pandas. You can learn more about it from their tutorial, linked above.
There is also a great blog post written by Chris Moffitt that overviews common tasks people do in Excel and how to do them in pandas: http://pbpython.com/excel-pandas-comp.html
You can also handle large datasets with pandas; see the answer, from user Jeff (the top answer at the time of writing), to this StackOverflow question for an extensive overview of the process: http://stackoverflow.com/questions/14262433/large-data-work-flows-using-pandas.
Another great tutorial on pandas is written by Brian Connelly: http://bconnelly.net/2013/10/summarizing-data-in-python-with-pandas/
More complex features
http://www.basketball-reference.com/teams/ORL/2014_roster_status.html...