자료 분석

강의

 * Predictive Analytics: Generalized Linear Regression 등

도구

 * pandas (Python Data Analysis Library)
 * 마이크로소프트의 정보생산 도구 - BI 도구 (Excel 2013의 PowerPivot과 Power View), Data Explorer, HDInsight Service (하둡), SQL Server 2012 PDW (하둡에 SQL 질의하는 PolyBase)

자료간 상관관계를 분석할 때 유의할 점 - For example, David Leinweber showed back in 2007 that data mining techniques could show a strong but spurious correlation between the changes in the S&P 500 stock index and butter production in Bangladesh. There's another great correlation between the use of Facebook and the rise of the Greek debt crisis.