Publications
Publications
- August 2018 (Revised September 2018)
- HBS Case Collection
LendingClub (B): Decision Trees & Random Forests
By: Srikant M. Datar and Caitlin N. Bowler
Abstract
This case builds directly on the LendingClub (A) case. In this case students follow Emily Figel as she builds two tree-based models using historical LendingClub data to predict, with some probability, whether borrower will repay or default on his loan.
Technical topics include: (1) Decision trees as a modelling technique, overfitting and induction bias, model validation; (2) Random forest as an ensemble-style modelling technique, bootstrapping, random feature selection; and (3) Log loss as a metric for evaluating and comparing models, feature impact.
Technical topics include: (1) Decision trees as a modelling technique, overfitting and induction bias, model validation; (2) Random forest as an ensemble-style modelling technique, bootstrapping, random feature selection; and (3) Log loss as a metric for evaluating and comparing models, feature impact.
Keywords
Data Science; Data Analytics; Decision Trees; Investment; Financing and Loans; Analytics and Data Science; Analysis; Forecasting and Prediction
Citation
Datar, Srikant M., and Caitlin N. Bowler. "LendingClub (B): Decision Trees & Random Forests." Harvard Business School Supplement 119-021, August 2018. (Revised September 2018.)