*** You must use the Project 2 Template provided for your written report. *** (if you prefer not to use Word, you can copy and paste this format in a different editor as long as you respect the stated page structure and page limit.)
In this project, we will use the Census-Income (KDD) Data Set (use the census-income.data.gz data file with k-fold cross-validation, so no need to use the census-income.test.gz data file). This dataset is available at the UCI Machine Learning Repository.
Run experiments with and without discretizing the predicting attributes; removing attributes that are too related to the target (e.g., casual and registered when predicting cnt) or that make the trees too long; and with any other pre-processing and post-processing that produce useful and meaningful models.