Description

Chapter 6, page 127, problems 5-10.

OccupationGenderAgeSalaryServiceFemale45$48,000 Male25$25,000 Male33$35,000ManagementMale25$45,000 Female35$65,000 Male26$45,000 Female45$70,000SalesFemale40$50,000 Male30$40,000StaffFemale50$40,000 Male25$25,000

Consider the data in above Table. The target variable is salary. Start by discretizing salary as follows:

Less than $35,000 Level 1

$35,000 to less than $45,000 Level 2

$45,000 to less than $55,000 Level 3

Above $55,000 Level 4

5. Construct a classification and regression tree to classify salary based on the other variables. Do as much as you can by hand, before turning to the software.
6. Construct a C4.5 decision tree to classify salary based on the other variables. Do as much as you can by hand, before turning to the software.
7. Compare the two decision trees and discuss the benefits and drawbacks of each.
8. Generate the full set of decision rules for the CART decision tree.
9. Generate the full set of decision rules for the C4.5 decision tree.
10. Compare the two sets of decision rules and discuss the benefits and drawbacks of each.