Regarding Weka:
java -Xmx768m -jar weka.jar
Regarding Python:
DATE | OUTLOOK | TEMPERATURE | HUMIDITY | WIND | PLAYS |
02/13/12 | mostly sunny | 47 | 25 | strong | no |
03/10/12 | mostly cloudy | 66 | 57 | weak | yes |
06/28/12 | cloudy | 91 | 75 | medium | yes |
07/12/12 | sunny | 82 | 27 | strong | no |
08/30/12 | rainy | 76 | 80 | weak | no |
09/23/12 | drizzle | 66 | 70 | weak | yes |
11/24/12 | sunny | 52 | 60 | medium | no |
12/19/12 | mostly sunny | 41 | 30 | strong | no |
01/12/13 | cloudy | 36 | 40 | ? | no |
04/13/13 | mostly cloudy | 57 | 40 | weak | yes |
05/20/13 | mostly sunny | 68 | 50 | medium | yes |
06/28/13 | drizzle | 73 | 20 | weak | yes |
07/06/13 | sunny | 95 | 85 | weak | yes |
08/20/13 | rainy | 91 | 60 | weak | yes |
09/01/13 | mostly sunny | 80 | 10 | medium | no |
10/23/13 | mostly cloudy | 52 | 44 | weak | no |
[mean - (k+1)*sd, mean - k*sd) for all integer values k, i.e., k = ..., -4, -3, -2, -1, 0, 1, 2, ...
Assume that the mean of the attribute HUMIDITY above is 48 and that the standard deviation sd of this attribute is 22.5. Discretize HUMIDITY by hand using this new approach. Show your work.
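As a way to check hand calculations, the interval rule above can be sketched in Python. This is only an illustrative sketch: the helper names bin_index and bin_bounds are hypothetical (not part of the assignment), and the sample values passed in are arbitrary. Solving mean - (k+1)*sd <= x < mean - k*sd for k gives k = ceil((mean - x)/sd) - 1.

```python
import math

def bin_index(x, mean, sd):
    """Return the integer k with mean - (k+1)*sd <= x < mean - k*sd.

    Hypothetical helper name, used only for illustration.
    """
    return math.ceil((mean - x) / sd) - 1

def bin_bounds(k, mean, sd):
    """Return (inclusive lower bound, exclusive upper bound) of interval k."""
    return (mean - (k + 1) * sd, mean - k * sd)

MEAN, SD = 48, 22.5  # mean and standard deviation stated in the exercise

# A few arbitrary sample values, each mapped to its interval.
for x in (25, 48, 80):
    k = bin_index(x, MEAN, SD)
    lo, hi = bin_bounds(k, MEAN, SD)
    print(f"{x} -> k = {k}, interval [{lo}, {hi})")
```

Note that larger values of k correspond to lower intervals, because the bounds subtract k*sd from the mean; a value equal to the mean falls in the interval for k = -1.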
See notes on using Matlab and Excel to calculate these matrices. Construct a visualization of each of these matrices (e.g., heatmap) to more easily understand them.
(5 points) If you had to remove 2 of the attributes above from the dataset based on these two matrices, which attributes would you remove and why? Explain your answer.
MODEL | YEAR | COLOR | SALES |
Chevy | 2010 | red | 5 |
Chevy | 2010 | white | 87 |
Chevy | 2010 | blue | 62 |
Chevy | 2011 | red | 54 |
Chevy | 2011 | white | 95 |
Chevy | 2011 | blue | 49 |
Chevy | 2012 | red | 31 |
Chevy | 2012 | white | 54 |
Chevy | 2012 | blue | 71 |
Ford | 2010 | red | 64 |
Ford | 2010 | white | 62 |
Ford | 2010 | blue | 63 |
Ford | 2011 | red | 52 |
Ford | 2011 | white | 9 |
Ford | 2011 | blue | 55 |
Ford | 2012 | red | 27 |
Ford | 2012 | white | 62 |
Ford | 2012 | blue | 39 |
Email your slides to the cs548-staff by the deadline stated at the top of this webpage. The name of the file containing your slides should be:
[your_lastnames_in_alphabetical_order]_proj1_slides.[ext]
This file should be either a PDF file (ext=pdf) or a PowerPoint file (ext=ppt or ext=pptx). Please use only lowercase letters in the file name. For instance, the file with Kabir's and my slides for Project 1 would be named kabir_ruiz_proj1_slides.ppt