WPI Worcester Polytechnic Institute

Computer Science Department
------------------------------------------

CS539 Machine Learning - Spring 2007 
Project 9 - Final Project

PROF. CAROLINA RUIZ 

Due Date: Thursday, April 19th 2007. Slides are due at 3:00 pm and the written report is due at 4:00 pm. 
------------------------------------------

This final assignment consists of two parts:
  1. Work further on your weakest (or more precisely, your least-strong) project so that you turn it into your best one. Take advantage of the feedback that you received from me and/or from your classmates during that project presentation and on your written report. (Each one of you and I have agreed upon which of the projects is your weakest project.)

  2. Complete the following table summarizing each and everyone of your projects. Pick one of the datasets you used throughout the semester and re-run experiments as necessary so that you can report results using the same evaluation approach (if at all possible 10-fold cross-validation, if not 4-fold cross-validation), the same training and testing datasets, etc. The experimenter in the Weka system would be very helpful for this (see the experimenter tutorial included in the Weka package). Also, use the experimenter to determine whether or not the accuracy differences between pairs of these methods are statistically significant with a p value of 0.05 or less. Please include this table in your report and in your slides.

    Technique DecisionTrees ID3/J4.8 NeuralNetworks NaiveBayes/BayesNets Instance-Based IB1/IBk/LBR/LWR GeneticAlgorithms RuleLearning Prism/Foil Technique re-done for final project: ________
    Code (mine/other/adapted)              
    Programming Language              
    Dataset (name):              
    Accuracy              
    Stat. significantly better than: (list methods)              
    Size of the model              
    How readable is the model?              
    Number of attributes used              
    Num. of training instances              
    Num. of test instances              
    Missing values included?(y/n)              
    Pre-processing done              
    Evaluation method used
    (n-fold cross val, n=?)
                 
    Training Time              
    Testing Time              


REPORT AND DUE DATE