The data come from the
census-income (also called "adult") dataset
from the US Census Bureau, which is
available at the
University of California, Irvine (UCI) Machine Learning Repository.
The census-income dataset contains census information for 48,842
people. It has 14 attributes for each person
(age,
workclass,
fnlwgt,
education,
education-num,
marital-status,
occupation,
relationship,
race,
sex,
capital-gain,
capital-loss,
hours-per-week, and
native-country)
and a boolean attribute, class, classifying the income
of the person as belonging to one of two categories: >50K or <=50K.
Convert the census-income data to the ARFF format. For this,
you can either use the tools provided by Weka or make the
conversion outside the Weka system using other tools (e.g., a text
editor, Excel, etc.). Create a census-income.arff file containing the
converted dataset.
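
As one possible way to make the conversion outside Weka, here is a minimal Python sketch. It rests on assumptions that are not stated above: that the raw data is a comma-separated text file with one person per line, no header row, and the 15 fields in the order listed; that "?" marks a missing value; and that age, fnlwgt, education-num, capital-gain, capital-loss, and hours-per-week are the numeric attributes. The function name csv_to_arff and the input file name adult.data are illustrative only.

```python
import csv
import io

# Attribute names in the order listed above, plus the class attribute.
COLUMNS = [
    "age", "workclass", "fnlwgt", "education", "education-num",
    "marital-status", "occupation", "relationship", "race", "sex",
    "capital-gain", "capital-loss", "hours-per-week", "native-country",
    "class",
]

# Assumption: these attributes are numeric; all others are nominal.
NUMERIC = {"age", "fnlwgt", "education-num",
           "capital-gain", "capital-loss", "hours-per-week"}


def quote(value):
    """Single-quote a nominal value, escaping backslashes and quotes."""
    return "'" + value.replace("\\", "\\\\").replace("'", "\\'") + "'"


def csv_to_arff(csv_text, relation="census-income"):
    """Return ARFF text for comma-separated census-income records."""
    rows = []
    for record in csv.reader(io.StringIO(csv_text), skipinitialspace=True):
        record = [field.strip() for field in record]
        if len(record) != len(COLUMNS):
            continue  # skip blank or malformed lines
        rows.append(record)

    lines = ["@relation " + relation, ""]
    for i, name in enumerate(COLUMNS):
        if name in NUMERIC:
            lines.append(f"@attribute {name} numeric")
        else:
            # Nominal values are collected from the data; "?" is left out
            # because ARFF treats it as a missing value, not a category.
            values = sorted({row[i] for row in rows if row[i] != "?"})
            lines.append(f"@attribute {name} {{{','.join(quote(v) for v in values)}}}")

    lines += ["", "@data"]
    for row in rows:
        cells = [
            value if (COLUMNS[i] in NUMERIC or value == "?") else quote(value)
            for i, value in enumerate(row)
        ]
        lines.append(",".join(cells))
    return "\n".join(lines) + "\n"


# Example use (file names are assumptions):
# with open("adult.data") as src, open("census-income.arff", "w") as dst:
#     dst.write(csv_to_arff(src.read()))
```

The sketch derives each nominal attribute's value set from the data itself, so the @attribute declarations always match the records being converted. Open the resulting file in Weka to confirm that it loads with 15 attributes.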