创建并分类新实例WEKA



我已经使用WEKA API和JAVA创建了一个决策树(J48)。首先,我使用ARFF文件训练决策树。

public static void Tree(String Path) throws Exception {//Path path for the arff file
    J48 tree = new J48();         // new instance of tree
    DataSource source = new DataSource(Path);
    Instances data = source.getDataSet();
    // setting class attribute if the data format does not provide this information
    // For example, the XRFF format saves the class attribute information as well
    if (data.classIndex() == -1) {
        data.setClassIndex(data.numAttributes() - 1);
    }
    tree.buildClassifier(data);
    System.out.println(tree.toString());
}

使用的ARFF文件包含780个实例。每个实例都有6个属性{pt1,w1,d1,pt2,w2,d2} all Numeric和一个类{是,否}。我的代码正常工作,我可以使用

看到由此产生的决策树
System.out.println(tree.toString());

现在,我想创建一个新实例(不使用另一个ARFF文件)并对此新实例进行分类。说这个新实例的值是{50、5、5、800、74、3、760}。然后,决策树必须返回相应的类(a是"是"或"否")。

我找到了解决问题的解决方案,希望它会有用。

    //Declaring attributes
    Attribute PT1 = new Attribute("PT1");
    Attribute w1 = new Attribute("w1");
    Attribute d1 = new Attribute("d1");
    Attribute PT2 = new Attribute("PT2");
    Attribute w2 = new Attribute("w2");
    Attribute d2 = new Attribute("d2");
    // Declare the class attribute along with its values contains two nominal values yes and no using FastVector. "ScheduledFirst" is the name of the class attribute        
    FastVector fvClassVal = new FastVector(2);
    fvClassVal.addElement("yes");
    fvClassVal.addElement("no");
    Attribute Class = new Attribute("ScheduledFirst", fvClassVal);
    // Declare the feature vector
    FastVector fvWekaAttributes = new FastVector(7);
    // Add attributes 
    fvWekaAttributes.addElement(PT1);
    fvWekaAttributes.addElement(w1);
    fvWekaAttributes.addElement(d1);
    fvWekaAttributes.addElement(PT2);
    fvWekaAttributes.addElement(w2);
    fvWekaAttributes.addElement(d2);
    fvWekaAttributes.addElement(Class);
    // Declare Instances which is required since I want to use classification/Prediction
    Instances dataset = new Instances("whatever", fvWekaAttributes, 0);
    //Creating a double array and defining values
    double[] attValues = new double[dataset.numAttributes()];
    attValues[0] = 50;
    attValues[1] = 5;
    attValues[2] = 800;
    attValues[3] = 74;
    attValues[4] = 3;
    attValues[5] = 760;
    //Create the new instance i1
    Instance i1 = new Instance(1.0, attValues);
    //Add the instance to the dataset (Instances) (first element 0)        
    dataset.add(i1);
    //Define class attribute position
    dataset.setClassIndex(dataset.numAttributes()-1);
    //Will print 0 if it's a "yes", and 1 if it's a "no"
    System.out.println(tree.classifyInstance(dataset.instance(0)));
    //Here I call dataset.instance(0) since there is only one instance added in the dataset, if you do add another one you can use dataset.instance(0), etc.

最新更新