Choosing the best k value for the k-NN classification of the WDBC data set

Created: 2012-09-28 09:10:31      Last updated: 2012-09-28 11:24:07

The process determines the best value for the parameter k for the k-NN classification of the Breast Cancer Wisconsin (Diagnostic) data set available in the UCI Machine Learning Repository. The optimal k is computed by using 10-fold cross-validation. (To get better results each cross-validation is repeated 10 times and the averages of the runs are considered.) Finally, a k-NN classifier is built and evaluated on the entire data set using the optimal k. During the process the resulting average performances are logged for each k.

Information Preview

Information Run

Not available


Information Workflow Components

Unavailable

Information Workflow Type

RapidMiner

Information Uploader

Information License

All versions of this Workflow are licensed under:

Information Version 1 (of 1)

Information Credits (0)

(People/Groups)

None

Information Attributions (0)

(Workflows/Files)

None

Information Tags (2)

Log in to add Tags

Information Shared with Groups (0)

None

Information Featured In Packs (0)

None

Log in to add to one of your Packs

Information Attributed By (0)

(Workflows/Files)

None

Information Favourited By (0)

No one

Information Statistics

 

Citations (0)

None


Version History

In chronological order:



Reviews Reviews (0)

No reviews yet

Be the first to review!



Comments Comments (0)

No comments yet

Log in to make a comment




Workflow Other workflows that use similar services (0)

There are no workflows in myExperiment that use similar services to this Workflow.