Togaware DATA MINING
Desktop Survival Guide
by Graham Williams
Google

Resources and Further Reading

The example data for contact lenses comes from () and is available from the machine learning repository at ftp://ftp.ics.uci.edu/pub/machine-learning-databases/lenses/lenses.data.

A problem with naïve Bayes arises when the training database has no examples of a particular value of a variable for a particular class.

Bayesian networks relax the conditional independence assumption by identifying conditional independence among subsets of variables.

() addressed the problem of independence by combining naïve Bayes with decision trees. The decision tree is used to partition a database and for each resulting partition (corresponding to separate paths through the decision tree) a naïve Bayes classifier is built using variables not included in the corresponding path through the decision tree. Whilst some improvement in accuracy can result, the final knowledge structures tend to be less compact (with replicated structures). Nonetheless this may be a useful approach for very large databases.



Copyright © 2004-2010 Togaware Pty Ltd
Support further development through the purchase of the PDF version of the book.
The PDF version is a formatted comprehensive draft book (with over 800 pages).
Brought to you by Togaware. This page generated: Sunday, 22 August 2010