Togaware DATA MINING
Desktop Survival Guide
by Graham Williams
Google


Glossary

Image glossary
With the rather unruly explosion of interest in data mining and its well documented commercial successes, and the fact that data mining is the fusion of many disciplines, each with their own heritage, the terminology used in the data mining community is at times quite confusing and often redundant. The beginnings of a Glossary began here but has ceased. Wikipedia is now the canonical source.

Bias: The error in a model due to systematic inadequacies in the learning process. That is, those instances consistently incorrectly classified by models built by the learning algorithm. Modelling error due to bias can be reduced using Boosting. Compare with variance.



Subsections

Copyright © 2004-2008 Togaware Pty Ltd
Support further development through the purchase of the PDF version of the book.
PDF version is properly formatted and forms a comprehensive book (draft with over 600 pages).
Brought to you by Togaware.