Abstract
Data mining seeks unexpected, interesting, or valuable structures in large data sets. There are two distinct classes of data mining tool: modeling and pattern discovery. Difficulties arise in coping with data distortion and errors. The tools of data mining hold great promise for scientific and medical advance, but many theoretical questions remain open.