Mining Imperfect Data
Author | : Ronald K. Pearson |
Publisher | : SIAM |
Total Pages | : 315 |
Release | : 2005-01-01 |
ISBN-10 | : 0898717884 |
ISBN-13 | : 9780898717884 |
Rating | : 4/5 (84 Downloads) |
Download or read book Mining Imperfect Data written by Ronald K. Pearson and published by SIAM. This book was released on 2005-01-01 with total page 315 pages. Available in PDF, EPUB and Kindle. Book excerpt: Data mining is concerned with the analysis of databases large enough that various anomalies, including outliers, incomplete data records, and more subtle phenomena such as misalignment errors, are virtually certain to be present. Mining Imperfect Data describes in detail a number of these problems, as well as their sources, their consequences, their detection, and their treatment. Specific strategies for data pretreatment and analytical validation that are broadly applicable are described, making them useful in conjunction with most data mining analysis methods. Examples are presented to illustrate the performance of the pretreatment and validation methods in a variety of situations, both simulation based, where "correct" results are known unambiguously, and real data examples that illustrate typical cases met in practice.