Data Mining Methods and Models applies a “white box”
methodology, emphasizing an understanding of the model structures underlying
data mining software, while walking you through the various algorithms and
operations on actual large data sets, including a detailed case study, “Modeling
Response to Direct-Mail Marketing.” Also includes a companion Web site,
www.dataminingconsultant.com, where the data sets used in the book may be
downloaded, along with a comprehensive set of data mining resources.
Provides an introduction into data mining methods and models, including
association rules, clustering, K-nearest neighbor, statistical inference, neural
networks, linear and logistic regression, and multivariate analysis Presents a
unified approach based on CRISP methodology, which involves Strategic Risk
Assessment based on Organizational Mode A companion Web site features downloads
of large data sets used in the chapter projects, with a discussion area and
message board, where readers are encouraged to exchange ideas
From the Back Cover
Apply powerful Data Mining Methods and Models to Leverage your Data for
Data Mining Methods and Models provides:
- The latest techniques for uncovering hidden nuggets of information
- The insight into how the data mining algorithms actually work
- The hands-on experience of performing data mining on large data sets
Data Mining Methods and Models:
Applies a "white box" methodology, emphasizing an understanding of the model
structures underlying the softwareWalks the reader through the various
algorithms and provides examples of the operation of the algorithms on actual
large data sets, including a detailed case study, "Modeling Response to
- Tests the reader''s level of understanding of the concepts and methodologies,
with over 110 chapter exercises
- Demonstrates the Clementine data mining software suite, WEKA open source data
mining software, SPSS statistical software, and Minitab statistical software
- Includes a companion Web site, www.dataminingconsultant.com, where the data
sets used in the book may be downloaded, along with a comprehensive set of data
mining resources. Faculty adopters of the book have access to an array of
helpful resources, including solutions to all exercises, a PowerPoint
1. Dimension Reduction Methods.
Need for Dimension Reduction in Data Mining.
Principal Components Analysis.
2. Regression Modeling.
Example of Simple Linear Regression.
Coefficient or Determination.
The ANOVA Table.
Outliers, High Leverage Points, and Influential Observations.
The Regression Model.
Inference in Regression.
Verifying the Regression Assumptions.
An Example: The Baseball Data Set.
An Example: The California Data Set.
Transformations to Achieve Linearity.
3. Multiple Regression and Model Building.
An Example of Multiple Regression.
The Multiple Regression Model.
Inference in Multiple Regression.
Regression with Categorical Predictors.
Variable Selection Methods.
An Application of Variable Selection Methods.
Mallows’ C p Statistic.
Variable Selection Criteria.
Using the Principal Components as Predictors in Multiple Regression.
4. Logistic Regression.
A Simple Example of Logistic Regression.
Maximum Likelihood Estimation.
Interpreting Logistic Regression Output.
Inference: Are the Predictors Significant?
Interpreting the Logistic Regression Model.
Interpreting a Logistic Regression Model for a Dichotomous Predictor.
Interpreting a Logistic Regression Model for a Polychotomous Predictor.
Interpreting a Logistic Regression Model for a Continuous Predictor.
The Assumption of Linearity.
The Zero-Cell Problem.
Multiple Logistic Regression.
Introducing Higher Order terms to Handle Non-Linearity.
Validating the Logistic Regression Model.
WEKA: Hands-On Analysis Using Logistic Regression.
5. Naïve Bayes and Bayesian Networks.
The Bayesian Approach.
The Maximum a Posteriori (MAP) Classification.
The Posterior Odds Ratio.
Balancing the Data.
Naïve Bayes Classification.
Numeric Predictors for Naïve Bayes Classification.
WEKA: Hands-On Analysis Using Naïve Bayes.
Bayesian Belief Networks.
Using the Bayesian Network to Find Probabilities.
WEKA: Hands-On Analysis Using Bayes Net.
6. Genetic Algorithms.
Introduction to Genetic Algorithms.
The Basic Framework of a Genetic Algorithm.
A Simple Example of Genetic Algorithms at Work.
Modifications and Enhancements: Selection.
Modifications and enhancements: Crossover.
Genetic Algorithms for Real-Valued Variables.
Using Genetic Algorithms to Train a Neural Network.
WEKA: Hands-On Analysis Using Genetic Algorithms.
7. Case Study: Modeling Response to Direct-Mail Marketing.
The Cross-Industry Standard Process for Data Mining: CRISP-DM.