Data Preparation for Data Mining Using SAS ( The Morgan Kaufmann Series in Data Management Systems )

Publication series :The Morgan Kaufmann Series in Data Management Systems

Author: Refaat   Mamdouh  

Publisher: Elsevier Science‎

Publication year: 2010

E-ISBN: 9780080491004

P-ISBN(Paperback): 9780123735775

P-ISBN(Hardback):  9780123735775

Subject: TP Automation Technology , Computer Technology;TP39 computer application

Language: ENG

Access to resources Favorite

Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.

Description

Are you a data mining analyst, who spends up to 80% of your time assuring data quality, then preparing that data for developing and deploying predictive models? And do you find lots of literature on data mining theory and concepts, but when it comes to practical advice on developing good mining views find little “how to” information? And are you, like most analysts, preparing the data in SAS?

This book is intended to fill this gap as your source of practical recipes. It introduces a framework for the process of data preparation for data mining, and presents the detailed implementation of each step in SAS. In addition, business applications of data mining modeling require you to deal with a large number of variables, typically hundreds if not thousands. Therefore, the book devotes several chapters to the methods of data transformation and variable selection.

  • A complete framework for the data preparation process, including implementation details for each step.
  • The complete SAS implementation code, which is readily usable by professional analysts and data miners.
  • A unique and comprehensive approach for the treatment of missing values, optimal binning, and cardinality reduction.
  • Assumes minimal proficiency in SAS and includes a quick-start chapter on writing SAS macros.

Chapter

Front Cover

pp.:  1 – 4

Copyright Page

pp.:  5 – 6

Contents

pp.:  6 – 16

List of Figures

pp.:  16 – 18

List of Tables

pp.:  18 – 22

Preface

pp.:  22 – 26

CHAPTER 1. INTRODUCTION

pp.:  26 – 32

CHAPTER 2. TASKS AND DATA FLOW

pp.:  32 – 40

CHAPTER 3. REVIEW OF DATA MINING MODELING TECHNIQUES

pp.:  40 – 54

CHAPTER 4. SAS MACROS: A QUICK START

pp.:  54 – 68

CHAPTER 5. DATA ACQUISITION AND INTEGRATION

pp.:  68 – 88

CHAPTER 6. INTEGRITY CHECKS

pp.:  88 – 108

CHAPTER 7. EXPLORATORY DATA ANALYSIS

pp.:  108 – 124

CHAPTER 8. SAMPLING AND PARTITIONING

pp.:  124 – 140

CHAPTER 9. DATA TRANSFORMATIONS

pp.:  140 – 166

CHAPTER 10. BINNING AND REDUCTION OF CARDINALITY

pp.:  166 – 196

CHAPTER 11. TREATMENT OF MISSING VALUES

pp.:  196 – 232

CHAPTER 12. PREDICTIVE POWER AND VARIABLE REDUCTION I

pp.:  232 – 236

CHAPTER 13. ANALYSIS OF NOMINAL AND ORDINAL VARIABLES

pp.:  236 – 258

CHAPTER 14. ANALYSIS OF CONTINUOUS VARIABLES

pp.:  258 – 272

CHAPTER 15. PRINCIPAL COMPONENT ANALYSIS

pp.:  272 – 282

CHAPTER 16. FACTOR ANALYSIS

pp.:  282 – 292

CHAPTER 17. PREDICTIVE POWER AND VARIABLE REDUCTION II

pp.:  292 – 304

CHAPTER 18. PUTTING IT ALL TOGETHER

pp.:  304 – 322

APPENDIX. LISTING OF SAS MACROS

pp.:  322 – 398

Bibliography

pp.:  398 – 400

Index

pp.:  400 – 418

About the Author

pp.:  418 – 426

The users who browse this book also browse