Skip to main content

Page of 4
and
  1. No Access

    Chapter

    Installing Rattle

    Rattle relies on an extensive collection of free and open source software. Some preliminary steps need to be followed in installing it. The latest installation instructions are maintained at http://rattle.toga...

    Graham Williams in Data Mining with Rattle and R (2011)

  2. No Access

    Chapter

    Support Vector Machines

    A support vector machine (SVM) searches for so-called support vectors which are observations that are found to lie at the edge of an area in space which presents a boundary between one of these classes of observa...

    Graham Williams in Data Mining with Rattle and R (2011)

  3. No Access

    Chapter

    Deployment

    Once a model is developed and evaluated, and we have determined it to be suitable, we then need to deploy it. This is an often-overlooked issue in many data mining projects. It also seems to receive little att...

    Graham Williams in Data Mining with Rattle and R (2011)

  4. No Access

    Chapter

    Interactive Graphics

    There is more to exploring data than simply generating textual and statistical summaries and graphical plots. As we have begun to see, R has some very signi_cant capabilities for generating graphics that assis...

    Graham Williams in Data Mining with Rattle and R (2011)

  5. No Access

    Chapter

    Sample Datasets

    The following sections introduce the datasets that we use throughout the book to demonstrate data mining. R provides quite a collection of datasets. Each of the datasets we introduce here is available through ...

    Graham Williams in Data Mining with Rattle and R (2011)

  6. No Access

    Chapter

    Descriptive and Predictive Analytics

    Modelling is what we most often think of when we think of data mining. Modelling is the process of taking some data (usually) and building a simplified description of the processes that might have generated it...

    Graham Williams in Data Mining with Rattle and R (2011)

  7. No Access

    Chapter

    Introduction

    Data mining is the art and science of intelligent data analysis. The aim is to discover meaningful insights and knowledge from data. Discoveries are often expressed as models, and we often describe data mining...

    Graham Williams in Data Mining with Rattle and R (2011)

  8. No Access

    Chapter

    Boosting

    The Boosting meta-algorithm is an eficient, simple, and easy-touse approach to building models. The popular variant called AdaBoost (an abbreviation for adaptive boosting) has been described as the \best off-t...

    Graham Williams in Data Mining with Rattle and R (2011)

  9. No Access

    Chapter

    Working with Data

    Data is the starting point for all data mining—without it there is nothing to mine. In today's world, there is certainly no shortage of data, but turning that data into information, knowledge, and, eventually,...

    Graham Williams in Data Mining with Rattle and R (2011)

  10. No Access

    Chapter

    Model Performance Evaluation

    The preceding chapters presented a number of algorithms for building descriptive and predictive models. Before we can identify the best from amongst the different models, we must evaluate the performance of th...

    Graham Williams in Data Mining with Rattle and R (2011)

  11. No Access

    Chapter

    Transforming Data

    An interesting issue with the delivery of a data mining project is that in reality we spend more of our time working on and with the data than we do building actual models, as we suggested in Chapter 1. In bui...

    Graham Williams in Data Mining with Rattle and R (2011)

  12. No Access

    Chapter

    Cluster Analysis

    The clustering technique is one of the core tools that is used by the data miner. Clustering gives us the opportunity to group observations in a generally unguided fashion according to how similar they are. Th...

    Graham Williams in Data Mining with Rattle and R (2011)

  13. No Access

    Chapter

    Association Analysis

    Many years ago, a number of new Internet businesses were created to sell books on-line. Over time, they collected information about the books that each of their customers were buying. Using association analysi...

    Graham Williams in Data Mining with Rattle and R (2011)

  14. No Access

    Chapter

    Getting Started

    New ideas are often most effectively understood and appreciated by actually doing something with them. So it is with data mining. Fundamentally, data mining is about practical application—application of the al...

    Graham Williams in Data Mining with Rattle and R (2011)

  15. No Access

    Chapter

    Loading Data

    Data can come in many different formats from many different sources.By using R's extensive capabilities, Rattle provides direct access to such data. Indeed, we are fortunate with the R system in that it is an ...

    Graham Williams in Data Mining with Rattle and R (2011)

  16. No Access

    Chapter

    Decision Trees

    Decision trees (also referred to as classification and regression trees) are the traditional building blocks of data mining and the classic machine learning algorithm. Since their development in the 1980s, dec...

    Graham Williams in Data Mining with Rattle and R (2011)

  17. No Access

    Chapter

    Exploring Data

    As a data miner, we need to live and breathe our data. Even before we start building our data mining models, we can gain signi_cant insights through exploring the data. Insights gained can deliver new discover...

    Graham Williams in Data Mining with Rattle and R (2011)

  18. No Access

    Chapter

    Random Forests

    Building a single decision tree provides a simple model of the world, but it is often too simple or too specific. Over many years of experience in data mining, it has become clear that many models working toge...

    Graham Williams in Data Mining with Rattle and R (2011)

  19. No Access

    Book and Conference Proceedings

    New Frontiers in Applied Data Mining

    PAKDD 2009 International Workshops, Bangkok, Thailand, April 27-30, 2009. Revised Selected Papers

    Thanaruk Theeramunkong in Lecture Notes in Computer Science (2010)

  20. No Access

    Chapter and Conference Paper

    A Survey of Open Source Data Mining Systems

    Open source data mining software represents a new trend in data mining research, education and industrial applications, especially in small and medium enterprises (SMEs). With open source software an enterpris...

    **aojun Chen, Yunming Ye, Graham Williams in Emerging Technologies in Knowledge Discove… (2007)

Page of 4