Cluster Analysis with R and SAS Training Course

Duration

14 hours (usually 2 days including breaks)

Requirements

  • Experience with R programming
  • Experience with SAS

Audience

  • Data Analysts

Overview

R is a programming language and software environment for statistical computing. SAS is a statistical software platform for predictive analysis, data management, advanced analytics, and more. With R in SAS, users can apply cluster analysis, a core data mining technique, to find natural groupings in data.

This instructor-led, live training (online or onsite) is aimed at data analysts who wish to program with R in SAS for cluster analysis.

By the end of this training, participants will be able to:

  • Use cluster analysis for data mining.
  • Master R syntax for clustering solutions.
  • Implement hierarchical and non-hierarchical clustering.
  • Make data-driven decisions that help improve business operations.

Format of the Course

  • Interactive lecture and discussion.
  • Lots of exercises and practice.
  • Hands-on implementation in a live-lab environment.

Course Customization Options

  • To request customized training for this course, please contact us.

Course Outline

Introduction

Cluster Analysis

  • What is cluster analysis?
  • Types of clusters

Cluster Analysis Continued

  • Cluster analysis vs object segmentation
  • Hierarchical vs non-hierarchical clustering

Preparing the Development Environment

  • Installing and configuring SAS
  • Installing and configuring R

Cluster Analysis with SAS

  • Importing data
  • Standardizing data
  • Implementing hierarchical clustering
  • Interpreting output
  • Working with k-means for non-hierarchical clustering
  • Interpreting output

Cluster Analysis with R

  • Using hierarchical clustering functions
  • Working with non-hierarchical clustering functions
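
As a rough illustration of the two topics above, the following minimal sketch uses only functions from base R's stats package (scale, dist, hclust, cutree, kmeans). The built-in iris data set, the choice of three clusters, Ward linkage, and the variable names are all assumptions made for this example; they are not taken from the course's lab material.

```r
# Illustrative sketch only: the iris data and k = 3 are assumptions.
features <- scale(iris[, 1:4])          # standardize each column (mean 0, sd 1)

# Hierarchical clustering: pairwise Euclidean distances, then Ward linkage.
distances  <- dist(features, method = "euclidean")
tree       <- hclust(distances, method = "ward.D2")
hier_group <- cutree(tree, k = 3)       # cut the dendrogram into 3 clusters

# Non-hierarchical clustering: k-means with 3 centers and several random starts.
set.seed(42)                            # make the random starts reproducible
km <- kmeans(features, centers = 3, nstart = 25)

# Cross-tabulate the two solutions to see how far they agree.
print(table(hierarchical = hier_group, kmeans = km$cluster))
```

Calling plot(tree) draws the dendrogram, which can help when deciding how many clusters to cut; km$centers holds the k-means cluster centers on the standardized scale.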

Summary and Conclusion