Duration
21 hours (usually 3 days including breaks)
Requirements
- An understanding of database concepts.
Audience
- Developers
Overview
Pivotal Greenplum is a Massively Parallel Processing (MPP) Data Warehouse platform based on PostgreSQL.
This instructor-led, live training (online or onsite) is aimed at developers who wish to set up a multi-node Greenplum database.
By the end of this training, participants will be able to:
- Install and configure Pivotal Greenplum.
- Model data in accordance to current needs and future expansion plans.
- Carry out different techniques for distributing data across multiple nodes.
- Improve database performance through tuning.
- Monitor and troubleshoot a Greenplum database.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Course Outline
Introduction
Setting up Pivotal Greenplum
Overview of Pivotal Greenplum Features and Architecture
Accessing Data
- DDL, DML, and DQL
Implementing a Table Storage Model
- Understanding tablespaces
- Compressing table data
Distributing the Data
- Distribution keys and partitioning
- Managing joins and indexing
Loading Data
- Table partitioning
OLAP Querying
- Implementing Greenplum functions
Modeling the Data
- Physical design considerations
Expanding the System
- Adding nodes
- Migrating data
Monitoring a Greenplum system
- Database activity and performance
Performance Tuning
- Optimizing queries
- Optimizing SQL joins
- Indexing optimization
Greenplum Best Practices
Troubleshooting
Summary and Conclusiond