Duration
14 hours (usually 2 days including breaks)
Requirements
Good understanding of traditional technologies for data storage (MySQL, Oracle, SQL Server, etc…)
Overview
When traditional storage technologies don’t handle the amount of data you need to store there are hundereds of alternatives. This course try to guide the participants what are alternatives for storing and analyzing Big Data and what are theirs pros and cons.
This course is mostly focused on discussion and presentation of solutions, though hands-on exercises are available on demand.
Course Outline
Limits of Traditional Technologies
- SQL databases
- Redundancy: replicas and clusters
- Constraints
- Speed
Overview of database types
- Object Databases
- Document Store
- Cloud Databases
- Wide Column Store
- Multidimensional Databases
- Multivalue Databases
- Streaming and Time Series Databases
- Multimodel Databases
- Graph Databases
- Key Value
- XML Databases
- Distribute file systems
Popular NoSQL Databases
- MongoDB
- Cassandra
- Apache Hadoop
- Apache Spark
- other solutions
NewSQL
- Overview of available solutions
- Performance
- Inconsitencies
Document Storage/Search Optimized
- Solr/Lucene/Elasticsearch
- other solutions