Library

Course: Learn by Example :HBase - The Hadoop Database

Learn by Example :HBase - The Hadoop Database

Description

Prerequisites: Working with HBase requires knowledge of Java

Taught by a team which includes 2 Stanford-educated, ex-Googlers and 2 ex-Flipkart Lead Analysts. This team has decades of practical experience in working with large-scale data processing jobs. 

Relational Databases are so stuffy and old! Welcome to HBase - a database solution for a new age. 

HBase: Do you feel like your relational database is not giving you the flexibility you need anymore? Column oriented storage, no fixed schema and low latency make HBase a great choice for the dynamically changing needs of your applications. 

What's Covered: 

25 solved examples covering all aspects of working with data in HBase

CRUD operations in the shell and with the Java API, Filters, Counters, MapReduce 

Implement your own notification service for a social network using HBase

HBase and it’s role in the Hadoop ecosystem, HBase architecture and what makes HBase different from RDBMS and other Hadoop technologies like Hive. 

Using discussion forums

Please use the discussion forums on this course to engage with other students and to help each other out. Unfortunately, much as we would like to, it is not possible for us at Loonycorn to respond to individual questions from students:-(

We're super small and self-funded with only 2 people developing technical video content. Our mission is to make high-quality courses available at super low prices.

The only way to keep our prices this low is to *NOT offer additional technical support over email or in-person*. The truth is, direct support is hugely expensive and just does not scale.

We understand that this is not ideal and that a lot of students might benefit from this additional support. Hiring resources for additional support would make our offering much more expensive, thus defeating our original purpose.

It is a hard trade-off.

Thank you for your patience and understanding!

Who is the target audience?

  • Yep! Anyone who is interested in understanding HBase, NoSQL and distributed database management
  • Yep! Engineers who want to move away from RDBMS and use HBase for their database solution
  • Yep! Folks who are using Hadoop ecosystem tools for analytical processing and want a single system for both OLAP and OLTP
Basic knowledge
  • You'll need to have an IDE where you can run Java code, Intellij IDEA and Eclipse are both good options
  • You'll need to have some experience with Java programming
What you will learn
  • Set up a database for your application using HBase
  • Integrate HBase with MapReduce for data processing tasks
  • Create tables, insert , read and delete data from HBase
  • Get an all round understanding of HBase and it's role in the Hadoop ecosystem
Curriculum
Lectures quantity: 42
Common duration: 04:56:42
Session #1 You, This Course and Us
  • You, This Course and Us  

    We start with an introduction to the course, the instructors and what lies ahead for you. 

Session #2 Introduction to HBase
  • The problem with distributed computing  

    Distributed computing is really exciting, but was a really hard problem before the Hadoop ecosystem evolved.

  • Installing HBase  

    Let's get started by Installing HBase. Get a taste of whats to come, start HBase, create a table and insert some data. 

    Please use the attached text file for copying any commands that are shown on the screen. 


  • The Hadoop ecosystem  

    HBase is a database built for the Hadoop ecosystem. Before we get there, let's get a quick understanding of the Hadoop ecosystem.

  • The role of HBase in the Hadoop ecosystem  

    Hadoop has limitations that make HBase a necessary part of the Hadoop ecosystem

  • How is HBase different from RDBMS?  

    HBase is a database - but its quite different from a traditional RDBMS. Get a quick overview of the differences before we dig deeper in future videos

  • HBase Data Model  

    See how column oriented storage and denormalization make HBase's data model quite different from a traditional database. 

  • Introducing CRUD operations  

    HBase supports a limited set of operations - Create, Read, Update and Delete

  • HBase is different from Hive  

    HBase is a database solution in the Hadoop ecosystem, so it seems is Hive. What makes them different?

Session #3 CRUD operations using the HBase Shell
  • Creating a table for User Notifications  

    Example1 - Creating a table for User Notifications

  • Inserting a row  

    Example 2 - Inserting a row

  • Updating a row  

    Example 3 - Updating a row

  • Retrieving a row  

    Example 4 - Retrieving a row

  • Retrieving a range of rows  


    Example 5 - Retrieving a range of rows

  • Deleting a row  


    Example 6 - Deleting a row

  • Deleting a table  

    Example 7 - Deleting a table

Session #5 CRUD operations using the Java API
  • Creating a table with HBaseAdmin  


    Example 8 - Creating a table with HBaseAdmin

  • Inserting a row using a Put object  

    Example 9 - Inserting a row using a Put object

  • Inserting a list of Puts  

    Example 10 - Inserting a list of Puts

  • Retrieving data - Get and Result objects  


    Example 11 - Retrieving data - Get and Result objects


  • A list of Gets  


    Example 12 - A list of Gets

  • Example 13 - Deleting a row  

    Example 13 - Deleting a row

  • A list of Deletes  

    Example 14 - A list of Deletes

  • Mix and match with batch operations  

    Example 15 - Mix and match with batch operations

  • Scanning a range of rows  

    Example 16 - Scanning a range of rows

  • Example 17 - Deleting a table  

    Example 17 - Deleting a table

Session #6 HBase Architecture
  • HBase Architecture  


    HBase Architecture

Session #7 Advanced operations - Filters and Counters
  • Filter by Row id - RowFilter  

    Example 18 - Filter by Row id - RowFilter

  • Filter by column value - SingleColumnValueFilter  

    Example 19 - Filter by column value - SingleColumnValueFilter

  • Apply multiple conditions - Filterlist  

    Example 20 - Apply multiple conditions - Filterlist

  • Retrieve rows within a time range  

    Example 21 - Retrieve rows within a time range

  • Atomically incrementing a value with Counters  

    Example 22 - Atomically incrementing a value with Counters

Session #8 MapReduce with HBase
  • Example 23 : A MapReduce task to count Notifications by Typ  

    Use MapReduce for complex data processing tasks. See how to set up a MapReduce task to count Notifications by type

  • Example 23 continued: Implementing the MapReduce in Java  

    Implement the MapReduce task to summarize Notifications by type in Java. 

  • Demo : Running a MapReduce task  

    See how to run a MapReduce task using the Java JAR.

Session #9 Build a Notification Service
  • Example 24 : Implement a Notification Hierarchy  

    Set up a class hierarchy to represent different types of user notifications for a social network

  • Example 25: Implement a Notifications Manager  

    Use a Notifications Manager to manage tasks such as adding notifications and retrieving notifications for a specific user

Session #10 Installing Hadoop in a Local Environment
  • Hadoop Install Mode  

    Hadoop has 3 different install modes - Standalone, Pseudo-distributed and Fully Distributed. Get an overview of when to use each

  • Hadoop Install Step 1 : Standalone Mode  

    How to set up Hadoop in the standalone mode. Windows users need to install a Virtual Linux instance before this video.

  • Hadoop Install Step 2 : Pseudo-Distributed Mode  

    Set up Hadoop in the Pseudo-Distributed mode. All Hadoop services will be up and running! 

Session #11 Appendix
  • [For Linux/Mac OS Shell Newbies] Path and other Environment Variables  

    If you are unfamiliar with softwares that require working with a shell/command line environment, this video will be helpful for you. It explains how to update the PATH environment variable, which is needed to set up most Linux/Mac shell based softwares.

  • Setting up a Virtual Linux Instance - For Windows Users  

    Hadoop is basically for Linux/Unix systems. If you are on Windows, you can set up a Linux Virtual Machine on your computer and use that for the install.

reviews (1)
Average rating
5
1 voices
Detailed rating
5 stars
100%
4 stars
0%
3 stars
0%
2 stars
0%
1 stars
0%
3 months before
Must have course for better understanding of Hadoop concept...