From the course: Learning Hadoop

Unlock the full course today

Join today to access over 24,900 courses taught by industry experts.

Understand Hive and HBase

Understand Hive and HBase - Hadoop Tutorial

From the course: Learning Hadoop

Understand Hive and HBase

- [Instructor] Next, we're going to learn about the library called Hive. It's a SQL-like query that generates MapReduce code. The library was developed at Facebook and open sourced. It uses H-SQL, so it's a SQL like syntax. It's really important to understand though that it's batch based, not interactive like SQL in a relational database and it relies on HBase, which uses schema on read against the underlying source files. Understanding a bit more about HBase, this is the schema or structure that is applied to the underlying file system, which is usually HDFS. So the Hadoop Distributed File System. HBase is a NoSQL wide-column store type of a database that allows for keys shown here as IDs and semi-structured data in key value pairs that does not have to be identical, so it's called a column store. This visualization helps you to understand how Hive and Hadoop libraries and daemons interact. It's quite complex, but key…

Contents