From the course: Learning Hadoop
Understand Hive and HBase - Hadoop Tutorial
Understand Hive and HBase
- [Instructor] Next, we're going to learn about the library called Hive. It's a SQL-like query language that generates MapReduce code. The library was developed at Facebook and open sourced. It uses HiveQL (HQL), so it's a SQL-like syntax. It's really important to understand, though, that it's batch-based, not interactive like SQL in a relational database, and it relies on HBase, which uses schema on read against the underlying source files. Understanding a bit more about HBase: this is the schema or structure that is applied to the underlying file system, which is usually HDFS, the Hadoop Distributed File System. HBase is a NoSQL wide-column store type of database that allows for keys, shown here as IDs, and semi-structured data in key-value pairs that do not have to be identical from row to row, so it's called a column store. This visualization helps you to understand how Hive and Hadoop libraries and daemons interact. It's quite complex, but key…
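
To make the wide-column idea described above a bit more concrete, here is a minimal sketch in plain Python. It is only a toy in-memory model, not the HBase client API: the `put` and `get` helpers, the `customers` table, and the column names such as `info:name` are all hypothetical and chosen for illustration. It shows rows identified by a key (the IDs) where each row holds its own set of key-value pairs, and those sets do not have to match.

```python
from typing import Dict, Optional

# Toy model of a wide-column table: row key -> {column name: value}.
# Assumption: this is an illustration only, not how HBase stores data
# internally and not the HBase client API.
Table = Dict[str, Dict[str, str]]


def put(table: Table, row_key: str, column: str, value: str) -> None:
    """Store one key-value pair under the given row key."""
    table.setdefault(row_key, {})[column] = value


def get(table: Table, row_key: str, column: str) -> Optional[str]:
    """Return the value for a column, or None if this row lacks it."""
    return table.get(row_key, {}).get(column)


# Hypothetical sample data: two rows with different sets of columns.
customers: Table = {}
put(customers, "id-001", "info:name", "Ana")
put(customers, "id-001", "info:email", "ana@example.com")
put(customers, "id-002", "info:name", "Ben")
put(customers, "id-002", "orders:last_total", "42.50")

# Each row carries only the columns that were written to it.
columns_001 = sorted(customers["id-001"])
columns_002 = sorted(customers["id-002"])
```

Looking up `info:email` for `id-002` returns `None` here, because that row never stored the column. No table-wide schema forces every row to have the same fields, which is the semi-structured behavior the instructor describes.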