800-CEO-READ is closed until Monday, August 19th, when we will become Porchlight Book Company! Read the news here.

Impala in Action: Querying and Mining Big Data

By Ricky Saltzer, Istvan Szegedi, Paul De Schacht

Hadoop queries in Pig or Hive can be too slow for real-time data analysis. Impala, an ultra-speedy query engine from Cloudera, supercharges Hadoop by avoiding the typical Map-Reduce overhead and parallelizing queries so that they can run on multiple nodes. This is a big deal for big data, because with Impala, querying Hadoop takes seconds rather than minutes. Impala's dialect is close to standard SQL, and Impala seamlessly accesses HBase and HDFS (Hadoop Distributed File System), allowing considerable freedom in choice of data formats.

Impala in Action is a hands-on guide to querying Hadoop using Impala. It starts by comparing Impala to traditional databases and database services on Hadoop. Then it explains Impala's SQL dialect and the basics of data access. Next, it tackles data visualization tasks and provides techniques for securing Impala with Apache Sentry. The book also shows how to embed Impala queries in a Java client and how to connect to JDBC and ODBC clients. Advanced readers will appreciate the deep dive into Impala's architecture and the practical insights into the issues complicated configurations and complex queries can cause.

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.





    SHARE THIS
Embedicon
 
9781617291982
eBook

Shopping cart is not available

800-CEO-READ is currently closed, and not accepting new orders until Monday, August 19th. If you have questions about an existing order, please call 800-236-7323 or email customerservice@porchlight.com.

Price: $44.99/ea

1 $44.99 9781617291982 No volume discount available.

About the Paperback

Publisher Not available
Publish date 03/31/2015
Pages 250
ISBN-13 9781617291982
ISBN-10 1617291986
Language English

Categorized Under