Perform interactive, real-time, in-memory analytics on large amounts of data using Cloudera Impala, a massively parallel processing (MPP) engine. This title offers step-by-step guidance to get you started with Impala on your Hadoop cluster. It then shows you how to manipulate data rapidly by writing well-formed SQL statements.