Deep Dive into Spark SQL with Advanced Performance Tuning

时间:2021-07-01 03:33:02
【文件属性】:

文件名称:Deep Dive into Spark SQL with Advanced Performance Tuning

文件大小:4.43MB

文件格式:PDF

更新时间:2021-07-01 03:33:02

spark SQL apache spark

Spark SQL is a highly scalable and efficient relational processing engine with ease-to-use APIs and mid-query fault tolerance. It is a core module of Apache Spark. Spark SQL can process, integrate and analyze the data from diverse data sources (e.g., Hive, Cassandra, Kafka and Oracle) and file formats (e.g., Parquet, ORC, CSV, and JSON). This talk will dive into the technical details of SparkSQL spanning the entire lifecycle of a query execution. The audience will get a deeper understanding of Spark SQL and understand how to tune Spark SQL performance.


网友评论