File name: Data Analytics with Spark Using Python
File size: 9.5MB
File format: EPUB
Updated: 2021-07-08 15:23:55
Addison-Wesley Data & Analytics Series
Solve Data Analytics Problems with Spark, PySpark, and Related Open Source Tools
Coverage includes:
• Understand Spark’s evolving role in the Big Data and Hadoop ecosystems
• Create Spark clusters using various deployment modes
• Control and optimize the operation of Spark clusters and applications
• Master Spark Core RDD API programming techniques
• Extend, accelerate, and optimize Spark routines with advanced API platform constructs, including shared variables, RDD storage, and partitioning
• Efficiently integrate Spark with both SQL and nonrelational data stores
• Perform stream processing and messaging with Spark Streaming and Apache Kafka
• Implement predictive modeling with SparkR and Spark MLlib