文件名称:A Brief Introduction to PySpark
文件大小:1003KB
文件格式:PDF
更新时间:2022-05-04 14:14:48
pyspar
关于 PySpark 的简介,适合新手入门学习。PySpark is a great language for performing exploratory data analysis at scale, building machine learning pipelines, and creating ETLs for a data platform. If you’re already familiar with Python and libraries such as Pandas, then PySpark is a great language to learn in order to create more scalable analyses and pipelines. The goal of this post is to show how to get up and running with PySpark and to perform common tasks.