文件名称:BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark:该存储库包含代码文件,这些代码文件特别是UC Berkeley和Databricks在edX上针对“用Apache Spark引入大数据”课程中的作业分配的IPython笔记本。
文件大小:26.98MB
文件格式:ZIP
更新时间:2024-06-12 10:47:52
BerkeleyX-CS100.1x-具有Apache-Spark的大数据 该存储库包含代码文件,这些代码文件特别是UC Berkeley和Databricks在edX上针对“用Apache Spark引入大数据”课程中的作业分配的IPython笔记本。
【文件预览】:
BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark-master
----Reference Material()
--------visualapi.pdf(3.61MB)
----.gitignore(702B)
----README.md(231B)
----Week 5 - Introduction to Machine Learning with Apache Spark()
--------Lab 4 Quiz Questions.pdf(400KB)
--------lab4_machine_learning_student.ipynb(70KB)
----LICENSE(1KB)
----Week 2 - Introduction to Apache Spark()
--------lab1_word_count_student.ipynb(32KB)
--------Week2Lec4.pdf(597KB)
--------Week2Lec3.pdf(7.14MB)
----Week 1 - Data Science Background and Course Software Setup()
--------Week1Lec1.pdf(7.88MB)
--------lab0_student.ipynb(61KB)
--------Week1Lec2.pdf(4.6MB)
----Week 3 - Data Management()
--------Week3Lec5.pdf(1.12MB)
--------lab2_apache_log_student.ipynb(257KB)
--------Week3Lec6.pdf(1.16MB)
----Week 4 - Data Quality, Exploratory Data Analysis, and Machine Learning()
--------lab3_text_analysis_and_entity_resolution_student.ipynb(152KB)
--------Week4Lec7.pdf(1.26MB)
--------Week4Lec8.pdf(1.37MB)
--------Lab 3 Quiz Questions.pdf(801KB)