文件名称:SparkLearning_NoteBook:Spark 学习notebook
文件大小:617KB
文件格式:ZIP
更新时间:2024-06-06 17:19:27
JupyterNotebook
SparkLearning_NoteBook README 一、目录 [TOC] 二、项目描述 基于Spark的学习实践笔记,内附jupyter notebook实践,可以根据里面的一步步操作学习Spark RDD的基本API操作、Spark MLlib 相关操作和Spark实践Demo等。 本项目配有完整依赖环境的实战Docker镜像,具体Docker Hub路径为:https://hub.docker.com/r/jeanheodh/pyspark_env/ 。环境配置步骤如下: 后台运行镜像docker run -d -p 23333:23333 --name notebook -w /root/notebook jeanheodh/pyspark_env jupyter notebook --ip=0.0.0.0 --allow-root --port=23333 运行后可通过容器
【文件预览】:
SparkLearning_NoteBook-master
----completed_notebook()
--------L06_SparkALSCollaborativeFiltering.ipynb(15KB)
--------.ipynb_checkpoints()
--------L03_SparkMLLibPractice.ipynb(9KB)
--------L04_UserBaseCollaborativeFiltering.ipynb(17KB)
--------L02_SparkMapReducePractice.ipynb(23KB)
--------L05_ItemBaseCollaborativeFiltering.ipynb(61KB)
--------L01_SparkRDDAPIPractice.ipynb(127KB)
----README.md(6KB)
----.ipynb_checkpoints()
--------ALS_CF-checkpoint.ipynb(15KB)
--------ItemBaseCollaborativeFiltering-checkpoint.ipynb(19KB)
--------L01_Spark_RDD_API-checkpoint.ipynb(125KB)
--------UserBaseCollaborativeFiltering-checkpoint.ipynb(17KB)
--------L03_Spark_MLLib_Practice-checkpoint.ipynb(9KB)
--------L02SparkMapReducePractice-checkpoint.ipynb(23KB)
----pysrc()
--------ItemBaseCF.py(15KB)
--------ALSCF.py(10KB)
--------UserBaseCF.py(10KB)
--------WordCount.py(2KB)
--------InvertedIndex.py(3KB)
--------Spark_RDD_API.py(7KB)
----images()
--------ItemBase_FlowChart.png(61KB)
--------UserBase_FlowChart.png(49KB)
--------ALS_FlowChart.png(46KB)
----data()
--------ratings.dat(237KB)
--------invertedIndex()
--------movies.dat(520KB)
--------wordcount.txt(4KB)
----uncompleted_notebook()
--------L06_SparkALSCollaborativeFiltering.ipynb(15KB)
--------.ipynb_checkpoints()
--------L03_SparkMLLibPractice.ipynb(9KB)
--------L04_UserBaseCollaborativeFiltering.ipynb(15KB)
--------L02_SparkMapReducePractice.ipynb(13KB)
--------L05_ItemBaseCollaborativeFiltering.ipynb(15KB)
--------L01_SparkRDDAPIPractice.ipynb(115KB)
----requirement.txt(909B)