big_data: Big Data Lab Assignments

Time: 2024-03-12 08:42:54
[File attributes]:

File name: big_data: Big Data Lab Assignments

File size: 149 KB

File format: ZIP

Last updated: 2024-03-12 08:42:54

Jupyter Notebook

Big data
Assignments for the Big Data lab. The Stanford web graph ( ) is used for some tasks in Lab 5.

Lab 1:
Task:
- Build a word cloud from the words in a book

Lab 2:
Task 1:
- Divide the book into chapters (treat each chapter as a separate document)
- Compute TF-IDF for all words in all chapters
- Build a word cloud for each chapter
Task 2:
- Find the words with the highest TF-IDF
- For a given word list, find the best-matching chapters (according to TF-IDF)

Lab 3:
Task 1:
- Jaccard distance
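The Lab 2 tasks can be illustrated with a minimal standard-library sketch. It is not the code from the notebooks: the function names (`tokenize`, `tf_idf`, `top_words`, `best_chapters`) are hypothetical, the book is assumed to be already split into a list of chapter strings, and the TF-IDF variant used (relative term frequency times the natural log of inverse document frequency) is one common choice among several.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def tf_idf(chapters):
    """Return one {word: score} dict per chapter.

    tf  = occurrences of the word in the chapter / tokens in the chapter
    idf = log(number of chapters / number of chapters containing the word)
    """
    token_lists = [tokenize(chapter) for chapter in chapters]
    n_docs = len(token_lists)

    # Document frequency: in how many chapters does each word appear?
    doc_freq = Counter()
    for tokens in token_lists:
        doc_freq.update(set(tokens))

    scores = []
    for tokens in token_lists:
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid division by zero for empty chapters
        scores.append({
            word: (count / total) * math.log(n_docs / doc_freq[word])
            for word, count in counts.items()
        })
    return scores


def top_words(chapter_scores, k=5):
    """Task 2a: the k words with the highest TF-IDF in one chapter."""
    return sorted(chapter_scores, key=chapter_scores.get, reverse=True)[:k]


def best_chapters(scores, query_words, k=3):
    """Task 2b: rank chapter indices by the summed TF-IDF of the query words."""
    totals = [sum(s.get(w.lower(), 0.0) for w in query_words) for s in scores]
    return sorted(range(len(scores)), key=lambda i: totals[i], reverse=True)[:k]
```

A word that occurs in every chapter gets a score of zero under this variant, which is why very common words drop out of the per-chapter word clouds. The per-chapter dictionaries could be passed to a word cloud library as word weights, but that step is left out here.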

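For the Lab 3 task, Jaccard distance between two sets is one minus the size of their intersection divided by the size of their union. The sketch below is an illustration only: comparing documents through sets of word shingles is an assumption suggested by the file name shingling.ipynb, and the helper names `shingles` and `jaccard_distance` and the shingle length `k` are hypothetical.

```python
def shingles(text, k=3):
    """Return the set of k-word shingles (as tuples) found in the text."""
    words = text.lower().split()
    if len(words) < k:
        # Too short for a full shingle: use the whole text as a single one.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard_distance(a, b):
    """Return 1 - |a & b| / |a | b|; two empty sets have distance 0.0."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


doc_a = shingles("a b c d", k=2)
doc_b = shingles("a b c e", k=2)
print(jaccard_distance(doc_a, doc_b))  # 2 shared shingles out of 4 -> 0.5
```

A distance of 0.0 means the shingle sets are identical and 1.0 means they share nothing.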

[File preview]:
big_data-master
----lab5()
--------BDA_l5.ipynb(17KB)
----lab4()
--------BDA_l4.ipynb(16KB)
----lab3()
--------.gitignore(47B)
--------shingling.ipynb(7KB)
----README.md(1KB)
----lab2()
--------count_words_pl.ipynb(7KB)
--------.gitignore(37B)
----lab1()
--------count_words_pl.ipynb(98KB)
--------.gitignore(91B)
--------count_words.ipynb(126KB)
