文件名称:BigDataAnalysis_Exp4:实时大数据分析_文本相似-Shingling、Minhash算法
文件大小:147KB
文件格式:ZIP
更新时间:2024-06-05 03:25:26
Java
实时大数据分析实验四——文本相似-Shingling、Minhash算法 一、实验内容 采用Shinling及Minhash技术分析以下两段文本的Jaccard相似度: (1) The TOEFL test is an English language assessment that is often required for admission by English-speaking universities and programs around the world. In addition to being accepted at more than 10,000 institutions in over 130 countries, including Australia, Canada, and the US, TOEFL scores help you get noticed
【文件预览】:
BigDataAnalysis_Exp4-master
----.project(379B)
----result03.png(20KB)
----src()
--------com()
----Flowsheet04.png(66KB)
----result02.png(20KB)
----.settings()
--------org.eclipse.jdt.core.prefs(587B)
----eclipse04.png(7KB)
----README.md(11KB)
----.classpath(295B)
----result01.png(20KB)
----bin()
--------com()