【文件属性】:
文件名称:Search-Engine:云计算项目4
文件大小:20.85MB
文件格式:ZIP
更新时间:2021-05-21 15:49:57
Java
搜索引擎
云计算项目4
Built an input text predictor using MapReduce jobs to get n-grams from 6000 books;
Generated a statistical language model using probability of n-gram counts stored in Hbase;
Built a search-term ranking function using Term Frequency - Inverse Document Frequency (TF-IDF) to find the most relevant documents for a search-term using the Apache Spark framework
Built a document ranki