Learning Apache Mahout(PACKT,2015)

时间:2018-06-18 19:10:05
【文件属性】:

文件名称:Learning Apache Mahout(PACKT,2015)

文件大小:13.77MB

文件格式:PDF

更新时间:2018-06-18 19:10:05

Mahout hadoop analytics

In the past few years the generation of data and our capability to store and process it has grown exponentially. There is a need for scalable analytics frameworks and people with the right skills to get the information needed from this Big Data. Apache Mahout is one of the first and most prominent Big Data machine learning platforms. It implements machine learning algorithms on top of distributed processing platforms such as Hadoop and Spark. Starting with the basics of Mahout and machine learning, you will explore prominent algorithms and their implementation in Mahout development. You will learn about Mahout building blocks, addressing feature extraction, reduction and the curse of dimensionality, delving into classification use cases with the random forest and Naïve Bayes classifier and item and user-based recommendation. You will then work with clustering Mahout using the K-means algorithm and implement Mahout without MapReduce. Finish with a flourish by exploring end-to-end use cases on customer analytics and test analytics to get a real-life practical know-how of analytics projects.


网友评论

  • 比较广泛,还需要多点深度,感谢分享!
  • 初步了解。。。感谢分享
  • 好书。 有mahout最新的信息。 对我的project很有帮助。