File name: tuning-hadoop-on-dell-poweredge-servers
File size: 546KB
File format: PDF
Last updated: 2021-03-16 03:31:37
hadoop
Hadoop is a Java-based distributed framework designed to run applications implemented using the MapReduce programming model. The framework makes it possible to spread the load across thousands of nodes in a Hadoop cluster. Its distributed design also tolerates the failure of individual nodes without bringing down the cluster. The Hadoop market is predicted to grow at a compound annual growth rate over the next several years. Several good tools and guides describe how to deploy Hadoop clusters, but very little documentation explains how to increase the performance of a Hadoop cluster once it is deployed. This white paper explains several BIOS, OS, file system, Hadoop, and Java tunings that can increase the performance of a Hadoop cluster. These tunings require changes to BIOS settings and to the OS, Hadoop, and JVM configuration files.