During the Masters project at the University of Oslo we investigated how cluster resources can be optimally and appropriately utilized during MapReduce operation in order to result in a better job parallelism and throughput. To achieve this, an optimal design which we call Adaptive Parameter Tuning of Hadoop (APTH) is developed that can dynamically change the resources allocation by changing the system level parameters at the run-time. We developed two algorithms namely progress aware algorithm and current accumulated progress algorithm for our APTH approach. Our comprehensive results showed that the resources are optimally and appropriately utilized using our APTH approach during job execution which resulted in better job parallelism and throughput. This was the script that was deploy in bash to make work out.
Ramsum123/Bash-Script
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|