Virtual Hadoop: The Study and Implementation of Hadoop in Virtual Environment using CloudStack KVM
Arun S Devadiga, Shalini P.R, Aditya Kumar Sinha
This paper focuses on running Hadoop in a virtual environment built on CloudStack and KVM to solve big-data problems. Hadoop is an Apache tool for processing huge amounts of data concurrently. Because Hadoop is open source, it has been adopted throughout the industry. Running Hadoop in a virtual environment provides a way to perform parallel computing and helps in the deployment and management of applications for distributed computing. The MapReduce component of Hadoop is used here for large-scale parallel applications, and virtualization lets us make better use of existing computing resources, which is essential in the cloud computing field. Managing Hadoop through virtual machines allows effective management of resources across a large number of nodes in terms of configuration, deployment, and resource utilization. Currently, there are many open-source solutions for building a cloud environment. One of them is CloudStack, an open-source cloud platform that supports building all kinds of cloud environments, including private, public, and hybrid clouds. KVM virtual machines provide the virtual environment. This article therefore explains the work involved in integrating Hadoop, CloudStack, and KVM. The integration results in a virtual Hadoop that allows users to process huge amounts of data concurrently in a virtual environment, with efficient use of resources.
Keywords: KVM, Virtualization, Hadoop, MapReduce, Distributed Environment, CloudStack
Cite this Article
Arun S Devadiga, Shalini P.R, Aditya Kumar Sinha, "Virtual Hadoop: The Study and Implementation of Hadoop in Virtual Environment using CloudStack KVM", International Journal of Engineering Development and Research (IJEDR), ISSN: 2321-9939, Volume 2, Issue 2, pp. 1899-1906, June 2014. Available at: http://www.ijedr.org/papers/IJEDR1402099.pdf