What is Apache Hadoop?


Please explain why do you think this question should be reported?

Report Cancel

Hadoop emerged as a solution to the “Big Data” problems. It is a part of the Apache project sponsored by the Apache Software Foundation (ASF). It is an open source software framework for distributed storage and distributed processing of large data sets. Open source means it is freely available and even we can change its source code as per our requirements. Apache Hadoop makes it possible to run applications on the system with thousands of commodity hardware nodes. It’s distributed file system has the provision of rapid data transfer rates among nodes. It also allows the system to continue operating in case of node failure. Apache Hadoop provides:

Storage layer – HDFS
Batch processing engine – MapReduce
Resource Management Layer – YARN

Please follow and like us:

About the Author