Tag: Hadoop


Hadoop is an Apache open-source framework written in Java that allows distributed processing of large datasets across clusters of computers using simple programming models. A Hadoop application runs in an environment that provides distributed storage and computation across clusters Read more…


This post explains how to set up the YARN master on a Hadoop 3.1 cluster and run a MapReduce program. Before you proceed with this document, please make sure you have a Hadoop 3.1 cluster up and running. If you do not have a setup, Read more…


This document explains, step by step, how to install Apache Hadoop (version 3.1.1) as a cluster with a master node (NameNode) and 3 worker nodes (DataNodes) on Ubuntu. Below are the 4 nodes and their IP addresses that I will refer to here. 192.168.1.100    Read more…