Tutorial by Examples

A Pseudo Distributed Cluster Setup Procedure Prerequisites Install JDK1.7 and set JAVA_HOME environment variable. Create a new user as "hadoop". useradd hadoop Setup password-less SSH login to its own account su - hadoop ssh-keygen << Press ENTER for all prompt...
Creating Hadoop User: sudo addgroup hadoop Adding a user: sudo adduser --ingroup hadoop hduser001 Configuring SSH: su -hduser001 ssh-keygen -t rsa -P "" cat .ssh/id rsa.pub >> .ssh/authorized_keys Note: If you get errors [bash: .ssh/authorized_keys: No such file or di...
Hadoop is an open-source software framework for storage and large-scale processing of data-sets in a distributed computing environment. It is sponsored by Apache Software Foundation. It is designed to scale up from single servers to thousands of machines, each offering local comput...

Page 1 of 1