Hadoop Training (1) - PowerPoint PPT Presentation

About This Presentation
Title:

Hadoop Training (1)

Description:

Hadoop training by OrienIT, an institute for Hadoop and Oracle courses that offers Hadoop training in Hyderabad with experienced, expert faculty.

Slides: 14
Provided by: martinvictorjames

Transcript and Presenter's Notes

1
Hadoop Training In Hyderabad
By
Address
Flat No 204, Annapurna Block, Aditya Enclave,
Ameerpet, Hyderabad - 500038.
Phone: 040 65142345
Mobile: 970 320 2345
Email: info_at_orienit.com
http://www.orienit.com
2
About Hadoop
  • Hadoop
  • Hadoop Cluster 
  • A Real Time Example
  • Core Components of Hadoop Cluster
  • Slaves

3
About Us
  • ORIEN IT was established with the primary
    objective of offering superior IT training
    services and support for different business
    organizations.
  • ORIEN IT provides integrated IT training services
    and a complete range of IT consulting support
    to cater to the requirements of both individual
    learners and corporate clients.

4
Hadoop
  • Hadoop is an open source framework that supports
    the processing of large data sets in a
    distributed computing environment. Hadoop
    consists of MapReduce, the Hadoop Distributed
    File System (HDFS) and a number of related
    projects such as Apache Hive, HBase and
    ZooKeeper. MapReduce and HDFS are the main
    components of Hadoop.

5
Hadoop Cluster
  • Any set of loosely or tightly connected
    computers that work together as a single system
    is called a cluster. In simple words, a computer
    cluster used for Hadoop is called a Hadoop
    cluster. A Hadoop cluster is a special type of
    computational cluster designed for storing and
    analyzing vast amounts of unstructured data in a
    distributed computing environment. These
    clusters run on low-cost commodity computers.

6
Hadoop Cluster
Hadoop clusters are often referred to as "shared
nothing" systems because the only thing that is
shared between nodes is the network that connects
them. Large Hadoop Clusters are arranged in
several racks. Network traffic between different
nodes in the same rack is much more desirable
than network traffic across the racks. 
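
The preference for same-rack traffic can be illustrated with a small sketch. It assumes each machine is described by a hypothetical "/rack/node" location string and counts the hops between two machines in that rack/node tree, so a lower number means cheaper traffic. The function names and the location format are assumptions made for illustration, not code from the presentation.

```python
def network_distance(loc_a, loc_b):
    """Count tree hops between two '/rack/node' locations (hypothetical format)."""
    parts_a = loc_a.strip("/").split("/")
    parts_b = loc_b.strip("/").split("/")
    # Length of the shared prefix, i.e. the common ancestors in the tree.
    common = 0
    for a, b in zip(parts_a, parts_b):
        if a != b:
            break
        common += 1
    # Hops up from A to the common ancestor, plus hops down to B.
    return (len(parts_a) - common) + (len(parts_b) - common)


def closest_node(reader, candidates):
    """Pick the candidate location with the smallest distance to the reader."""
    return min(candidates, key=lambda loc: network_distance(reader, loc))


if __name__ == "__main__":
    reader = "/rack1/node1"
    for other in ["/rack1/node1", "/rack1/node2", "/rack2/node3"]:
        print(reader, "->", other, "distance", network_distance(reader, other))
```

With this scoring, a node on the reader's own rack always wins over a node on another rack, which is the behavior the slide describes as desirable.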
7
A Real Time Example
Here is a picture of Yahoo's Hadoop cluster. They
have more than 10,000 machines running Hadoop and
nearly 1 petabyte of user data. 
8
Core Components of Hadoop Cluster
A Hadoop cluster has 3 components: Client, Master
and Slave. The role of each component is shown in
the image below.
9
Client
  • The client is neither master nor slave. Its role
    is to load data into the cluster, submit
    MapReduce jobs describing how the data should be
    processed, and then retrieve the results after
    job completion.
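
To make "a MapReduce job describing how the data should be processed" more concrete, here is a minimal word-count sketch in plain Python. It runs in a single process and does not use Hadoop at all; the function names (map_phase, shuffle, reduce_phase, run_job) are hypothetical and only mimic the map, group-by-key and reduce steps that a real job would have distributed across the cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group all emitted values by key, as happens between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single count."""
    return key, sum(values)


def run_job(lines):
    """Simulate the whole job locally: map every line, group, then reduce."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    data = ["Hadoop stores data", "Hadoop processes data"]
    print(run_job(data))
```

In this picture the client's part is the equivalent of supplying `data`, calling `run_job`, and reading the returned counts.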

10
Masters
The Masters consist of 3 components: NameNode,
Secondary NameNode and JobTracker.
11
Masters consist of 3 components
  • NameNode: The NameNode does NOT store the files,
    only the files' metadata. As a later section
    shows, it is the DataNodes that actually store
    the files. The NameNode oversees the health of
    the DataNodes and coordinates access to the data
    stored on them. It keeps track of all file
    system related information, such as:
      • Which section of a file is saved in which
        part of the cluster
      • Last access time for the files
      • User permissions, such as which users have
        access to the file
  • JobTracker: The JobTracker coordinates the
    parallel processing of data using MapReduce.
  • Secondary NameNode: Don't be confused by the
    name "Secondary". The Secondary NameNode is NOT
    the backup or high-availability node for the
    NameNode.
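
The kind of bookkeeping listed for the NameNode can be sketched as a small in-memory structure. This is a toy model under stated assumptions: the class names (FileMetadata, ToyNameNode), the block IDs and the DataNode names are all hypothetical, and a real NameNode tracks far more than this. The point is only that the master holds block locations, access times and permissions, while the file contents live elsewhere.

```python
from dataclasses import dataclass, field


@dataclass
class FileMetadata:
    owner: str
    readers: set            # users allowed to read the file
    last_access: float      # timestamp of the most recent read
    # Maps block ID -> list of DataNode names holding a copy of that block.
    block_locations: dict = field(default_factory=dict)


class ToyNameNode:
    """Holds only metadata; file contents would live on the DataNodes."""

    def __init__(self):
        self.files = {}

    def add_file(self, path, owner, readers, block_locations, now):
        # The owner is always allowed to read their own file.
        allowed = set(readers) | {owner}
        self.files[path] = FileMetadata(owner, allowed, now, dict(block_locations))

    def locate_blocks(self, path, user, now):
        """Check permissions, record the access time, return block locations."""
        meta = self.files[path]
        if user not in meta.readers:
            raise PermissionError(f"{user} may not read {path}")
        meta.last_access = now
        return meta.block_locations


if __name__ == "__main__":
    nn = ToyNameNode()
    nn.add_file("/logs/a.txt", "alice", ["bob"],
                {"blk_1": ["dn1", "dn2"], "blk_2": ["dn3"]}, now=100.0)
    print(nn.locate_blocks("/logs/a.txt", "bob", now=200.0))
```

A client in this model would ask `locate_blocks` where a file's pieces are and then fetch them from the named DataNodes directly.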

12
Slaves
  • Slave nodes make up the majority of machines in
    a Hadoop cluster and are responsible for:
      • Storing the data
      • Processing the computation

13
Best Hadoop Training In Hyderabad