Introduction to ZooKeeper
Apache ZooKeeper is a centralized service, and a component of the Hadoop ecosystem, for maintaining configuration information, naming, distributed synchronization, and group services.
ZooKeeper is a distributed coordination service for managing a large set of hosts. Coordinating and managing a service in a distributed environment is a complicated process. ZooKeeper addresses this with a simple architecture and API.
ZooKeeper stores the configuration, naming, and group-service information that Hadoop components rely on to synchronize their tasks.
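To make this concrete, here is a minimal in-memory sketch of a ZooKeeper-style hierarchical namespace, in which each path holds a small piece of data. It is a toy model, not the ZooKeeper client API: the class `TinyNamespace`, its methods, and the example paths and values are all hypothetical names chosen for illustration.

```python
class TinyNamespace:
    """Toy stand-in for a ZooKeeper-style tree of nodes (hypothetical, in-memory only)."""

    def __init__(self):
        # Maps an absolute path to the bytes stored at that node.
        self._nodes = {"/": b""}

    def create(self, path, data=b""):
        # A node can only be created under an existing parent.
        parent = path.rsplit("/", 1)[0] or "/"
        if parent not in self._nodes:
            raise KeyError(f"parent {parent} does not exist")
        if path in self._nodes:
            raise ValueError(f"{path} already exists")
        self._nodes[path] = data

    def get(self, path):
        return self._nodes[path]

    def children(self, path):
        # Return the names of the direct children of `path`, sorted.
        prefix = path.rstrip("/") + "/"
        return sorted(
            p[len(prefix):]
            for p in self._nodes
            if p != "/" and p.startswith(prefix) and "/" not in p[len(prefix):]
        )


ns = TinyNamespace()
ns.create("/config")
ns.create("/config/db_url", b"jdbc:example")  # configuration value (made up)
ns.create("/workers")
ns.create("/workers/host-a")                  # group membership (made up)
ns.create("/workers/host-b")

db_url = ns.get("/config/db_url")
members = ns.children("/workers")
```

In this sketch, configuration is read by path and group membership is the list of children under a parent path. A real deployment would use a ZooKeeper client library against a running ensemble rather than a local dictionary.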
The ZooKeeper framework was originally built at Yahoo! to make access to their applications easy and robust. Later, Apache ZooKeeper became a standard coordination service used by Hadoop, HBase, and other distributed frameworks.
Three ZooKeeper-based services are widely used in Hadoop:
ZooKeeper failover controller: used for NameNode HA.
ZooKeeper state store: used for ResourceManager HA.
ZooKeeper dynamic service discovery: used for HiveServer2 HA.
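Failover of the kind listed above is commonly built on ZooKeeper's ephemeral nodes, which are removed when the owning client's session ends. The sketch below is a toy, in-memory illustration of that general idea, not the actual implementation of any Hadoop component. The class `ToyElection`, its methods, and the session names `nn1` and `nn2` are hypothetical.

```python
class ToyElection:
    """Toy model of active/standby election via a single ephemeral lock node."""

    def __init__(self):
        # Session that currently holds the lock node, or None if it is absent.
        self.lock_owner = None

    def try_acquire(self, session_id):
        # Creating the lock node succeeds only if nobody holds it yet.
        if self.lock_owner is None:
            self.lock_owner = session_id
        return self.lock_owner == session_id

    def session_closed(self, session_id):
        # An ephemeral node vanishes when its owner's session ends.
        if self.lock_owner == session_id:
            self.lock_owner = None


election = ToyElection()
nn1_active = election.try_acquire("nn1")         # first node becomes active
nn2_active = election.try_acquire("nn2")         # second node stays standby
election.session_closed("nn1")                   # active node fails
nn2_after_failover = election.try_acquire("nn2") # standby takes over
```

The point of the design is that the standby does not need to detect the failure itself: the lock disappears with the failed session, and whichever process acquires it next becomes active.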