Hadoop Administration
Introduction to Big Data and Hadoop
- What is Big Data?
- What are the challenges for processing big data?
- What technologies support big data?
- Distributed systems
- What is Hadoop?
- Why Hadoop?
- History of Hadoop
- Use Cases of Hadoop
- Hadoop eco System
- HDFS
- Map Reduce
- Statistics
Understanding the Cluster
- Typical workflow
- Writing files to HDFS
- Reading files from HDFS
- Rack Awareness
- 5 daemons
Best Practices for Cluster Setup
- Best Practices
- How to choose the right hadoop distribution
- How to choose right hardware
Cluster Setup
- Install Pseudo cluster
- Install Multi node cluster
- Configuration
- Setup cluster on Cloud – EC2
- Tools
- Security
- Benchmarking the cluster
Routine Admin procedures
- Metadata & Data Backups
- Filesystem check (fsck)
- File system Balancer
- Commissioning and decommissioning nodes
- Upgrading
- Using DFSAdmin
Monitoring the Cluster
- Using the Web user interfaces
- Hadoop Log files
- Setting the log levels
- Monitoring with Nagios
Install ,Configure and use
- PIG
- HIVE
- HBASE
- Flume and Sqoop
- zookeeper
Subscribe to:
Post Comments
(
Atom
)
Check it once for an information on Hadoop admin Online Training Hyderabad
ReplyDelete