CCHD - Cloudera Certified Hadoop Developer - CDH5
I would like to share some tips on preparing for the Cloudera certification. Most importantly, the Cloudera exam is now based on CDH version 5. You should have hands-on experience with CDH5 and good exposure to MapReduce programming and design patterns; otherwise, consider yourself half prepared. In my exam, 30-40% of the questions were scenario based, covering MapReduce programming and Sqoop, Hive, or file system command options.
I suggest downloading and installing the QuickStart VM on your machine and practicing on it:
http://www.cloudera.com/content/cloudera/en/downloads/quickstart_vms/cdh-5-4-x.html
If you are very serious about the Hadoop certification, I highly recommend the Cloudera Developer Training for Apache Hadoop.
Material I referred to during preparation:
- Hadoop: The Definitive Guide (4th Edition) by Tom White
- MapReduce Design Patterns by Donald Miner and Adam Shook
- A good understanding of Sqoop 1.4, Sqoop2 and Hive is required to pass the exam. An overview of Pig, Oozie, Flume, Avro and Crunch is also needed, as you may see a few questions on them in the exam.
- An overview of the Hadoop ecosystem is available at http://hadoopecosystemtable.github.io/
- https://developer.yahoo.com/hadoop/tutorial/
- Understand HDFS commands with various options.
- http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/FileSystemShell.html
- Deep understanding of classic MapReduce and YARN architecture, replicated and repartition joins, sorting, etc.
- You should be good at core Java, IO or NIO, and regular expressions.
- Knowledge of the available input/output formats; you should also be able to create your own input/output formats.
- Custom Writable, WritableComparable, RawComparator
- You should be comfortable writing a MapReduce job for a given complex SQL (HiveQL) query.
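To make the custom Writable / WritableComparable item in the list above more concrete, here is a minimal sketch of a composite key. It is my own illustration, not exam material: the class name YearTempKey, its fields and the roundTrip helper are hypothetical. So that it runs without Hadoop on the classpath, it only implements Comparable and uses java.io; in a real job the class would implement org.apache.hadoop.io.WritableComparable, which requires the same write(DataOutput) and readFields(DataInput) methods.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical composite key (year, temperature).
// Assumption: in a real job this would be declared as
//   class YearTempKey implements WritableComparable<YearTempKey>
// Here it implements only Comparable so the sketch compiles without Hadoop.
class YearTempKey implements Comparable<YearTempKey> {
    private int year;
    private int temperature;

    // Hadoop creates key instances reflectively, so keep a no-arg constructor.
    YearTempKey() {}

    YearTempKey(int year, int temperature) {
        this.year = year;
        this.temperature = temperature;
    }

    // Same signature as Writable.write: serialize the fields in a fixed order.
    public void write(DataOutput out) throws IOException {
        out.writeInt(year);
        out.writeInt(temperature);
    }

    // Same signature as Writable.readFields: read the fields back in the same order.
    public void readFields(DataInput in) throws IOException {
        year = in.readInt();
        temperature = in.readInt();
    }

    // Sort order used during the shuffle: year ascending, then temperature descending.
    @Override
    public int compareTo(YearTempKey other) {
        int byYear = Integer.compare(year, other.year);
        if (byYear != 0) {
            return byYear;
        }
        return Integer.compare(other.temperature, temperature);
    }

    @Override
    public boolean equals(Object o) {
        if (!(o instanceof YearTempKey)) {
            return false;
        }
        YearTempKey k = (YearTempKey) o;
        return year == k.year && temperature == k.temperature;
    }

    // The default HashPartitioner relies on hashCode, so keep it consistent with equals.
    @Override
    public int hashCode() {
        return 31 * year + temperature;
    }

    @Override
    public String toString() {
        return year + "\t" + temperature;
    }

    // Hypothetical helper: serialize a key to bytes and read it back into a new instance.
    static YearTempKey roundTrip(YearTempKey key) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            key.write(new DataOutputStream(bytes));
            YearTempKey copy = new YearTempKey();
            copy.readFields(new DataInputStream(new ByteArrayInputStream(bytes.toByteArray())));
            return copy;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(roundTrip(new YearTempKey(1950, 22)));
    }
}
```

The points worth remembering are that write and readFields must handle the fields in the same order, that compareTo decides the sort order of keys, and that hashCode affects which reducer a key goes to under the default partitioner. A RawComparator goes one step further and compares the serialized bytes directly, which avoids deserializing every key during the sort.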
Here is my certificate: