Experience cleansing and transforming data on Cloudera Hadoop/Spark, SQL-based databases, Impala, Pig, Hive, ELT/ETL, real-time processing, and the Hadoop ecosystem.
Experience with Oozie, Talend/Pentaho Job Scheduler, and crontab scheduling.
Should have set up a Cloudera Hadoop architecture for large-scale data processing on at least one or two projects.
Experience with Cloudera Analytics DB as a Cloudera Hadoop architect
Experience setting up a layered Hadoop architecture (staging, aggregation, etc.)
Experience delivering end-to-end Hadoop projects
Experience with SQL required
Experience with Java required
Collaborative team player
Roles And Responsibilities
Develop new Hadoop-based ELT/ETL models
Develop visualization solutions on top of Hadoop results
Implement a data pipeline process for consuming data in a secure fashion
Deploy Hadoop ELT/ETL in a production environment
Monitor the Hadoop environment, ELT, and ETL
Handle customer projects on Cloudera Hadoop
Tune the performance of Hadoop ELTs and queries
Schedule advanced workflows for Hadoop jobs
Tune the performance of large-scale Hadoop architecture