You will be someone who loves and can analyze raw data and exploit algorithms to bring out crucial insights to tackle complex problems.-
You will be responsible for developing data pipeline, creating stunning reports and visualizations and develop real-world machine learning appliances. You will be taking the bottom-line of managing a range of database (SQL, NoSQL) platforms in the cloud and scaling to the needs of our internal applications and SaaS platforms.-
Conducting data analysis and report generation.-
Doing a thorough requirement analysis-
Entire project coordinationSkill-set:-
Data pipeline development - batch, streaming and distributed processing fundamentals with on-job experience. Working knowledge of ETL, ELT tools, map-reduce, spark.-
Information models design and development - analyze, design and build relational, dimensional and document models.-
Distributed databases - Experience with Hadoop, MongoDB, Full-text search engine configuration and management-
Machine learning - Python, SciKit, Pandas-
Hands-on experience in AWS.-
Experience in performance tuning of complex ETL mappings for relational and non-relational workloads-
Hands-on experience in AWS, big data tools (Spark, Kafka) preferably data-heavy / analytics applications leveraging relational and No SQL databases, data warehouse, and big data.-
Data Mining, Data Modeling, and Data Provisioning (acquisition from various sources, transformation, and sharing)-
Experience with Data streaming paradigms.-
Working knowledge in one NoSQL database like MongoDB/Cassandra/HBase/Couchbase.-
Demonstrated ability in solutions covering data ingestion, data cleansing, ETL, data mart creation and exposing data for consumers.-
Big data tools: Hadoop, Spark, Presto, Kafka, etc.-
Applying Job & Updating your profile. Please wait…
Update/Review profile information
Please review & update following critical information(s). Update & apply to become a matching Job applicant! Without this information, your profile may not get shortlisted.