Bachelors (IT/Computer Science Preferred); Master's Degree preferred (IT/Computer Science Preferred) or equivalent experienceJob Description :- 12+ years of industry experience in analyzing source system data and data flows, working with structured and unstructured data, and delivering data and solution architecture designs.- Experience with clustered/distributed computing systems, such as Hadoop/MapReduce, Spark/SparkR, Lucene/ElasticSearch, Storm, Cassandra, Graph Databases, Analytics Notebooks like Jupyter, Zeppelin etc-
experience building data pipelines for structured/unstructured, real-time/batch, events/synchronous/asynchronous using MQ, Kafka, Steam processing.-
5+ years of experience as a data engineer/solution architect designing and delivering large scale distributed software systems, preferably in large scale global business- preferably using open source tools and big data technologies such as Cassandra, Hadoop, Hive, Prestodb, Impala, HBase, Spark, Storm, Redis, Drill etc- Strong hands-on experience of programming with Java, Python, Scala etc-
experience with SQL, NoSQL, relational database design, and methods for efficiently retrieving data for Time Series Analytics.-
knowledge of Data Warehousing best practices; modeling techniques and processes and complex data integration pipelines-
experience gathering and processing raw data at scale (including writing scripts, web scraping, calling APIs, write SQL queries, etc.)-
excellent technical skills -
able to effectively lead a technical team of business intelligence and big data developers as well as data analysts.-
Optional Skills:-
experience in Machine Learning, Deep Learning, Data Science-
experience with Cloud architecture & service like AWS, Azure-
experience with Graph, Sematic Web, RDF Technologies-
experience with Text Analytics using SOLR or ElasticSearch