Project Description
The client is establishing a centralized team to develop digital transformation products and services. This role will be part of associated projects that contribute to the development of these products and services.
Responsibilities
Support the analysis of data and the preparation of additional big data and data analysis technologies, such as data modelling, wrangling, pipeline implementation, or building machine learning models
Review and IT testing of the methodology used to match clients via name matching
Mandatory experience in one or more of the following:
Data modelling, wrangling, pipeline implementation, or building machine learning models
Machine learning or natural language processing (NLP) projects
Familiarity with assembling and analyzing data sets from disparate sources, applying quantitative methodologies, computational frameworks and systems
Identifying the best analytics or machine learning methodologies for the specific problem at hand
Developing reliable, autonomous and scalable data pipelines
Skills
Must have
PySpark, Data Science, Bash scripting, Apache Spark, R Programming Language, SQL, Python, Banking, Hadoop, HDFS
Nice to have
Agile Methodology, Java, NoSQL, Scala, DevOps, Big Data
Languages
English: A1 Beginner