Jr Data Scientist
Location: Hyderabad
Job Category: Data Science
Primary Responsibilities include:
- Apply multiple approaches to project execution
- Adapt existing assets to Operations use cases
- Explore third-party and open source solutions for speed to execution and for specific use cases
- Engage in fundamental research to develop novel solutions
- Work with the Lead in building scalable, performance-intensive NoSQL databases
- Develop routines using Python, Spark, and NoSQL for exposing data to the product stack
- Develop data mining techniques using state-of-the-art methods
- Process, cleanse, and verify the integrity of data collected from transactional systems for analysis
- Research, evaluate, and develop analytical and decision-making methodologies
- Select/create algorithms and software needed to perform analyses
- Review industry publications and relevant research to develop client-specific solutions and keep current on trends and new offerings in the Data Science space.
- Aggregate and analyze data and information, and build models that yield relevant, actionable insights
- Synthesize data, information, and knowledge to form conclusions and recommendations
- Test and validate data, information, and models against reality
- Research and implement machine learning, data mining and other statistical modeling techniques
- Perform ad-hoc analysis and present results in a clear and concise manner
Desired skills:
- Experience identifying, gathering, and analyzing large datasets
- Experience with Cognitive Services from Microsoft, Google, IBM and/or Amazon
- Ability to collaborate on and create machine learning solutions in the areas of optimization, simulation, predictive analytics, and deep learning
- Experience with deep learning and cognitive services: speech, language, text, and knowledge
- MUST have scientific programming experience (Python, etc.); hands-on experience with NumPy and the Python scientific stack is a must
- Demonstrated ability to track and work in a big data environment
- Exposure to CNN and LSTM networks
- Familiarity with at least one popular NLP library, such as Stanford CoreNLP, Apache OpenNLP, or NLTK
- Strong in a programming language such as Java or Python
- Strong hands-on knowledge of SQL and NoSQL databases
- Strong exposure to or knowledge of NoSQL databases and distributed computing systems
- Excellent interpersonal skills; must be self-motivated, detail-oriented, and willing to learn
- Hands-on experience with Python, Matplotlib, scikit-learn, NumPy, Pandas, and Elasticsearch
- Strong in mathematics and statistics
- Understanding of domain-specific applications
Qualification & Experience: 3-5 years as a strong application developer for software products or large applications using relational/NoSQL databases
Behavioural Competences:
- Team player with a ‘can do’ attitude
- Ability to work in an interdisciplinary and multi-cultural environment
- High degree of flexibility, with an independent and proactive working style
- Ability to work well under pressure and to manage multiple, conflicting priorities
- Strong commitment to quality and timely customer service
About You:
- Excellence in academics with computer science, computational linguistics or information science (or equivalent) background
- Interest in and exposure to natural language processing and information extraction
- Sound analytical & conceptual skills to understand business needs and apply NLP/analytics solutions to solve specific business problems
- Familiarity with regular expressions
- Interested in learning new things
- Proactive, flexible, and determined; able to work under pressure and meet deadlines
- Aptitude to learn, think creatively to solve real-world business problems, and work in a global, collaborative team environment
- Good verbal and written English skills.
Key Tasks Include:
- Using command line tools to perform data conversion and analysis
- Supporting other team members in retrieving and archiving experimental results
- Quickly writing scripts to automate routine analysis tasks
- Creating insightful, simple graphics to represent complex trends
- Writing linguistic rules to support information extraction and automatic categorization
- Performing NLP tasks such as part-of-speech tagging, named entity extraction, and text classification and recognition
- Evaluating the performance of NLP systems against metrics such as precision and recall
- Working hands-on with visualization tools such as Tableau, QlikView, and Qlik Sense
- Collaborating with other Language Engineers and Data Specialists in data analysis and feature design efforts
- Working with Engineering teams to deploy your solutions into a production environment