Data Scientist

/
/
Data Scientist
ID : 1720
Education level  : Bachelor's degree
Work experience level  : Expert- (more than year 7)
Work experience in total  : Years
Job type  : Online
Job time  : Maandelijks
Last date of registration :
2022-12-19
Profile description

Data Scientist with a Business Analytics degree from Indian School of Business (ISB), Hyderabad with 10+ years of overall and 5+ years of relevant experience in Machine Learning.

Work experience In details :
Job position
Job description

Data Scientist - Cognizant, Bengaluru - (May 2017 - Present)

  • Loan Matching(JPMC) - LOB – Trade Matching(Commercial Business)
    This solution helps the internal users to match the Receivables & Wires. With this solution the effectiveness of the user’s increases by approx. 90% and the matching which used to take months and getting stacked, now gets done in the same day & there are no pending matching tasks. Using Lavenstein Distance the string distance is calculated among relavant fields of Receivables & Wires data. Also dollar difference of inflow & outflow amount is checked. And finally a classification model is built to give the probability scores, which says the probability match between the given wire & receivable. Here two separate models are built for agency & trading accounts.

    • Tools - Python, sklearn, AWS Sagemaker
  • Compliance Monitoring (JPMC) LOB – Asset & Wealth Management (AWM)
    • Sampling Service for Performance Outlier review
      This solution addresses which group of underperforming accounts/mandates should be investigated by the compliance monitoring officer.
      For dimensionality reduction, predictors are auto-encoded using fully connected neural networks and the dimensionality is reduced. Optimum number of clusters are determined through Sillhouette and clustering is done using DBScan. The results goes through a percentile ranking where a 50 percentile cut off is chosen for sampling. The percentile is done on Composite, Benchmark Z score & Variance. Accounts sampled out are submitted to the compliance monitoring officer for review.
      Local Explainability of individual clusters is implemented using Single Feature Introduction test, which has been implemented from scratch based on a white paper (https://arxiv.org/pdf/1905.09849.pdf) (https://arxiv.org/pdf/1909.13381.pdf)
    • Supervised model to predict underperforming accounts for Performance Outlier review
      The data generated from the above mentioned Sampling Service is used to build a binary classification model – Random Forest, which can predict whether the mandate/account should be investigated by compliance monitoring officer.
  • Prior Authorization( Healthcare )
    This project involves automation of the manual tasks conducted by insurance companies & medical staff. Before conducting any medical procedures / examinations / operation’s prior authorization from provider i.e. insurance company is needed to get the payments.

    • NLP techniques used :- NER, Bert
    • Tools – AWS Textract(OCR), AWS ecosystem (sagemaker, lambda, S3, api gateway etc.)
  • BIO Age for Vitality Determining Biological age from the output of Bioelectrical Impedance Analysis (BIA) devices. When a person stands on the BIA device, outputs are fed into our model (GBM) after data manipulation and the user is given a Biological Age based on his body characteristics
    • Tools - Python, sklearn, AWS Lambda, Sagemaker
  • Loss Control services using Computer Vision(Deep Learning – CNN) This solution hastens the premium generation process by eliminating Risk agent, with premium determination by the underwriter being faster. Solutions built include counting the people wearing hard hats out of those working in warehouse, determining if forklift is driven distractedly, identifying clutters near exit doors & determining if fire extinguisher is out of service.
  • Performed end to end implementation of the models starting from collecting data to creating Endpoints (deployment) in AWS platform.
    • Possess good knowledge of Convolutional Neural Networks. Dealt with image classification using Deep Neural Networks and Rekognition API of AWS. Models were created from scratch as well as Transfer learning was used for classification.
    • Architectures Used:- VGG16, Framework:- Keras Tools:- AWS services(Sagemaker, Lambda, S3, API Gateway)
  • Call Health (Health Care Domain)
    Call Health Services Private Limited, takes health services to door steps of customers in India. It provides health care services at home as well as diagnostic centers, while providing nursing facilities at home. Performed secondary research and collecting data of the various states of India to identify Global cohorts and descriptive analysis. Using CLINKS data of CH, identified the CH cohorts and identified the key outputs and related them with the Global Cohorts and recommending what further can be done with Call Health Business to improve presence and launching extra services in places having Business potential. Built a product/service level recommendation engine to recommend customers other products. Found bags of correlated Diseases & Business using Market Basket Analysis.
  • Private Edge(AIG) Project for an Insurance client, involving calculation of IRPM factor to be used to calculate insurance premium using customer inputs

Module Lead - Mindtree Limited, Bhubaneswar - (February 2015 – March 2017)

Assurant - Policy Admin

  • It was a P&C commercial insurance product offering its services to Farmers of USA.
  • The project had a unique architecture, where implementation is a combination of Multiple LOB’s.
  • The uniqueness was that it had the capability of having Monoline as well as Package policy.
  • Worked on the Policy Admin – implementing coverage’s, Premium calculation algorithm
  • Technology and tools: Duck Creek 4.2.1,TSV Debugger,Trace Monitor,XML Altova spy,XML,SQL Server2008

Senior Software Engineer - Discoverture, Bhubaneswar (Acquired by MindTree) - (August 2012 – February 2015)

  • P&C commercial insurance product offering services to Farmers of USA.
  • Concept of Master and Child policies where certificates were issued in the form of child policies.
  • Worked on Batch requirements of forms, automated processes (Renewal, Non-renewal, Cancellation)
  • Worked on Migration of policies from Mainframe system to DCT
Hard skills
  • Proficiency in Statistics, Linear & Logistic Regression, Cross Validation, Ridge & Lasso Regression, Decision Tree, Ensemble methods (Random Forest, Gradient Boosting etc), Principal Components Regression, Principal Component Analysis (PCA), Clustering (Kmeans, DBScan), NER
  • Experience of working in the complete Analytical and Software development life cycle
  • Knowledge of AWS ecosystem (S3, Lambda, Sagemaker, Rekognition, API gateway, Textract) - Deployed Deep Learning & SCIKIT models in AWS and writing lambda functions to make API calls.
  • Worked on Image Classification CNN models using VGG16
  • Worked on Named Entity Recognition (NER) Models
  • Writing Unit test cases using pytest & unittest for production deployment
  • Experience with getting insights using Tableau.
  • Expertise with Machine Learning using R & Python.
Soft skills
Achievements
  • Indian Statistical Institute (ISI), Bangalore (Machine Learning) - 2016/BA03/05
  • Big Data Certification from Edureka – C89E6E3G
  • Introduction to Probability and Data from Coursera - TPZHJJAE73KV
  • Awarded Mastery Level Certificate from Duck Creek University
Special notes

Data Scientist with proficiency in Statistics, Linear & Logistic Regression, Cross Validation, Ridge & Lasso Regression, Decision Tree, Ensemble methods, Principal Components Regression, PCA, Clustering, NER, S3, Lambda, Sagemaker, Rekognition, API gateway, Textract, Tableau, R & Python.

Meer person

ID : 2042
Associate Software Engineer
Education level: Bachelor's degree
Work experience level: Intermediate- (2-4 year experience)
ID : 2041
System Engineer IT
Education level: Bachelor's degree
Work experience level: Experienced- (4-7 year experience)
ID : 2040
Associate Engineer
Education level: Bachelor's degree
Work experience level: Associate- (1-2 year experience)
ID : 2039
Intern
Education level: Bachelor's degree
Work experience level: Beginner- (internship- 1 year experience)
Mis geen enkele belangrijke kennisgeving houd jezelf update
Begin met chatten!
Wij staan u graag te woord!
Hallo 👋
Kunnen we je helpen?