Analytics: R Studio & Python
Statistical Analysis, Supervised & Unsupervised ML
model building, Clustering techniques, Data ensemble
methods. Applying Factor Analysis/PCA.
Packages : tidyverse, tidymodels,
dplyr,sklearn,pandas,numpy,saeaborn,XGBoost,Boosting
Visualizations : Tableau, Google Data
Studio, Infogram
Deriving Insights, Data Manipulations & Visualizations
and Building Interactive Dashboards & Story Boards.
Graphics Design : Canva
Ability to create attractive Social media posts and
editing.
BigData: Hadoop & Pyspark
Experienced in handling large datasets using pyspark
API. Competent in building and enhancing data pipe
lines using reusable frameworks to support data need
for financial forecast using pyspark.
Analytics: Google sheets
Scheduled data import from GBQ. Data transfer
within google sheets with filters.
Tools: VLOOKUP, IMPORTRANGE, FILTER,
ARRAYFORMULA.
Relational Database : Postgre SQL,
Google Big Query, MySQL
Importing & Storing data, Data Manipulation
(i.e: Filtering, Sorting, Pattern Matching), Writing
sub-queries and etc.
Cloud Architecture: Google Cloud
Platform
Creating a docker image of R-script and deploying it
into google cloud as a scheduled cron job.
Tools: GCP APIs, Docker.