Open to Data Scientist & Analytics Roles

Mukesh Kumar

Data Scientist & Data Analyst · Machine Learning · Business Intelligence

I transform complex datasets into actionable business insights — building ML models, ETL pipelines, interactive dashboards, and full-stack data applications across retail, hospitality, and finance domains.

4+ End-to-end Projects
4 Professional Certifications
40% Query Time Reduction
$2.7M Inventory Insight Found
About

Who I Am

I'm a Data Scientist & Data Analyst with hands-on experience delivering end-to-end analytical and machine learning solutions across multiple domains. My work spans building optimized SQL pipelines, conducting EDA and hypothesis testing in Python, training ML models, and creating interactive Power BI dashboards that surface real business value.

Currently pursuing an M.E. in Georesource & Geoenergy at Politecnico di Torino, I bring a research-oriented mindset to every data problem — combining technical rigour with clear, stakeholder-friendly communication.

I'm actively seeking Data Scientist and Data Analytics roles where I can drive measurable impact through machine learning and data-driven decision making.

Location: Turin, Italy 🇮🇹
Role: Data Scientist & Analyst
Education: M.E. Georesource & Geoenergy
University: Politecnico di Torino
Languages: English, Urdu, Sindhi
Domain Focus: Retail, Hospitality, Finance
Technical Skills

What I Work With

A full-stack data toolkit spanning analysis, visualisation, and engineering.

🐍
Languages & Libraries
Python, SQL, Pandas, NumPy, Scikit-learn, Matplotlib, Seaborn, Plotly, Bokeh, SciPy
📊
Analytics & Statistics
EDA, Hypothesis Testing, Regression, Classification, Clustering, K-Means / DBSCAN, k-NN, Data Wrangling
🛠️
Tools & Platforms
Power BI, FastAPI, Streamlit, MySQL, Jupyter, GitHub, Dash, Excel
🔄
Data Engineering
ETL Pipelines, Data Modeling, CTEs, KPI Reporting, Relational DBs, Dashboard Dev, Geospatial Analysis
Projects

Work That Speaks

End-to-end data projects — from raw SQL to interactive dashboards and ML pipelines.

🏪
Vendor Performance Analysis — Retail Inventory & Sales
Vendor Performance Dashboard
  • Developed and optimised a SQL ETL pipeline that builds an aggregated summary table from multiple source tables; restructuring the queries with CTEs cut query time by about 40% on large datasets.
  • Conducted EDA and hypothesis testing in Python to evaluate vendor profitability, pricing strategy effectiveness, and inventory turnover.
  • Identified over-dependence on top 10 vendors (65.7% of purchases) and uncovered $2.71M in unsold inventory from low-performing vendors, recommending diversification and inventory optimisation.
  • Built an interactive Power BI dashboard visualising vendor performance, profit margins, and bulk purchasing impact (72% cost reduction).
SQL, Python, Power BI, EDA, Hypothesis Testing
🏨
Hospitality Analytics — AtliQ Hotels
Revenue by Booking Platform
  • Performed EDA on 10,000+ hotel booking records across multiple properties.
  • Identified the top 3 drivers of booking cancellations.
  • Produced insights estimated to reduce cancellation rates by 15–20%.
  • Delivered dynamic pricing and demand forecasting recommendations.
Python, Pandas, NumPy, Matplotlib, EDA
📦
Vendor Invoice Intelligence System
Freight Cost Prediction · Invoice Risk Flagging
  • Built a dual-model machine learning system for freight prediction and invoice risk detection in finance workflows.
  • Achieved R² of about 0.97 (MAE about 24) in regression and about 89% accuracy (F1 about 0.82) in classification, using Random Forest.
  • Implemented an end-to-end pipeline (SQL feature engineering → model training → inference).
  • Performed EDA and statistical testing (t-tests) to identify key drivers of invoice anomalies.
  • Deployed an interactive Streamlit app for real-time invoice evaluation, predictions, and explainable insights to support decisions.
Python, Machine Learning, Random Forest, SQL, Streamlit, GridSearchCV
💰
Full-Stack Expense Tracking System
Expense Tracker Form · Expense Category Breakdown
  • Built a REST API backend with FastAPI, Pydantic, and MySQL.
  • Built a real-time Streamlit dashboard that replaces spreadsheet-based tracking.
  • Eliminated manual reporting (100% reduction in manual reporting time).
  • Tracks and visualizes spending across multiple budget categories.
FastAPI, Streamlit, MySQL, Python, Pydantic
More on GitHub
Certifications

Credentials

Industry-recognized certifications from Google, IBM, and Codebasics.

Google · Advanced Data Analytics Professional Certificate · Coursera
Google · Data Analytics Professional Certificate · Coursera
IBM · Data Scientist Professional Certificate · Coursera
Codebasics · Python: Beginner to Advanced for Data Professionals · Codebasics
Education

Academic Background

2025 – Present
M.E. in Georesource and Geoenergy
Politecnico di Torino
Turin, Italy
2013 – 2017
B.E. in Mining Engineering
Mehran University of Engineering & Technology
Pakistan · First Class Distinction

Download My Resume

Full CV with all projects, technical skills, certifications, and contact details — ready for ATS and recruiter review.

Contact

Let's Connect

Open to Data Scientist and Data Analytics roles. Feel free to reach out.

I'm actively seeking opportunities in data science, data analytics, and machine learning. Whether you have a role, a collaboration idea, or just want to talk data, my inbox is open.

📍 Based in Turin, Italy — open to remote and on-site roles across Europe.

Send an Email