Menu
Jay Thakur

Jay Thakur

Data Analyst

Find me on

About Me

Hi, I'm Jay Thakur — a Data Analyst with a passion for transforming raw data into actionable insights. With a Master's in Data Science and hands-on experience across finance, e-commerce, and AI, I specialize in Python, SQL, and Power BI to solve real-world problems.

What I Do Best

  • Data Storytelling: Turn complex analyses into clear business recommendations (like my NYSE Stock Analysis project)
  • Full-Cycle Analytics: From cleaning messy data (Titanic Dataset) to building ML models (Lead Scoring System)
  • Impact-Driven: Boosted dataset reliability by 40% at Sigma AI Data through precise annotation
  • Technical Toolkit: Python (Pandas, Scikit-learn), SQL, Power BI, Tableau, and statistical modeling

My Approach

I believe data should solve problems, not just sit in spreadsheets. Whether optimizing marketing campaigns or identifying countries needing humanitarian aid, I focus on delivering actionable results.

🔬 Educated at: Kingston University (MSc Data Science) & IIIT Bangalore (PG Diploma)

🏆 Recent Achievement: Mentored students with Exploratory Data Analysis and SQL, achieving 90% proficiency gains

Skills

MS Excel
Microsoft SQL Server/SQL
MS Power BI
Python
Data Cleaning
Dashboard
Problem Solver
Data Story Telling
Pandas
NumPy
Matplotlib
Seaborn
Machine Learning
Data Visualization
Statistical Analysis

Projects

Database Projects

Database Projects (SQL/MySQL/HQL)

Creating and optimizing database queries, designing schemas, and performing complex data analysis using SQL and related technologies.

Exploratory Data Analysis Projects

Exploratory Data Analysis Projects

Analyzing datasets to uncover patterns, trends, and insights using Python, Pandas, and visualization tools like Matplotlib and Seaborn.

Machine Learning Projects

Machine Learning Projects

Building predictive models and implementing machine learning algorithms using Scikit-Learn, TensorFlow, and Keras.

BI Tool Projects

Business Intelligence Projects

Creating interactive dashboards and reports using Power BI and Tableau to visualize and communicate data insights effectively.

Experience

Oct 2024 - Present
Data Annotator (Sigma AI Data Ltd)
Hammersmith, UK
Led live data interpretation using annotation systems, maintaining an error rate below 5%. Enhanced dataset integrity for AI training, improving reliability by over 40% across projects. Optimized audio annotation with data classification and tagging, processing 2,500 files daily. Achieved 98% transcription accuracy, reducing project turnaround times by 30%.
Data Interpretation Data Annotation Data Classification Data Integrity Data Accuracy
Feb 2025 - March 2025
Data Science Intern (Prodigy InfoTech)
Remote
Extracted, transformed, and loaded (ETL) structured data for analysis, ensuring high data integrity and accuracy. Visualized cleaned data using Python, Pandas, NumPy, Matplotlib, and Seaborn to analyze population distribution across 100+ countries. Applied a Decision Tree Classifier on demographic and behavioral data to predict customer purchases with 85% accuracy. Performed sentiment analysis using Python and basic NLP techniques to assess social media opinions on specific topics.
Extract, Transform, and Load (ETL) Python Data Analysis Data Cleaning Data Visualization Decision Tree Classifier Natural Language Programming (NLP) Customer Behavioral Analysis
Feb 2022 - Dec 2022
Implementation Engineer (Sutherland Healthcare Solutions)
Remote
Led Allscripts EMR integration with PACS and RIS systems, and PM systems across 15+ healthcare facilities. Conducted HL7 system integrations and optimized SQL queries, reducing data retrieval times by 30%. Designed and delivered training on onboarding, issue escalation, and administrative documentation. Improved operational efficiency by 40%, generating more than $300,000 in revenue for Allscripts.
Microsoft SQL Server Advanced Excel EMR Integration with PACS and RIS systems PM system Integration HL7 system Integration Data Cleaning Training on Onboarding Issue Escalation
Apr 2021 - Jul 2021
Data Analyst (Kantar Analytics Practice)
Remote
Analyzed marketing campaigns using data assessment, increasing sales by 12% and 8% across industries. Applied advanced Excel functions and statistical analysis to improve Marketing Mix Model accuracy by 20%. Optimized marketing ROI by $500K through strategic budget reallocations based on identified KPIs. Delivered actionable insights that enhanced strategic decision-making and boosted brand awareness by 15%.
Data Assessment Advanced Excel (VLOOKUP, Pivot Table, Conditional Formatting) Data Cleaning Statistical Analysis decision-making Marketing Mix Model Marketing Data Analysis

Education

Kingston University, London
2023 - 2024
Master of Science Data Science
Kingston Upon Thames, UK
Machine Learning Artificial Intelligence Python SQL Database Management Data Analytics and Visualization
International Institute of Information Technology Bangalore (IIIT-B)
2020 - 2021
Postgraduate Diploma in Data Science (Data Analysis)
Bangalore, India
Machine Learning Advanced Excel Hive Query Language AWS Data Analysis Python Statistics SQL Time-Series Analysis
NBN Sinhgad School of Engineering (Pune University)
2014 - 2020
Bachelor of Engineering Computer Science
Pune, India
Machine Learning Artificial Intelligence Data Structures and Algorithms Object Oriented Programming SQL Database Management