Jayita Chatterjee

I do Data Science

About Image

About

From the winding streets of a small town in Eastern India to the bustling avenues of Los Angeles, my journey has been driven by a deep curiosity and passion for discovering patterns in data. As a child, I found fascination in mathematics and physics, which naturally led me to the world of engineering and eventually to data science and machine learning.

During my undergraduate studies, I undertook a pivotal project—using computer vision and machine learning to predict fruit quality on an assembly line. This experience opened my eyes to the immense potential of data-driven solutions in solving real-world challenges.

After earning my bachelor’s degree, I joined Wipro as an Associate Consultant, where I worked closely with clients, tackling data problems and contributing to innovative projects in natural language processing and machine learning. This experience fueled my desire to deepen my expertise, leading me to pursue a master’s degree in Electrical Engineering with a focus on Machine Learning and Data Science at the University of Southern California.
Armed with a strong technical foundation and three years of consulting experience, I bring a unique blend of skills and a passion for transforming data into actionable insights. If you're hiring or interested in collaboration, I’d love to connect. Explore my portfolio below or on GitHub!

Skills

Programming

Python

Python

MySQL

MySQL

R

R

Visual Basic

Visual Basic

Data Analytics & Visualization

Tableau

Tableau

Power BI

Power BI

MS Excel

MS Excel

Looker

Looker

Tools

Alteryx

Alteryx

GitHub

GitHub

JIRA

JIRA

SAP

SAP

MS Office

MS Office

Resume

Education

Master of Science - Electrical & Computer Engineering

2022 - 2024

University of Southern California, Los Angeles, CA

B-Tech - Electronics & Communications Engineering

2015 - 2019

University of Engineering & Management, Kolkata, India

Professional Experience

Lead Data Analyst

2022 - 2023

University of Southern California, Los Angeles, CA

Senior Data Analytics Consultant

2021 - 2022

Wipro India Limited, Kolkata, India

Data Analytics Consultant

2019 - 2020

Wipro India Limited, Kolkata, India

Projects

  • All
  • Data Analysis & Visualization
  • Exploratory Data Analysis
  • Machine Learning
AtliQo Bank Credit Card Launch

AtliQo Bank Credit Card Launch

Conducted exploratory data analysis on AtliQo Bank's credit card launch data, analyzing 3 datasets. Identified income disparities across occupations, with business owners earning significantly more. Statistically validated that the new card increased usage and transaction amounts by 15%, uncovering an untapped 18-25 age group market (26% of total).

Random Forest Regression | A/B Testing | Z-test | EDA | Python | Seaborn | Statsmodels

GitHub
Student Performance

Student Performance Insights through EDA

Performed an exploratory data analysis on student performance records, identifying key trends such as gender and parental education impacting scores across math, reading, and writing. Hypothesis testing revealed students who completed a test preparation course scored significantly higher. Visualized findings led to actionable insights for improving academic outcomes.

Statistics | Hypothesis Testing | Z-test | EDA | Python | Seaborn | Statsmodels

GitHub
Most Streamed Songs

Decades of Streaming: Music Insights Across 40 Years

This project performed an exploratory data analysis (EDA) on a dataset of the most streamed songs from the last four decades, analyzing 15+ streaming and social media platforms. It uncovered key trends such as a 25% dominance of non-explicit tracks, peak album releases in May, and strong correlations between TikTok and Spotify metrics.

Statistics | Hypothesis Testing | EDA | Python | Seaborn | Statsmodels

GitHub
House Price Prediction

ANN Regression House Price Prediction

Developed an Artificial Neural Network (ANN) model to predict house prices using the California Housing Prices dataset. Preprocessed 20,000+ data rows, optimized the model with early stopping, and achieved a mean absolute error (MAE) of 0.3. Deployed the model via a Streamlit app for real-time user interaction and predictions.

ML | DL | Python | Keras | Tensorflow | Streamlit

GitHub
Classifying Customer Churn

ANN Classification Customer Churn Prediction

Developed an Artificial Neural Network (ANN) model to predict customer churn with 86.4% accuracy. The project involved data preprocessing, model training, and evaluation using metrics like confusion matrix and F1-score. Deployed the model via a Streamlit app, enabling real-time predictions based on customer input for business decisions.

ML | DL | Python | Keras | Tensorflow | Streamlit

GitHub
Flight Price Prediction and Real-Time Deployment

Flight Price Prediction and Real-Time Deployment

Developed and deployed a flight price prediction model using Random Forest in a Streamlit app, improving prediction accuracy from an initial R² of 0.1607 to 0.7461, and providing real-time price insights based on features like Travel Time, Cabin, and Airline.

Python | EDA | OLS Regression | Random Forest Regression | Streamlit

GitHub
Default Credit Card Prediction

Credit Card Default Prediction

Developed a credit card default prediction model using machine learning techniques. Preprocessed data, trained Logistic Regression as a baseline (AUC 0.7474), and enhanced accuracy using XGBoost with under-sampling (AUC 0.7764, Gini 0.5529). Deployed the best model using a Streamlit app for real-time prediction of customer defaults.

Python | Classification | Logistic Regression | XGBoots | Random Forest Classifier | Support Vector Classifier

GitHub
Human Resources Dashboard

Human Resources Dashboard

Developed an interactive HR dashboard using Excel and DAX to visualize key metrics like employee performance and recruitment trends, improving decision-making and boosting recruitment efficiency by 20%

Advanced MS Excel | VBA | Macros | Pivot Table | Slicer | Power Query

GitHub
Healthcare Management Optimization

Healthcare Management Optimization

Improved hospital capacity management by analyzing patient stay duration, admissions, and resource utilization using SQL and Tableau, leading to data-driven decisions that enhanced patient flow and reduced overcrowding by 12%, optimizing bed allocation and hospital operational efficiency.

MySQL | MS Excel | Interactive Tableau Dashboard

GitHub
Sales Performance Enhancement Using Data Analytics and Visualization

Sales Performance Enhancement Using Data Analytics and Visualization

Boosted sales performance by analyzing product sales, regional trends, and customer behavior using Excel, SQL, and Tableau, identifying high-performing segments that led to targeted marketing strategies and a 15% increase in customer retention and a 20% improvement in sales efficiency.

MySQL | MS Excel | Interactive Tableau Dashboard

GitHub
Supply Chain Optimization and Performance Enhancement

Supply Chain Optimization and Performance Enhancement

Boosted sales performance by analyzing product sales, regional trends, and customer behavior using Excel, SQL, and Tableau, identifying high-performing segments that led to targeted marketing strategies and a 15% increase in customer retention and a 20% improvement in sales efficiency.

MySQL | MS Excel | Interactive Tableau Dashboard

GitHub

Blogs

Blog 1

Healthcare Management Optimization: A Comprehensive SQL-Driven Analysis

Read more on Medium
Blog 2

Healthcare Management Optimization: A Comprehensive Analysis using Excel, SQL, and Tableau

Read more on Medium
Blog 3

From Foodie to Restaurant Data Detective: Cracking the Code of Restaurant Success on Yelp

Read more on LinkedIn