
Jayita Chatterjee
I do Data Science

About
From the winding streets of a small town in Eastern India to the bustling avenues of Los Angeles, my journey has been driven by a deep curiosity and passion for discovering patterns in data. As a child, I found fascination in mathematics and physics, which naturally led me to the world of engineering and eventually to data science and machine learning.
During my undergraduate studies, I undertook a pivotal project—using computer vision and machine learning to predict fruit quality on an assembly line. This experience opened my eyes to the immense potential of data-driven solutions in solving real-world challenges.
After earning my bachelor’s degree, I joined Wipro as an Associate Consultant, where I worked closely with clients, tackling data problems and contributing to innovative projects in natural language processing and machine learning. This experience fueled my desire to deepen my expertise, leading me to pursue a master’s degree in Electrical Engineering with a focus on Machine Learning and Data Science at the University of Southern California.
Armed with a strong technical foundation and three years of consulting experience, I bring a unique blend of skills and a passion for transforming data into actionable insights. If you're hiring or interested in collaboration, I’d love to connect. Explore my portfolio below or on GitHub!
Skills
Programming
Python
MySQL
R
Visual Basic
Data Analytics & Visualization

Tableau

Power BI
MS Excel

Looker
Tools
Alteryx
GitHub
JIRA

SAP

MS Office
Resume
Education
Master of Science - Electrical & Computer Engineering
2022 - 2024
University of Southern California, Los Angeles, CA
B-Tech - Electronics & Communications Engineering
2015 - 2019
University of Engineering & Management, Kolkata, India
Professional Experience
Lead Data Analyst
2022 - 2023
University of Southern California, Los Angeles, CA
Senior Data Analytics Consultant
2021 - 2022
Wipro India Limited, Kolkata, India
Data Analytics Consultant
2019 - 2020
Wipro India Limited, Kolkata, India
Projects
- All
- Data Analysis & Visualization
- Exploratory Data Analysis
- Machine Learning

AtliQo Bank Credit Card Launch
Conducted exploratory data analysis on AtliQo Bank's credit card launch data, analyzing 3 datasets. Identified income disparities across occupations, with business owners earning significantly more. Statistically validated that the new card increased usage and transaction amounts by 15%, uncovering an untapped 18-25 age group market (26% of total).
Random Forest Regression | A/B Testing | Z-test | EDA | Python | Seaborn | Statsmodels
GitHub
Student Performance Insights through EDA
Performed an exploratory data analysis on student performance records, identifying key trends such as gender and parental education impacting scores across math, reading, and writing. Hypothesis testing revealed students who completed a test preparation course scored significantly higher. Visualized findings led to actionable insights for improving academic outcomes.
Statistics | Hypothesis Testing | Z-test | EDA | Python | Seaborn | Statsmodels
GitHub
Decades of Streaming: Music Insights Across 40 Years
This project performed an exploratory data analysis (EDA) on a dataset of the most streamed songs from the last four decades, analyzing 15+ streaming and social media platforms. It uncovered key trends such as a 25% dominance of non-explicit tracks, peak album releases in May, and strong correlations between TikTok and Spotify metrics.
Statistics | Hypothesis Testing | EDA | Python | Seaborn | Statsmodels
GitHub
ANN Regression House Price Prediction
Developed an Artificial Neural Network (ANN) model to predict house prices using the California Housing Prices dataset. Preprocessed 20,000+ data rows, optimized the model with early stopping, and achieved a mean absolute error (MAE) of 0.3. Deployed the model via a Streamlit app for real-time user interaction and predictions.
ML | DL | Python | Keras | Tensorflow | Streamlit
GitHub
ANN Classification Customer Churn Prediction
Developed an Artificial Neural Network (ANN) model to predict customer churn with 86.4% accuracy. The project involved data preprocessing, model training, and evaluation using metrics like confusion matrix and F1-score. Deployed the model via a Streamlit app, enabling real-time predictions based on customer input for business decisions.
ML | DL | Python | Keras | Tensorflow | Streamlit
GitHub
Flight Price Prediction and Real-Time Deployment
Developed and deployed a flight price prediction model using Random Forest in a Streamlit app, improving prediction accuracy from an initial R² of 0.1607 to 0.7461, and providing real-time price insights based on features like Travel Time, Cabin, and Airline.
Python | EDA | OLS Regression | Random Forest Regression | Streamlit
GitHub
Credit Card Default Prediction
Developed a credit card default prediction model using machine learning techniques. Preprocessed data, trained Logistic Regression as a baseline (AUC 0.7474), and enhanced accuracy using XGBoost with under-sampling (AUC 0.7764, Gini 0.5529). Deployed the best model using a Streamlit app for real-time prediction of customer defaults.
Python | Classification | Logistic Regression | XGBoots | Random Forest Classifier | Support Vector Classifier
GitHub
Human Resources Dashboard
Developed an interactive HR dashboard using Excel and DAX to visualize key metrics like employee performance and recruitment trends, improving decision-making and boosting recruitment efficiency by 20%
Advanced MS Excel | VBA | Macros | Pivot Table | Slicer | Power Query
GitHub
Healthcare Management Optimization
Improved hospital capacity management by analyzing patient stay duration, admissions, and resource utilization using SQL and Tableau, leading to data-driven decisions that enhanced patient flow and reduced overcrowding by 12%, optimizing bed allocation and hospital operational efficiency.
MySQL | MS Excel | Interactive Tableau Dashboard
GitHub
Sales Performance Enhancement Using Data Analytics and Visualization
Boosted sales performance by analyzing product sales, regional trends, and customer behavior using Excel, SQL, and Tableau, identifying high-performing segments that led to targeted marketing strategies and a 15% increase in customer retention and a 20% improvement in sales efficiency.
MySQL | MS Excel | Interactive Tableau Dashboard
GitHub
Supply Chain Optimization and Performance Enhancement
Boosted sales performance by analyzing product sales, regional trends, and customer behavior using Excel, SQL, and Tableau, identifying high-performing segments that led to targeted marketing strategies and a 15% increase in customer retention and a 20% improvement in sales efficiency.
MySQL | MS Excel | Interactive Tableau Dashboard
GitHubBlogs

Healthcare Management Optimization: A Comprehensive Analysis using Excel, SQL, and Tableau
Read more on Medium