Projects
Sentiment Analysis


๐ง NLP Sentiment Rating Prediction Project โ Key Highlights
๐ Project Goal: Built an NLP model to predict sentiment ratings (0โ10) from textual feedback in a coaching institute context.
๐ Data: 200,000+ unique reviews generated with realistic coaching-related expressions (e.g., "teachers good", "not up to mark", "average", etc.).
๐งน Preprocessing: Applied text cleaning, tokenization, stopword removal, and lemmatization using NLTK and re.
๐ง Feature Engineering: Used TF-IDF vectorization and Word2Vec to convert text into numerical features.
๐ Modeling: Trained regression models including Linear Regression to predict sentiment scores.
๐งช Evaluation: Achieved high prediction accuracy with reduced MAE and RMSE; selected the best-performing model for deployment.
๐ Deployment: Deployed the final model using Streamlit for real-time prediction with a user-friendly web interface.
๐ผ Business Impact: Automated sentiment scoring reduced manual review time, minimized human bias, and improved feedback analysis efficiency by 74%


Objective:
To analyze one year of realistic, multi-source e-commerce data and deliver data-backed strategies to achieve at least ๐ 15% revenue growth through optimization across marketing, logistics, and customer experience.
๐ Defined and monitored KPIs: Net Revenue, Profit Margin, AOV, Return Rate, Cart Abandonment, CAC, ROAS
๐ง Built Excel and Power BI dashboards to explore trends across loyalty, product category, city tier, and conversion funnel
๐ ๏ธ Used SQL (BigQuery) for deep-dive analysis using joins, aggregations, and window functions
๐งฌ Applied Cohort Analysis & RFM Segmentation in Python to identify customer lifecycle groups (e.g., Champions, At-Risk)
๐ Conducted statistical testing (T-Test, ANOVA, Chi-square, Z-Test) to validate differences with statistical confidence
๐ Simulated 9+ business strategies (e.g., loyalty conversion, AOV uplift, funnel improvement) to forecast revenue impact
๐ Developed a clean, interactive Streamlit app for real-time KPI tracking and scenario simulation
๐งฐ Tools Used:
Python ๐ | SQL ๐งพ | Power BI ๐ | Excel ๐ | Streamlit ๐ | Plotly ๐ | SciPy ๐ฌ
โ Outcome:
Achieved a 15.03% increase in projected annual revenue through strategic optimization, simulation, and data storytelling โ all backed by domain knowledge and statistical evidence.
Blinkit Revenue Optimization & Simulation Dashboard




Student Churn Prediction Engine
๐ฏ Objective
To proactively identify students at risk of dropping out from a coaching institute and enable timely interventions to improve retention and reduce revenue loss.
๐ ๏ธ Tools & Technologies
Python, pandas, scikit-learn, Streamlit, pickle (for model deployment)
๐ Project Overview
I developed a machine learning model using Random Forest to predict student churn based on academic performance, attendance, feedback sentiment, and behavioral scores. The final model achieved 97% accuracy and was deployed as an interactive Streamlit web app for real-time prediction and counselor use.
๐ผ Business Impact
Correctly flagged 14,142 high-risk students
Enabled interventions that retained 5,656 students
Saved approximately โน2.83 Crores in potential revenue loss
๐ Outcome
This project showcases my ability to solve real business problems using end-to-end data science โ from feature engineering and model building to deployment and impact simulation.

