I’m a data science professional with 2+ years of experience in client-facing roles across healthcare, pharma, e-commerce, manufacturing, and enterprise software. I’m passionate about solving real-world problems through data, and my experience spans across analytics, data engineering, machine learning, deep learning, and generative AI — building predictive models, streamlining pipelines, and driving impactful insights.
Outside of work, I’m a naturally curious and creative soul. Whether I’m sketching out an idea, getting lost in a book (currently diving into The Alchemist), or writing to reflect and unwind. I speak six languages and thrive in conversations that cross cultures and perspectives. I also enjoy stepping into leadership roles and have been actively involved in student and professional organizations throughout my journey.
• Statistical analysis • Data storytelling • Dashboarding • Predictive modeling • A/B testing • Business insights generation .
• Supervised, Unsupervised, Reinforcement Learning • Natural Language Processing (NLP) • Model deployment & monitoring • ML pipeline automation • Model optimisation (hyperparameters and featrue engineering)
• ETL development • Data pipeline design & optimization • Data warehousing (SQL, Snowflake, BigQuery) • Workflow scheduling (Airflow, Prefect)
• Prompt engineering • OpenAI, LangChain, HuggingFace • Fine-tuning transformer models • LLM integration into applications
Jan 2024 - Present
- Supported 130+ students as a TA for database concepts course; researched privacy-preserving cardinality estimation using deep autoregressive models.
- Identified $15M+ in healthcare savings by analyzing patient data and optimizing treatment pathways using predictive modeling.
May 2024 - Dec 2024
- At ASTM, I worked in the Market Intelligence division, contributing to Merger & Acquisation prediction systems, dashboarding, cloud basesd database management, NLP, web & data scraping, and LLM-based application development.
Aug 2022 - Jul 2023
- Delivered data solutions for pharma clients by building ETL pipelines, Power BI dashboards, and predictive models, improving patient retention and enabling early disease detection through EHR data analysis.
May 2021 - Jul 2022
- Led e-commerce analytics initiatives to improve marketing effectiveness,demand forecasting, segmenting customers, and automating business workflows through ML models, time series forecasting, and SAP ERP development.
LLM-powered semantic book recommender—discover your next favorite read with emotion-aware filtering and smart search.
HistGradientBoostingClassifier . TF-IDF . SMOTE . CosineSimilarity . GPT . OpenAI . PCA . t-SNE
Curates real-time posts using Reddit API based on custom preferences using LLMs, stores summaries on AWS, and visualizes insights via dashboards.
Airflow . GPT-3.5 . Python . AWS (S3, Glue, Athena) . Redshift . QuickSight . Docker
Deep learning–powered loan default prediction using multi-source financial data to drive smarter, inclusive lending decisions.
logistic regression . Multi-Layer Perceptron . GridSearchCV . PyTorch . TensorBoard
Forecasting petrol prices using LSTM, ARIMA, and AutoML with an interactive dashboard for model comparison and evaluation.
AutoKeras. AutoML statsmodels. Plotly Dash. ARIMA . LSTM
Detect and explain credit card fraud using anamaly detection unsupervised models, visualize predictions, and explore insights with an interactive application.
Python . PyCaret . XGBoost . scikit-learn . SHAP . Pandas . Seaborn.
Processes raw food delivery data into insights using GCP, Airflow, Beam, and Tableau dashboards for decisions.
Google Cloud Storage . Apache Airflow . Apache Beam . BigQuery . Tableau . Python . Jupyter . Docker.
Upload any WhatsApp group chat to get instant sentiment and network-based interaction analysis.
Python . Streamlit . Pandas . NLTK . Gensim . VADER . NetworkX
CNN-based image classifier with Flask UI for multiclass scenery prediction, data augmentation, and PCA-based feature visualization.
Keras . TensorFlow . Flask . Scikit-learn . OpenCV . Matplotlib . PCA . HTML/CSS (Flask UI).
In collaboration with IU Center of Excellence. Retail A/B testing pipeline to compare checkout versions of IU Retail chain using dbt, Snowflake, S3, and Tableau.
Python . AWS S3 . Snowflake . dbt . Tableau.
Led the university’s IEEE Robotics chapter, delivering lectures in 10+ tech seminars.
Collaborated in events promoting women in tech, mentored junior data science peers.
Led 100+ member student group, organized national-level competitions on automotive innovations.
Engaged 1000+ attendees as the host for TEDx OU’s second edition.