NOOPUR DIVEKAR

Data Analyst · Data Scientist · AI Engineer · Forward Deployed Engineer

Turning complex data into strategic decisions, from $994M risk mitigation dashboards to multi-agent AI systems, I build solutions and products that move the needle.


Building things that solve problems and drive impact.

I’m a Data Scientist and AI Engineer who recently graduated with a Master’s in Data Science from Indiana University Bloomington (May 2026).

I specialize in architecting end-to-end data and AI projects that automate manual processes and transform raw information into actionable business intelligence. From engineering ML pipelines on AWS and building full-stack multi-agent AI systems to delivering Power BI dashboards that mitigate nearly $1B in operational risk, I bring a rare blend of technical depth and business acumen. I’ve worked across startups (Project 990 Inc.), Fortune 500 companies (Zoetis), university administration (IU Admissions), and academic research (The Millennium Project).

My approach is centered on active stakeholder collaboration: I deep-dive into the problem, engineer the right solution, and distill technical findings into clear, persuasive narratives that drive decisions. I build solutions that matter.

Where I’ve made impact.

Jan 2026 — Present

Data Scientist

Project 990 Inc.
  • Led 2 analysts to architect a prompt-agnostic LLM classification pipeline (Llama 3.3 70B, Gemma, Tree-of-Thought reasoning) to categorize 300K+ nonprofit mission statements into 27 NTEE codes, reducing manual classification by ~83%.
  • Architected a regex-based text extraction pipeline to strip IRS boilerplate and surface clean mission statement data from raw 990 filings, establishing the high-fidelity data foundation required for reliable downstream NTEE classification.
  • Built Tableau dashboards on millions of IRS 990 & 990PF records to map funder similarity networks and charitable giving patterns, enabling data-driven funding strategy for grant officers and nonprofit researchers.

May 2025 — Dec 2025

Data Analytics Intern

Zoetis Inc. — Kalamazoo, Michigan $994M Impact
  • Surfaced $994M in potential annual operational risk by engineering Power BI dashboards and a reporting tool on animal health pharmaceutical supply chain data, equipping 10+ senior leaders to drive strategic capacity utilization decisions.
  • Eliminated siloed capacity reporting by consolidating 4+ SQL servers, SAP ECC, and Excel sources into a unified Power BI data model with the first standardized capacity planning formula across packaging and formulation teams.
  • Drove $345M in cost savings by deploying an interactive Power BI scenario simulation tool with advanced DAX, enabling leaders to model OEE improvements, line shifts, and demand spikes to inform equipment upgrade decisions.

Aug 2024 — May 2026

Graduate Data Analyst and Assistant

Indiana University Admissions
  • Orchestrated and delivered IU information presentations to diverse stakeholder groups, including over 4,000 prospective students, articulating university value propositions to drive enrollment interest.
  • Served as the primary liaison for large-scale admissions events, coordinating real-time communications between staff and attendees.
  • Translated three years of complex outreach and recruitment data into actionable business insights using advanced Excel and Power BI, presenting findings to senior leadership.

Oct 2023 — Dec 2023

Data Science Intern

TechnoHacks EduTech — Mumbai, India
  • Engineered Random Forest and XGBoost ensemble models in Python to forecast financial budget allocation for consulting clients, achieving a 25% improvement in prediction accuracy and enabling more reliable resource planning.
  • Architected an automated feature engineering pipeline using SQL and Python to preprocess financial datasets, reducing model training time from 6 hrs to 2 hrs and accelerating ML model iteration across client workflows.
  • Deployed interactive Tableau dashboards monitoring ML model performance and financial KPIs, surfacing insights that contributed to an 18% increase in client conversion rates.

Jul 2023 — Sep 2023

Data Science & Business Analytics Intern

The Sparks Foundation — Mumbai, India
  • Uncovered 4+ operational bottlenecks in grant disbursement workflows by querying 1,500+ grant records using SQL and Excel, delivering findings that reduced application processing time by ~18% across program departments.
  • Designed Tableau dashboards tracking Key Risk Indicators (KRIs) and KPIs, including budget burn rate and grant approval velocity, equipping 5 program directors with real-time visibility to support weekly operational risk reviews.
  • Facilitated UAT for a grant management system rollout, validating 27+ functional requirements and flagging 3 critical data integrity issues pre-deployment to ensure enterprise risk management compliance.

What I bring to the table.

Python
SQL
R
PySpark
DAX
Power BI
Tableau
Looker
Advanced Excel
Plotly
Matplotlib
Weights & Biases
XGBoost
Random Forest
NLP
Transformers
HuggingFace
NLTK
Computer Vision
OpenCV
TensorFlow/Keras
PyTorch
Causal Inference
Uplift Modeling
A/B Testing
Databricks
Microsoft Fabric
dbt
ETL/ELT Pipelines
Prefect
PostgreSQL
MySQL
SQL Server
SQLite
Oracle
Google BigQuery
AWS (S3, EC2, SageMaker)
Azure Cloud
Git
GitHub Actions
SAP ECC
Claude Code
ChatGPT
Gemini
Prompt Engineering
Vibe Coding
LLM Fine-Tuning
Groq API
LangGraph

What I’ve been working on.

ReadyAid: RAG-Based Offline Mobile First-aid Assistant with LLM Streaming

Full-stack Android first-aid app with local RAG pipeline, real-time LLM streaming via FastAPI, ChromaDB, and Ollama.

Kotlin · Jetpack Compose · FastAPI · RAG · ChromaDB · Ollama

Amazon Marketplace Intelligence Platform

End-to-end cloud analytics pipeline: BigQuery ELT with dbt for 1.4M products across 4 Amazon domains with Power BI reporting.

BigQuery · dbt · Power BI · Python · SQL · ETL Pipeline

Smart Hospital Multi-Agent AI System

5-agent LangGraph system automating clinical logistics: triage, scheduling, pharmacy, lab, and patient agents using Groq LLM.

LangGraph · LLM · PostgreSQL · Python · Agentic AI · API Integration

Telco Customer Retention Optimizer

Causal uplift modeling with Power BI integration to optimize retention strategies using A/B testing.

Python · Scikit-Learn · Uplift Modeling · Power BI · A/B Testing · Causal Analysis

Cloud-Native Customer Churn Prediction Pipeline

Full AWS pipeline: PySpark ETL, feature engineering, XGBoost on SageMaker Canvas, Power BI dashboard.

AWS · PySpark · XGBoost · SageMaker · Power BI

Super Bowl Win Probability Model

Logistic regression model predicting Super Bowl outcomes using historical NFL statistics.

Python · Scikit-Learn · Logistic Regression · SQL · nfl_data_py

Global Policy Scenario Dashboard

Power BI dashboard with DAX what-if sliders and sensitivity analysis tracking 29 SOFI indicators for a UN think tank.

Power BI · DAX · Python · Sensitivity Analysis · Advanced Excel · What-If Analysis

AI-Powered YouTube Learning Path Generator

NLP-powered system that structures YouTube videos into personalized learning paths using TF-IDF and cosine similarity.

Python · YouTube API · NLTK · NetworkX · TF-IDF

Global Internet Penetration Trend Analysis

World-scale analysis of internet usage data with interactive Plotly visualizations and statistical analysis.

Python · SQL · Plotly · K-Means Clustering · SQLite

Diet Conversational AI — LLM Fine-Tuning

Fine-tuned Llama 3.2 on nutrition QA datasets using QLoRA and HuggingFace for a dietary advice chatbot.

PyTorch · HuggingFace · Flan-T5 · Weights & Biases · NLP

Industry-validated credentials.

Microsoft

Power BI Data Analyst Associate (PL-300)

Credential ID: 5F24CF7B8DDFAE2C

Microsoft

Fabric Analytics Engineer Associate (DP-600)

Credential ID: 4E6EC08E0131E3B7

DataCamp

Associate Data Analyst in SQL

Credential ID: DAA0013414919458

DataCamp

Data Analyst in Power BI

Credential ID: DA0011635069697

IBM

Data Science Professional Certificate

Credential ID: BPCL3UEPXHLT

Academic foundation.

Master of Science, Data Science

Indiana University Bloomington
GPA: 3.82 / 4.0 · May 2026
Relevant Coursework: Applied Machine Learning, Information Visualization, Big Data Analytics, Statistics, Data Visualization, Social Media Mining, Management of Big Data, Applied Database Technologies, Independent Study

Bachelor of Engineering, Electronics

University of Mumbai
GPA: 3.7 / 4.0 · May 2024
Relevant Coursework: Database Management System (SQL), Database Management System (MongoDB), Programming for Problem Solving (Python, JavaScript, Node.js), Machine Learning

What it’s like to have me on the team.

I had the pleasure of working with Noopur on a visualization project for the State of the Future Index (SOFI) of the Millennium Project, as part of her Information Visualization course at Indiana University. The task was to create an interactive model to show the usefulness of SOFI for analysis and policy making. As team leader, Noopur demonstrated exceptional leadership both in communicating with us as a client and in guiding the project to a professional-grade deliverable. She showed an exceptional understanding of the large, deeply interconnected dataset and the goal, engineering a sophisticated solution using advanced Power BI and complex DAX. An interactive ‘What-If’ feature included in the model makes it possible to simulate various scenarios compounding environmental changes in real time. Her ability to execute such a complex model under tight deadlines proves an exceptional combination of technical and leadership skills. I would definitely work with her again and highly recommend her for any data science or analytics role.

Elizabeth Florescu · Director of Research, The Millennium Project · Project Mentor

I highly recommend Noopur for her outstanding skills in data analytics, especially with Power BI. During her internship, Noopur demonstrated impressive technical expertise and initiative by developing a complex capacity utilization dashboard in record time. Her ability to quickly understand business requirements, translate them into actionable insights, and deliver a polished, interactive dashboard exceeded expectations. Noopur combines strong analytical thinking with a hands-on approach to problem-solving. Her proficiency with Power BI — including data modeling, visualization, and DAX — was evident throughout the project, and the dashboard she built has become an invaluable tool for our team. I have no doubt that Noopur will continue to excel and add value wherever she goes. I wholeheartedly endorse her Power BI and data analytics skills!

Steven Rivard · Senior Team Leader, KZO NextGen (SAP S/4HANA), Zoetis · Manager at recent internship

Milestones & recognition.

🏆

Luddy Hackathon Winner

Second runner-up — designed and deployed the Smart Hospital Multi-Agent AI System (SHMAS) within a strict 48-hour deadline, implementing a 5-agent architecture to automate clinical logistics.

September 2025

📖

Conference Publication — Multicon 2023 International Conference

Published and presented: “Hybrid Approach of Seed Spreading and Weed Plucking for Enhanced Crop Yields using Machine Learning.” Featured in Industrial Internet of Things for Responsible Technology.

ISBN 9781032829401 · May 2023

Got a role in mind or just want to connect? Let’s talk.

I’m always open to discussing data science opportunities, collaborative projects, or just connecting over shared interests in AI and analytics.