Now
Stanford Institute for Human-Centered Artificial Intelligence (HAI) · Graduate Researcher
Stanford, CA
2024–Present
- Developed ConvoLearn, a 40K-turn post-training dataset for strengthening dialogic capabilities of LLMs in tutoring contexts.
- Directed large-scale human data operations (750+ teachers across provisioning, annotation, and evaluation): recruitment, task design, qualification gating, and pilot protocols for 20-turn dialogues aligned to a pedagogical framework.
- Designed a multi-stage QA pipeline with hybrid human-LLM annotation, inter-rater reliability analysis, safety filtering, and expert adjudication, producing a high-quality, SFT-ready dataset.
- Conducted an ecological validity analysis linking the dataset to meaningful signal on real-world educational outcomes.
- Fine-tuned multiple 7–8B open-weight LLMs (Mistral-7B-Instruct, Qwen2.5-7B-Instruct, Llama-3.1-8B-Instruct) using QLoRA with progressive multi-turn sampling.
- Built intrinsic and human evaluation pipelines (RoBERTa-based effectiveness model and blinded teacher study), benchmarking against a proprietary state-of-the-art model.
- Led the research program across platform development, data operations, modeling, evaluation, grants, paper writing, and public dataset release on Hugging Face.
IBM Research · Research Collaborator
Stanford, CA
2025–Present
- Joint industry-lab work on AI-based simulation of personas and their evaluation, grounding persona models in real datasets and designing human-in-the-loop mechanisms for co-design and controllability.
Department of Biology, Stanford University · AI Assessment Developer
Stanford, CA
Jul 2025–Present
- Developed a RAG system for prerequisite assessment generation across three courses in Stanford’s introductory Biology sequence.
- Integrated SME review into item generation and piloted assessments over three quarters (N > 500 students).
- Performed IRT-based refinement of prerequisite assessments, including validation, reliability, and multiple forms of validity analysis, to inform the department about factors influencing student performance.
Rising Academies · NLP Researcher
Remote
2025
- Developed a large-scale binary annotated dataset (12,000+ instances) of feedback from West Africa on actionability (actionable vs. vague).
- Developed a semi-supervised technique to build the dataset: rubric grounded in academic literature on feedback, manual coding of a subset, inter-annotator agreement analysis, and RoBERTa fine-tuning on the hand-labeled data to scale labeling.
- Performed interpretability analysis to quantify trends in linguistic features of actionable feedback from the classified dataset; paper published at the 20th ACL BEA workshop.
Stanford University · MS, Education Data Science
Stanford, CA
2024–Present
- Specialization in natural language processing and measurement.
- Awarded the Stanford HAI RA position: 50% tuition + stipend.
- Awarded travel scholarships to ACL 2025 (Vienna) and ASU+GSV 2026 (San Diego).
- Awarded the Stanford-Peking Fellowship on Generative AI + Healthcare for a two-week bootcamp in Beijing and Shenzhen.
- Graduate Specialist for Outcomes at Stanford's Office for Inclusion, Belonging & Intergroup Communication.
Earlier
UNESCO MGIEP · Research & Data Science Officer
Delhi, India
Jul 2020–Sep 2024
- Led evaluation of 5+ quasi-experimental educational interventions across multi-country sites: data collection protocols and analysis plans to inform policymakers in UNESCO member states.
- Designed and validated English and Hindi scales for social-emotional learning in students and teachers using psychometric analysis (factor analysis, IRT, measurement invariance, reliability and validity); all published in peer-reviewed journals.
- Managed cross-site data collection with implementation partners, ensuring data quality, ethical compliance, and standardized research procedures.
- Published six peer-reviewed publications; rated Exceeding Expectations in 2021 and 2023.
Alliance Française de Delhi · DELF B1
Delhi, India
2019
- Obtained the Diplôme d'Études en Langue Française (B1), the official certificate of intermediate proficiency in French (mostly self-taught).
United Nations Volunteers · Infographics Volunteer
Remote
2018–2019
- Transformed statistics-dense reports into infographics for UNICEF Nepal, UNV DRC, OECD, and FAO Rome.
Delhi Technological University (DTU) · B.Tech, Electrical & Electronics Engineering
Delhi, India
2016–2020
- Concentration in electrical engineering and data science.
- Secretary, Rotaract Club of DTU.
- Merit Scholar, ranked 1st of 85 (senior year).