experience | Mayank Sharma

Now

Stanford Institute for Human-Centered Artificial Intelligence (HAI) · Graduate Researcher

Stanford, CA

2024–Present

Developed ConvoLearn, a 40K-turn post-training dataset for strengthening dialogic capabilities of LLMs in tutoring contexts.
Directed large-scale human data operations (750+ teachers across provisioning, annotation, and evaluation): recruitment, task design, qualification gating, and pilot protocols for 20-turn dialogues aligned to a pedagogical framework.
Designed a multi-stage QA pipeline with hybrid human-LLM annotation, inter-rater reliability analysis, safety filtering, and expert adjudication, producing a high-quality, SFT-ready dataset.
Conducted an ecological validity analysis linking the dataset to meaningful signal on real-world educational outcomes.
Fine-tuned multiple 7–8B open-weight LLMs (Mistral-7B-Instruct, Qwen2.5-7B-Instruct, Llama-3.1-8B-Instruct) using QLoRA with progressive multi-turn sampling.
Built intrinsic and human evaluation pipelines (RoBERTa-based effectiveness model and blinded teacher study), benchmarking against a proprietary state-of-the-art model.
Led the research program across platform development, data operations, modeling, evaluation, grants, paper writing, and public dataset release on Hugging Face.

IBM Research · Research Collaborator

Stanford, CA

2025–Present

Joint industry-lab work on AI-based simulation of personas and their evaluation, grounding persona models in real datasets and designing human-in-the-loop mechanisms for co-design and controllability.

Department of Biology, Stanford University · AI Assessment Developer

Stanford, CA

Jul 2025–Present

Developed a RAG system for prerequisite assessment generation across three courses in Stanford’s introductory Biology sequence.
Integrated SME review into item generation and piloted assessments over three quarters (N > 500 students).
Performed IRT-based refinement of prerequisite assessments, including validation, reliability, and multiple forms of validity analysis, to inform the department about factors influencing student performance.

Rising Academies · NLP Researcher

Remote

2025

Developed a large-scale binary annotated dataset (12,000+ instances) of feedback from West Africa on actionability (actionable vs. vague).
Developed a semi-supervised technique to build the dataset: rubric grounded in academic literature on feedback, manual coding of a subset, inter-annotator agreement analysis, and RoBERTa fine-tuning on the hand-labeled data to scale labeling.
Performed interpretability analysis to quantify trends in linguistic features of actionable feedback from the classified dataset; paper published at the 20th ACL BEA workshop.

Stanford University · MS, Education Data Science

Stanford, CA

2024–Present

Specialization in natural language processing and measurement.
Awarded the Stanford HAI RA position: 50% tuition + stipend.
Awarded travel scholarships to ACL 2025 (Vienna) and ASU+GSV 2026 (San Diego).
Awarded the Stanford-Peking Fellowship on Generative AI + Healthcare for a two-week bootcamp in Beijing and Shenzhen.
Graduate Specialist for Outcomes at Stanford's Office for Inclusion, Belonging & Intergroup Communication.

UNESCO MGIEP · Research & Data Science Officer

Delhi, India

Jul 2020–Sep 2024

Led evaluation of 5+ quasi-experimental educational interventions across multi-country sites: data collection protocols and analysis plans to inform policymakers in UNESCO member states.
Designed and validated English and Hindi scales for social-emotional learning in students and teachers using psychometric analysis (factor analysis, IRT, measurement invariance, reliability and validity); all published in peer-reviewed journals.
Managed cross-site data collection with implementation partners, ensuring data quality, ethical compliance, and standardized research procedures.
Published six peer-reviewed publications; rated Exceeding Expectations in 2021 and 2023.

Alliance Française de Delhi · DELF B1

Delhi, India

2019

Obtained the Diplôme d'Études en Langue Française (B1), the official certificate of intermediate proficiency in French (mostly self-taught).

United Nations Volunteers · Infographics Volunteer

Remote

2018–2019

Transformed statistics-dense reports into infographics for UNICEF Nepal, UNV DRC, OECD, and FAO Rome.

Delhi Technological University (DTU) · B.Tech, Electrical & Electronics Engineering

Delhi, India

2016–2020