Hi, I'm Chinmay
AI/ML Software Engineer
I build scalable machine learning backends, generate text-to-SQL logic with LLMs, and deploy robust data pipelines. Currently pursuing my MSCS at USC.
import ai_models
from backend import scalable_api
model = ai_models.LLM()
scalable_api.deploy(model)
Work Experience
My professional journey in AI and Software Engineering.
USC
Private Research University
Jan 2026 - May 2026
Los Angeles, CA
Course Grader - ML
-
Evaluated coursework for a graduate-level Machine Learning course, critiquing theoretical models and implementations for a cohort of 212 students.
Python PyTorch
TWG Global AI
Holding Company
Jun 2025 - Aug 2025
Santa Monica, CA
AI ML Scientist Intern
-
Developed a React-based monitoring dashboard to visualize insurance cohort metrics and product trends, providing Operations Leadership with data-driven baselines for strategic planning.
React TypeScript
-
Built a bond return forecasting model in Azure ML Studio using LightGBM trained on time-varying latent representations, validated via 18-year rolling-window backtesting to ensure financial robustness.
Azure ML Studio LightGBM
AcadAlly
AI EdTech Platform
Feb 2024 - Jun 2024
Delhi, India
Software Engineer - Machine Learning
-
Automated exam generation via a Human-in-the-Loop GenAI pipeline using few-shot prompting, reducing manual content creation time by 40% while maintaining high academic quality.
Azure OpenAI Few-shot Prompting LangChain
-
Engineered scalable backend services using FastAPI and MongoDB to manage syllabi and real-time evaluations, utilizing Pytest to ensure system reliability for 15,000 active students.
FastAPI MongoDB Git GitHub Azure VM Pytest
-
Modeled a K-12 knowledge graph in Azure Neo4j mapping hierarchical topic-to-question dependencies, powering an adaptive diagnostic engine for 18 courses.
Neo4j Cypher
PeopleStrong
Enterprise HR Tech
Jan 2022 - Feb 2024
Gurgaon, India
Software Development Engineer
-
Built a Text-to-SQL analytics system for executives by curating a 14,000-sample training dataset and integrating a fine-tuned LLM to automate SQL procedure generation.
Llama 13B QLoRA PyTorch Gradio TGI
-
Engineered a distributed data ingestion pipeline using AWS Lambda, ECR, and DynamoDB to scrape 129,000 records, automating dataset curation for ML classifiers.
AWS (Lambda, DynamoDB, ECR) Python Docker
-
Constructed an incremental ETL pipeline to sync 1.3M applicant profiles from MS SQL Server to Elasticsearch using CDC, reducing search latency from seconds to milliseconds.
MS SQL Server Elasticsearch ETL Pipeline Python
-
Designed a knowledge graph comprising 26K+ job roles and 100K+ skill edges in Neo4j with Elasticsearch semantic resolution, powering production job recommendation tools.
Neo4j Elasticsearch
-
Developed an employee attrition prediction model using XGBoost, achieving a 0.82 weighted F1-score and deploying the system to production for HR risk assessment.
XGBoost Python Scikit-Learn FastAPI
Education
Academic background and institutional achievements.
University of Southern California
MS in Computer Science
Aug 2024 - May 2026 • GPA: 4.0/4.0
Key Coursework: Machine Learning, Artificial Intelligence, NLP, Information Retrieval, Distributed Databases, Operating Systems.
Manipal University Jaipur
B.Tech in Computer Science & Engineering
Jul 2018 - Jul 2022 • GPA: 9.49/10.0
Key Coursework: Advanced Data Structures, Relational Databases, Operating Systems, Big Data, Computer Networks.
Featured Projects
Multi-Tenant RAG
A robust retrieval-augmented generation backend utilizing FastAPI, LlamaIndex, and AWS S3 with pgvector for scalable vector search across isolated tenant namespaces.
Weenix OS Kernel
Implementation of a UNIX-like operating system kernel. Developed core subsystems including virtual memory management, multi-process scheduling, and a virtual file system (VFS).