Resume | Yuheng (Aaron) Ding

General Information

Name	Yuheng (Aaron) Ding
Email	ading2@andrew.cmu.edu
Phone	706-238-1952
GitHub	https://github.com/AaXDing
LinkedIn	https://linkedin.com/in/yuheng-ding
Summary	Master in NLP and AI at Carnegie Mellon University School of Computer Science, with experience in software engineering and AI research.

Education

2025.8 - 2026.12
Master of Science in Intelligent Information Systems

Carnegie Mellon University
- Focus: Machine Learning, Natural Language Processing
2021.9 - 2025.6
Bachelor of Science in Computer Science and Applied Mathematics

University of California, Los Angeles
- GPA: 3.88/4.0
- Relevant Coursework: Algorithms and Complexity, Computer Networks, Machine Learning, Database Systems, Data Structures, Computer Organization, Deep Learning, Computer Vision, Natural Language Processing, Graph Theory, Linear Algebra

Work Experience

2023.7 - 2023.10
Full-Stack Software Engineer Intern
- Collaborated with a team of 4 engineers to architect a full-stack e-commerce platform, with a focus on promoting a healthy lifestyle and fostering environmental sustainability
- Engineered a product page UI template using Python Django framework for backend and MongoDB for data persistence; streamlined automatic page creation from database; enhanced scalability and amplified product creation efficiency by 400% over manual page inputting
- Improved user information gathering, payment workflow with Stripe API, and email order confirmations; achieved a 30% reduction in checkout process response time
- Collaborated with a colleague to enhance online consulting session reservation system, integrating voice and video chat functionalities by using WebSocket and WebRTC technologies, resulting in 60% decrease in reservation confirmation time

Research Experience

2024.6 - 2025.3
Undergraduate Researcher

UCLA NLP, UCLA PLUS Lab (Advisor: Prof. Kai-Wei Chang, Prof. Nanyun Peng)
- Developed a comprehensive benchmark for evaluating visual language models (VLMs) on perception and reasoning quality in QA tasks
- Developed an intuitive annotation tool that improved the annotator experience, enabling efficient curation of over 1,600 fine-grained annotated samples
- Tested on over 25 VLMs to assess self-critique and correction abilities
- Introduced a novel evaluation metric, namely VISCore, which incorporated answer accuracy, step-wise critique accuracy, and critique quality
- Improved critique accuracy by 8% through agent-based critique generation
2023.7 - 2023.9
Research Intern

UCLA ARNI Lab (Advisor: Prof. Christina Fragouli)
- Refined a graph theory solution to minimize tests and time used in identifying infection within a community network using Python
- Developed data pre-generation script using Python, resulting in a 70% reduction in test time for a population size of 20k
- Improved testing script with Jupyter and JSON that enabled test creation and logging with 20% overhead reduction
- Originated a community-aware diagonal splitting algorithm using a divide-and-conquer strategy which factored in community structure; the new algorithm needed 10% less tests compared to 5 other commonly-used algorithms

Publications

2025.02

VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning

Projects

2025.10 - 2025.12
Ego2Allo - Toward Perspective-Taking Visual Reasoning
- Fine-tuned VLMs for spatial perspective-taking tasks with SFT and RL-GRPO, achieving 10% higher accuracy.
- Incorporated ReAct-based function calling to abstract perspective change agentic pipeline emphasizing structured reasoning, improving accuracy by 3%.
2025.10 - 2025.11
Pittsburgh & CMU QA System
- Collaborate in a team of 3 in building an end-to-end RAG QA system with multiple chunking granularities.
- Created a Small-to-Big hybrid retriever that compute weighted fusion of BM25 score on sentence-level chunks and MiniLM-L6-v2 dense cosine similarity on paragraph-level chunks, achieving 75% accuracy under LLM evaluation.
2025.04 - 2025.06
High Performance URL Shortening Web Server
- Led a team of 4 in building a low-cost, high-performance async TCP server in C++ with Boost.Asio following test-driven development; authored a system design document and technical documentation to guide product architecture.
- Designed application architecture design for a scalable URL shortening service with Redis caching and PostgreSQL persistence, leveraging connection pooling to support 1,000+ concurrent users.
- Deployed service on Google Cloud Platform using Docker and Compute Engine; automated CI/CD with Cloud Build, to ensure zero-downtime deployments and automatic recovery through automated testing and health checks.
2023.10 - 2024.01
Job Search Platform
- Constructed a job search platform that personalized job recommendations merely based on user views and likes
- Incorporated Search, Favorite, and Recommendation functionalities in Java Servlets with REST APIs
- Implemented frontend UI components using JavaScript, HTML, CSS, and Ajax
- Designed AI-based keyword extraction algorithms and history-based recommendation engine
- Configured and deployed on AWS EC2 virtual machine, combining Redis and Amazon RDS; scaled to 10k QPS and reduced latency by 90%
2024.02 - 2024.03
DiffEdit: Text-Guided Image Editing with Diffusion
- Image editing via DiffEdit + BLIP with Stable Diffusion
- Reduced prompt restrictions and improved creation efficiency
2024.01 - 2024.03
LLM Evaluation
- Led a team of 4 in researching prompting techniques, enhancing the factual accuracy and fairness assessment of LLMs
- Developed a zero-shot evaluation framework using Microsoft Phi-2 and Mistral-7B, incorporating GPT-generated evidence and chain-of-thought reasoning
- Achieved a 76% accuracy rate in assessment tasks
2022.10 - 2022.12
Bruin-O-Bruin Web App
- Coordinated with a team of 5 to build a full-stack elimination web game using Node.js, React framework, and SQLite
- Led implementation of core features including game randomization, discussion forum, and scoreboard
- Launched account management features with password protection and reset functionality
- Deployed on Azure VM and achieved over 50% class participation rate

Skills

Programming Languages & Frameworks	Java, Spring Boot, Tomcat, Python, Django, PyTorch, NumPy, C++, JavaScript, React, SQL, Bash, HTML, CSS
Databases & Data	MySQL, PostgreSQL, SQLite, MongoDB, Redis, Spark
DevOps & Cloud	AWS, Azure, Docker, Kubernetes, Linux, Git

Languages

English	Native
Mandarin	Native
Spanish	Intermediate

General Information

Education

Master of Science in Intelligent Information Systems

Bachelor of Science in Computer Science and Applied Mathematics

Work Experience

Full-Stack Software Engineer Intern

Research Experience

Undergraduate Researcher

Research Intern

Publications

VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning

Projects

Ego2Allo - Toward Perspective-Taking Visual Reasoning

Pittsburgh & CMU QA System

High Performance URL Shortening Web Server

Job Search Platform

DiffEdit: Text-Guided Image Editing with Diffusion

LLM Evaluation

Bruin-O-Bruin Web App

Skills

Languages