Resume
General Information
| Name | Yuheng (Aaron) Ding |
| ading2@andrew.cmu.edu | |
| Phone | 706-238-1952 |
| GitHub | https://github.com/AaXDing |
| https://linkedin.com/in/yuheng-ding | |
| Summary | Master in NLP and AI at Carnegie Mellon University School of Computer Science, with experience in software engineering and AI research. |
Education
-
2025.8 - 2026.12 Master of Science in Intelligent Information Systems
Carnegie Mellon University - Focus: Machine Learning, Natural Language Processing
-
2021.9 - 2025.6 Bachelor of Science in Computer Science and Applied Mathematics
University of California, Los Angeles - GPA: 3.88/4.0
- Relevant Coursework: Algorithms and Complexity, Computer Networks, Machine Learning, Database Systems, Data Structures, Computer Organization, Deep Learning, Computer Vision, Natural Language Processing, Graph Theory, Linear Algebra
Work Experience
-
2023.7 - 2023.10 Full-Stack Software Engineer Intern
- Collaborated with a team of 4 engineers to architect a full-stack e-commerce platform, with a focus on promoting a healthy lifestyle and fostering environmental sustainability
- Engineered a product page UI template using Python Django framework for backend and MongoDB for data persistence; streamlined automatic page creation from database; enhanced scalability and amplified product creation efficiency by 400% over manual page inputting
- Improved user information gathering, payment workflow with Stripe API, and email order confirmations; achieved a 30% reduction in checkout process response time
- Collaborated with a colleague to enhance online consulting session reservation system, integrating voice and video chat functionalities by using WebSocket and WebRTC technologies, resulting in 60% decrease in reservation confirmation time
Research Experience
-
2024.6 - 2025.3 Undergraduate Researcher
UCLA NLP, UCLA PLUS Lab (Advisor: Prof. Kai-Wei Chang, Prof. Nanyun Peng) - Developed a comprehensive benchmark for evaluating visual language models (VLMs) on perception and reasoning quality in QA tasks
- Developed an intuitive annotation tool that improved the annotator experience, enabling efficient curation of over 1,600 fine-grained annotated samples
- Tested on over 25 VLMs to assess self-critique and correction abilities
- Introduced a novel evaluation metric, namely VISCore, which incorporated answer accuracy, step-wise critique accuracy, and critique quality
- Improved critique accuracy by 8% through agent-based critique generation
-
2023.7 - 2023.9 Research Intern
UCLA ARNI Lab (Advisor: Prof. Christina Fragouli) - Refined a graph theory solution to minimize tests and time used in identifying infection within a community network using Python
- Developed data pre-generation script using Python, resulting in a 70% reduction in test time for a population size of 20k
- Improved testing script with Jupyter and JSON that enabled test creation and logging with 20% overhead reduction
- Originated a community-aware diagonal splitting algorithm using a divide-and-conquer strategy which factored in community structure; the new algorithm needed 10% less tests compared to 5 other commonly-used algorithms
Publications
-
2025.02 VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning
Projects
-
2025.10 - 2025.12 Ego2Allo - Toward Perspective-Taking Visual Reasoning
- Fine-tuned VLMs for spatial perspective-taking tasks with SFT and RL-GRPO, achieving 10% higher accuracy.
- Incorporated ReAct-based function calling to abstract perspective change agentic pipeline emphasizing structured reasoning, improving accuracy by 3%.
-
2025.10 - 2025.11 Pittsburgh & CMU QA System
- Collaborate in a team of 3 in building an end-to-end RAG QA system with multiple chunking granularities.
- Created a Small-to-Big hybrid retriever that compute weighted fusion of BM25 score on sentence-level chunks and MiniLM-L6-v2 dense cosine similarity on paragraph-level chunks, achieving 75% accuracy under LLM evaluation.
-
2025.04 - 2025.06 High Performance URL Shortening Web Server
- Led a team of 4 in building a low-cost, high-performance async TCP server in C++ with Boost.Asio following test-driven development; authored a system design document and technical documentation to guide product architecture.
- Designed application architecture design for a scalable URL shortening service with Redis caching and PostgreSQL persistence, leveraging connection pooling to support 1,000+ concurrent users.
- Deployed service on Google Cloud Platform using Docker and Compute Engine; automated CI/CD with Cloud Build, to ensure zero-downtime deployments and automatic recovery through automated testing and health checks.
-
2023.10 - 2024.01 Job Search Platform
- Constructed a job search platform that personalized job recommendations merely based on user views and likes
- Incorporated Search, Favorite, and Recommendation functionalities in Java Servlets with REST APIs
- Implemented frontend UI components using JavaScript, HTML, CSS, and Ajax
- Designed AI-based keyword extraction algorithms and history-based recommendation engine
- Configured and deployed on AWS EC2 virtual machine, combining Redis and Amazon RDS; scaled to 10k QPS and reduced latency by 90%
-
2024.02 - 2024.03 DiffEdit: Text-Guided Image Editing with Diffusion
- Image editing via DiffEdit + BLIP with Stable Diffusion
- Reduced prompt restrictions and improved creation efficiency
-
2024.01 - 2024.03 LLM Evaluation
- Led a team of 4 in researching prompting techniques, enhancing the factual accuracy and fairness assessment of LLMs
- Developed a zero-shot evaluation framework using Microsoft Phi-2 and Mistral-7B, incorporating GPT-generated evidence and chain-of-thought reasoning
- Achieved a 76% accuracy rate in assessment tasks
-
2022.10 - 2022.12 Bruin-O-Bruin Web App
- Coordinated with a team of 5 to build a full-stack elimination web game using Node.js, React framework, and SQLite
- Led implementation of core features including game randomization, discussion forum, and scoreboard
- Launched account management features with password protection and reset functionality
- Deployed on Azure VM and achieved over 50% class participation rate
Skills
| Programming Languages & Frameworks | Java, Spring Boot, Tomcat, Python, Django, PyTorch, NumPy, C++, JavaScript, React, SQL, Bash, HTML, CSS |
| Databases & Data | MySQL, PostgreSQL, SQLite, MongoDB, Redis, Spark |
| DevOps & Cloud | AWS, Azure, Docker, Kubernetes, Linux, Git |
Languages
| English | Native |
| Mandarin | Native |
| Spanish | Intermediate |