Software Developer with strong analytical thinking and the ability to quickly understand existing code and software systems. Demonstrates high self-discipline, technical precision, and a structured, responsible approach to problem-solving and development.
Advanced multi-stage RAG system for source-grounded answers

Mentored by: Mobileye
Data Science Bootcamp 2025 (Data)
Responsibilities:
LLM Model Selection: Evaluated and compared multiple LLMs with a focus on local inference constraints and performance trade-offs. Selected Gemma3-4B as the core model for RAG interaction and response generation.
LLM Communication Microservice: Built a Python microservice for efficient LLM interaction and model loading using HuggingFace Transformers.
Resource Optimization and Efficient Inference: Evaluated multiple quantization approaches and chose int8 to balance performance, memory efficiency, and response quality.
Retrieval Ranking Optimization: Used a ranker model to focus retrieved documents, improving answer accuracy and reducing memory usage significantly. Enhanced alignment between the retrieval stage and LLM-based response generation.
Design and implementation of a confidence for LLM responses: Researched multiple methods and applied a self-confidence approach to produce a weighted confidence score alongside answers.
Model deployment and resource optimization Evaluated and configured models for efficient local execution, balancing CPU/GPU usage, memory, and inference performance.
Research on knowledge distillation in neural networks: Reviewed and analyzed the paper Distilling the Knowledge in a Neural Network, presented insights to the team, and explored implementation ideas within the system.
User-specific functionality and UI enhancements: Implemented per-user features, including file upload capabilities for the RAG system, and adapted the user interface accordingly.

02–03/2025: Commercial Business Management System
Developed a full-stack web application for managing a balloon sales business using React and Node.js.
Implemented product browsing, shopping cart functionality, secure JWT authentication, MongoDB integration, and an admin panel.
Project link:
https://github.com/ruth925yi/Balloon
Fluent