Analytical thinker with strong attention to detail and excellent organizational skills. Excels in teamwork, takes full ownership of tasks, and demonstrates a high level of commitment to their success. Performs well under pressure, shows initiative when needed, and delivers high-quality results with dedication and persistence. Eager to grow professionally and contribute to impactful and challenging projects.
AI-powered optimization engine for next-generation computing
Mentored by: Next Silicon
Embedded Systems Bootcamp 2025 (Embedded)
Responsibilities:
Implemented SYCL kernels, connected them to the GGML operator system, and integrated them into the llama.cpp graph builder, including validation on real GPU hardware.
Performed kernel-level GPU optimizations, executed controlled Before/After benchmarks, and achieved significant speedups, including a ×4 performance improvement for ARANGE.
Handled maintainer reviews, refined code design, updated documentation, ensured CI stability, and successfully merged both operators into ggml / llama.cpp.
Developed a mechanism that reduces Attention complexity from O(n²) to O(n·k) by dynamically selecting the Top-K most relevant tokens per query.
Implemented buildsparsekmask, generating the mask using existing GGML operators only, without modifying model weights or model architecture.
Adjusted the design so Sparse-K is computed inside Flash Attention during graph build, following reviewer guidelines for clean, backend-consistent integration.
Ensured the Sparse-K mask is automatically applied in every Attention layer, with no additional per-layer code required.
Ran full Prompt Evaluation and Decode benchmarks, performed profiling and comparisons, and validated a 2.3× speedup with no accuracy degradation.
Since Sparse-K relies exclusively on existing GGML operators, any supported backend (such as CUDA) can execute it naturally. Performed HPC runs to confirm correctness and efficiency.
Downloaded models, performed GGUF conversion and quantization, and added Sparse-K metadata fields so all Sparse-K settings are loaded directly from the model, without environment variables.

Photo Printing Management System:
• Developed a photo-printing management system using .NET Core, MVC, Entity Framework, and SQL Server.
• Implemented UI, order processing, and image handling based on clean architecture principles.
• Integrated PDF invoice emailing using iText, GemBox, and Gmail SMTP, exceeding project requirements.
• Participated in daily stand-ups, sprint planning, and code reviews.
Online Store Project:
• Development of a responsive e-commerce web application using React, Redux, and React Router,
including dynamic navigation, shopping cart, and checkout workflows.
• Built a modern UI with MUI and Bootstrap, and integrated additional libraries such as jsPDF, along with
features that extended beyond the original project requirements.
• The project was developed without AI code-generation tools, with all implementation written manually.
• Deployed on Vercel as an independent initiative to enhance the project and improve the user experience.
Fluent