ML Engineer & AI Researcher specializing in hardware acceleration, ML infrastructure, RAG systems, and computer vision with deep interests in Linear Algebra and Quantum Mechanics.
I'm a Machine Learning Engineer and AI Researcher currently pursuing my Master's in ML & Data Science at UC San Diego. I specialize in hardware acceleration, ML infrastructure, RAG systems, and computer vision with deep interests in Linear Algebra and Quantum Mechanics.
At the Causality Lab under Prof. Biwei Hwang, I'm developing GPU-accelerated ML frameworks that achieve 2.5x speedup through optimized CUDA kernels and multi-GPU parallelization. My work bridges the gap between theoretical ML and high-performance computing.
I've delivered production ML solutions at Parabole.ai, Dell Technologies, and Tata Communications, focusing on inference optimization, distributed systems, and vector database integration for enterprise RAG applications.
I just don't only code :D In my free time I like to play tennis, drumming, wildlife & nature photography!
When I'm not optimizing ML algorithms, you'll find me exploring the world, capturing moments, and staying active
Technologies and frameworks I use to build intelligent systems
Innovative ML solutions that push the boundaries of what's possible
UC San Diego - Causality Lab
Built high-performance ML framework with custom CUDA kernels and multi-GPU parallelization. Optimized inference pipelines achieving 2.5x speedup over CPU-based implementations.
UC San Diego - Personal Project
Built a dynamic resource manager using CUDA + NCCL for multi-GPU load balancing with real-time memory allocation monitoring and Vulkan API visualization.
UC San Diego - Kaggle Competition
Anomaly detection system using Isolation Forests & Autoencoders with RBM integration for complex transactional fraud patterns detection.
Springer Publication - Networks & Systems
Published research on CNN-based art classification across historical periods. Advanced feature extraction techniques for style recognition in Baroque, Renaissance, and Impressionism paintings.
Parabole.ai - Production System
Built and optimized RAG-based platform with vector databases for enterprise knowledge retrieval. Implemented custom embeddings and semantic search with 30% performance improvement.
Techstars Hackathon - Winning Project
Investment portfolio optimizer using Modern Portfolio Theory and Black-Litterman model with scenario analysis and stress testing capabilities.
My journey through top-tier companies and research institutions
UC San Diego - Causality Lab
Developing GPU-accelerated ML frameworks with custom CUDA kernels and multi-GPU parallelization. Built high-performance inference pipelines achieving 2.5x speedup through hardware optimization and distributed computing.
Parabole.ai
Built enterprise RAG platform with vector databases and semantic search. Optimized embedding generation and retrieval systems, achieving 30% performance improvement through custom indexing and query optimization.
Dell Technologies
Built automated test suites reducing manual efforts by 25% and developed deployment automation with Python & Terraform, cutting deployment time by 50%.
Tata Communications
Created analytics models for project evaluation in JIRA using ETL pipelines with PostgreSQL and Tableau. Built real-time data pipeline with Apache Kafka for telecom operations.
Contributing to the advancement of ML and computer vision through peer-reviewed research
We developed a CNN-based image classification model to predict the genre of 8,500 digital paintings, achieving 60% accuracy and surpassing previous benchmarks. The research improved feature extraction techniques for style recognition in Baroque, Renaissance, and Impressionism paintings through advanced deep learning architectures.
This work presents novel CUDA kernel optimizations for distributed ML inference, achieving 2.5x speedup through multi-GPU parallelization. We introduce custom memory management strategies and stream-based asynchronous processing for production-scale model serving.
I'm always excited to collaborate on innovative ML projects, research opportunities, or discuss how AI can solve complex problems. Let's connect!
Available for full-time opportunities starting June 2025