Andy Fan Blog

Andy Fan

AI Platform Engineer / RAG & Agentic AI Systems / Multi-Tenant SaaS / .NET Full Stack / Google Generative AI Leader / NVIDIA AI / CKA

Recent Posts

Training an Embedding Encoder for Medical Data

A practical guide to fine-tuning embedding models for domain-specific medical data retrieval.

RAG Through the Encoder–Decoder Lens

Why fine-tuning both encoder and decoder leads to truly grounded retrieval-augmented generation systems.

Understanding QLoRA (Quantized Low-Rank Adaptation)

Learn how QLoRA combines 4-bit quantization with LoRA adapters to enable fine-tuning of 65B-parameter models on a single 48 GB GPU.

Understanding LoRA (Low-Rank Adaptation)

A hands-on guide to LoRA, the low-rank fine-tuning method for adapting LLMs efficiently by training only small adapter matrices.

Hello World — Welcome to Andy Fan Blog

👋 Hi, I’m Andy Fan, an AI engineer and full-stack developer passionate about bridging enterprise-grade backend engineering with modern AI systems.