Training an Embedding Encoder for Medical Data
A practical guide to fine-tuning embedding models for domain-specific medical data retrieval.
A practical guide to fine-tuning embedding models for domain-specific medical data retrieval.
Why fine-tuning both encoder and decoder leads to truly grounded retrieval-augmented generation systems.
Learn how QLoRA combines 4-bit quantization with LoRA adapters to enable fine-tuning of 65B models on a single GPU.
A hands-on guide to LoRA — the low-rank fine-tuning method that revolutionizes efficient LLM adaptation.
👋 Hi, I’m Andy Fan — AI Engineer and Full-Stack Developer passionate about bridging enterprise-grade backend engineering with modern AI systems.