ARCHIVES

Review Article

A Comprehensive Survey of LLM Fine-Tuning: From Foundations to Frontier Techniques

Milind k.Patil1
1 Syncaissa Systems Inc. USA.

Published Online: March-April 2026

Pages: 38-49

Abstract

This paper presents a comprehensive technical survey of large language model (LLM) fine-tuning, spanning the complete methodological landscape from foundational techniques to frontier advances as of early 2026. We organize the field along two orthogonal axes: the training objective—what the model learns (SFT, DPO, RLHF, GRPO, ORPO, SimPO, and KTO)—and the parameter-update strategy—how weights are modified (Full Fine-Tuning, LoRA, QLoRA, DoRA, GaLore, Spectrum). We trace the theoretical evolution from classical RLHF with its three-model PPO pipeline, through the DPO reparameterization that collapsed preference learning into a single supervised objective, to the reasoning-focused GRPO/RLVR paradigm that enabled DeepSeek-R1 to achieve 71.0% Pass@1 on AIME 2024 through emergent reasoning without supervised reasoning traces. The paper further provides rigorous treatment of model merging techniques (TIES, DARE, and SLERP) that compose capabilities in weight space without gradient computation, knowledge distillation methods including cross-tokenizer and comparative approaches, Mixture-of-Experts fine-tuning with sparse routing, multimodal adaptation of vision-language models, and the modern framework ecosystem. We present formal loss functions, convergence properties, and computational complexity analysis for each method, accompanied by empirical benchmark comparisons. The survey concludes with a unified taxonomy and actionable guidance for selecting technique combinations based on compute budget, data availability, and task requirements.

Related Articles

2026

Fake Currency Detection Using Deep Learning

2026

Smart E-Commerce System with Dynamic Pricing

2026

Personal Expense Tracker with Currency Converter

2026

Paw Safe: An Extensive Technology-Driven Framework for Stray Dog Rescue, Healthcare Management, Community Engagement, and Smart Urban Governance

2026

Design and Development of a Full-Stack E-Commerce Website

2026

Power quality improvement techniques from a topological perspective: An overview

Share Article

X
LinkedIn
Facebook
WhatsApp

Or copy link

https://test.ijsreat.com/archives/10.59256/ijsreat.20260602006

*Instagram doesn't support direct link sharing from web. Copy the link and share it in your Instagram story or post.