ARCHIVES

Review Article

A Comprehensive Survey of LLM Fine-Tuning: From Foundations to Frontier Techniques

Milind k.Patil1

¹ Syncaissa Systems Inc. USA.

Published Online: March-April 2026

Pages: 38-49

Abstract

This paper presents a comprehensive technical survey of large language model (LLM) fine-tuning, spanning the complete methodological landscape from foundational techniques to frontier advances as of early 2026. We organize the field along two orthogonal axes: the training objective—what the model learns (SFT, DPO, RLHF, GRPO, ORPO, SimPO, and KTO)—and the parameter-update strategy—how weights are modified (Full Fine-Tuning, LoRA, QLoRA, DoRA, GaLore, Spectrum). We trace the theoretical evolution from classical RLHF with its three-model PPO pipeline, through the DPO reparameterization that collapsed preference learning into a single supervised objective, to the reasoning-focused GRPO/RLVR paradigm that enabled DeepSeek-R1 to achieve 71.0% Pass@1 on AIME 2024 through emergent reasoning without supervised reasoning traces. The paper further provides rigorous treatment of model merging techniques (TIES, DARE, and SLERP) that compose capabilities in weight space without gradient computation, knowledge distillation methods including cross-tokenizer and comparative approaches, Mixture-of-Experts fine-tuning with sparse routing, multimodal adaptation of vision-language models, and the modern framework ecosystem. We present formal loss functions, convergence properties, and computational complexity analysis for each method, accompanied by empirical benchmark comparisons. The survey concludes with a unified taxonomy and actionable guidance for selecting technique combinations based on compute budget, data availability, and task requirements.

Related Articles

2026

Fake Currency Detection Using Deep Learning

2026

Smart E-Commerce System with Dynamic Pricing

2026

Personal Expense Tracker with Currency Converter

2026

Paw Safe: An Extensive Technology-Driven Framework for Stray Dog Rescue, Healthcare Management, Community Engagement, and Smart Urban Governance

2026

Design and Development of a Full-Stack E-Commerce Website

2026

Power quality improvement techniques from a topological perspective: An overview

2026

The Rust Tax: Measuring the Cost of Memory Safety and Safely Recovering What You Can

2026

Determination of Spectral Source Parameters from Broadband Earthquake Records in Western Anatolia (Türkiye)

2026

Spatial Damage Pattern and Structural Vulnerability Assessment of Moderate Magnitude Earthquakes: The 2017–2019 Ayvacık Case Study, Western Anatolia

2026

Integrated Rainwater Harvesting In a 10.1 Km Urban Elevated Corridor: Hydrological Performance, Urban Climate Resilience and Infrastructure Sustainability Implications