Large Language Models (LLMs) have emerged as a dominant paradigm in natural language processing, demonstrating strong performance across a wide range of generation and reasoning tasks. These systems depend on multi-stage training pipelines that integrate large-scale self-supervised pre-training, supervised fine-tuning, and alignment techniques. This paper presents a systematic mapping study of contemporary LLM training methodologies, emphasizing transformer-based architectures, optimization objectives, and data curation strategies as well as emerging sparse architectures such as Mixture-of-Experts (MoE) models. We analyze parameter-efficient fine-tuning approaches, retrieval-augmented generation frameworks, and multimodal training techniques, which we organize into a unified comparative taxonomy. We discuss key technical challenges such as scalability constraints, hallucination, bias amplification, and alignment–capability tradeoffs, then identify emerging research directions such as reasoning-centric training. This work provides a concise technical reference for researchers and practitioners working on scalable and reliable language model training.
Building similarity graph...
Analyzing shared references across papers
Loading...
Dimitris Karydas
Dimosthenis Margaritis
Helen C. Leligou
Technologies
University of West Attica
Building similarity graph...
Analyzing shared references across papers
Loading...
Karydas et al. (Thu,) studied this question.
www.synapsesocial.com/papers/69994c27873532290d020705 — DOI: https://doi.org/10.3390/technologies14020133
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: