Journal of Data Science,
Statistics, and Visualisation

StatLLaMA: Multi-stage training for domain-optimized statistical large language models

Authors

DOI:

https://doi.org/10.52933/jdssv.v6i4.171

Keywords:

Foundation model, supervised fine-tuning, continual pretraining, instruction tuning, reinforcement learning from human feedback.

Abstract

This study investigates how to efficiently build a domain-specialized large language model (LLM) for statistics using the lightweight LLaMA-3.2-3B family as the foundation model (FM). We systematically compare three multi-stage training pipelines—starting from a base FM with no instruction-following capability, a base FM augmented with post-hoc instruction tuning, and an instruction-tuned FM with strong general reasoning abilities—across continual pretraining, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF) preference alignment. Results show that pipelines beginning with a base FM fail to develop meaningful statistical reasoning, even after extensive instruction tuning, SFT, or RLHF alignment. In contrast, starting from LLaMA-3.2-3B-Instruct enables effective domain specialization. A comprehensive evaluation of SFT variants reveals clear trade-offs between domain expertise and general reasoning ability. We further demonstrate that direct preference optimization provides stable and effective RLHF preference alignment. The final model, StatLLaMA, achieves strong and balanced performance on benchmarks of mathematical reasoning, common-sense reasoning, and statistical expertise, offering a practical blueprint for developing resource-efficient statistical LLMs. The code is available at https://github.com/HuangDLab/StatLLaMA.

Downloads

Published

2026-06-09

How to Cite

Zeng, J.-Y., & Huang, G.-H. (2026). StatLLaMA: Multi-stage training for domain-optimized statistical large language models. Journal of Data Science, Statistics, and Visualisation, 6(4). https://doi.org/10.52933/jdssv.v6i4.171
Journal of Data Science,
Statistics, and Visualisation
Pages