Large language model for unified and accurate description of multidimensional nuclear properties

Abstract

A prior-informed large language model (LLM) driven multi-task learning framework is proposed for the unified description of multiple nuclear observables. By fine-tuning the pre-trained DeepSeek-R1-1.5B model with Low-Rank Adaptation (LoRA), lightweight adapters are introduced while preserving general pre-trained parameters. Under a causal language modeling paradigm, the model is trained autoregressively on deviations between experimental and theoretical values. Significant accuracy improvements are achieved across seven observables, including charge radii, masses, binding energies, separation energies, and decay energies, with the training loss decreasing by over 98% across all tasks. This demonstrates that the LLM-based framework, through structured prior embedding, offers an efficient and shared approach for multi-task regression in fundamental nuclear properties.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…