Data-Efficient Machine learning for Predicting Dopant Formation Energies in TiO2 Monolayer

Abstract

Machine learning models are increasingly applied in materials science, yet their predictive power is often constrained by data scarcity. Here, we show that accurate predictions can be achieved, even with a limited number of training examples, provided the dataset is compact and and grounded in physically relevant quantities. By combining density functional theory calculations with a machine-learning framework, we construct accurate descriptor-based models to predict the formation energies of doped lepidocrocite TiO2 monolayers. The predictive accuracy of machine-learning models was first evaluated for single-dopant Pt configurations, demonstrating that the selected structural and chemical descriptors reliably capture the key factors governing dopant stability. Chemical transferability is then examined by extending the dataset to include Ag-doped configurations. Predictive accuracy improved systematically as additional Ag-doped data points were included in the training, while the performance of Pt remains robust. These results highlight the potential of small and well-curated datasets combined with physically informed descriptors to enable not only accurate but also chemically transferable machine-learning-driven screening in doped TiO2 monolayer.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…