Thermodynamics-Informed Accurate pKa Prediction and Protonation State Generation in PlayMolecule AI
Abstract
Accurate prediction of acid dissociation constants (pK a) and the determination of dominant protonation states is critical in drug discovery, influencing molecular properties such as solubility, permeability, and protein-ligand binding. We present AcepK a, an advanced application integrated into the PlayMolecule AI platform. AcepK a is built upon the theoretically rigorous Uni-pK a framework, which unifies statistical mechanics with representation learning. By modeling the complete protonation ensemble rather than treating pKa as a scalar regression target, AcepK a ensures thermodynamic consistency across coupled ionization sites. We describe the application's enhanced architecture, which features a retrained Uni-Mol backbone achieving state-of-the-art performance on standard benchmarks. Furthermore, we detail critical engineering advancements. These include AceConfgen, a proprietary GPU-accelerated conformer generator that achieves a ~40x speed-up compared to NVIDIA's nvmolkit, a streamlined inference engine to directly protonate molecules, and a 3D-aware modality for applying protonation states to bound ligand poses. Finally, we discuss the integration of AcepK a into the PlayMolecule AI ecosystem, a modern AI-assisted environment for molecular modelling and drug discovery.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.