Reinforcement Learning Enhanced Greedy Decoding for Quantum Stabilizer Codes over Fq
Abstract
We construct new classical Goppa codes and corresponding quantum stabilizer codes from plane curves defined by separated polynomials. In particular, over F3 with the Hermitian curve y3 + y = x4, we obtain a ternary code of length 27, dimension 13, distance 4, which yields a [[27, 13, 4]]3 quantum code. To decode, we introduce an RL-on-Greedy algorithm: first apply a standard greedy syndrome decoder, then use a trained Deep Q-Network to correct any residual syndrome. Simulation under a depolarizing noise model shows that RL-on-Greedy dramatically reduces logical failure compared to greedy alone. Our work thus broadens the class of Goppa- and quantum-stabilizer codes from separated-polynomial curves and delivers a learned decoder with near-optimal performance.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.