Distributional Learning of Context-Free Languages under Fixed Finite-Monoid Typing

Abstract

We study distributional learning of context-free languages under a fixed recognizable congruence h given as the kernel of an explicit finite monoid homomorphism h:* M. For this fixed-h setting, we develop a finite typed reconstruction theory for context-free h-substitutable languages. Starting from a reduced context-free grammar, we introduce a typed refinement that records both yield types and outer context types, show that the relevant structure is concentrated in a finite typed reconstruction basis, and prove that this basis is exposed by a finite observation set. Occurrences of the same nonterminal symbol may therefore have to be separated when their outer h-contexts differ. We then prove exact reconstruction from positive data. From any finite sample K⊂eq*, we construct a canonical hypothesis grammar G(K), and we show that once K contains the finite observation set associated with the target typed grammar, G(K) generates the target language exactly. Consequently, for every explicit finite monoid homomorphism h, the class Chcf of context-free h-substitutable languages is identifiable in the limit from positive data, with polynomial-time hypothesis construction and update. For the linear subclass Chlin, we further prove polynomial upper bounds on characteristic-sample size and word length. Thus the same learner gives a full polynomial time-and-data result for the linear subclass.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…