Combining pattern-based CRFs and weighted context-free grammars
Abstract
We consider two models for the sequence labeling (tagging) problem. The first one is a Pattern-Based Conditional Random Field (), in which the energy of a string (chain labeling) x=x1… xn∈ Dn is a sum of terms over intervals [i,j] where each term is non-zero only if the substring xi… xj equals a prespecified word w∈ . The second model is a Weighted Context-Free Grammar () frequently used for natural language processing. and encode local and non-local interactions respectively, and thus can be viewed as complementary. We propose a Grammatical Pattern-Based CRF model () that combines the two in a natural way. We argue that it has certain advantages over existing approaches such as the Hybrid model of Bened\'i and Sanchez that combines N-grams and . The focus of this paper is to analyze the complexity of inference tasks in a such as computing MAP. We present a polynomial-time algorithm for general and a faster version for a special case that we call Interaction Grammars.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.