kNN-NER: Named Entity Recognition with Nearest Neighbor Search
Abstract
Inspired by recent advances in retrieval augmented methods in NLP~khandelwal2019generalization,khandelwal2020nearest,meng2021gnn, in this paper, we introduce a k nearest neighbor NER (kNN-NER) framework, which augments the distribution of entity labels by assigning k nearest neighbors retrieved from the training set. This strategy makes the model more capable of handling long-tail cases, along with better few-shot learning abilities. kNN-NER requires no additional operation during the training phase, and by interpolating k nearest neighbors search into the vanilla NER model, kNN-NER consistently outperforms its vanilla counterparts: we achieve a new state-of-the-art F1-score of 72.03 (+1.25) on the Chinese Weibo dataset and improved results on a variety of widely used NER benchmarks. Additionally, we show that kNN-NER can achieve comparable results to the vanilla NER model with 40\% less amount of training data. Code available at https://github.com/ShannonAI/KNN-NER.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.