Los puntos clave no están disponibles para este artículo en este momento.
It is often claimed that Named Entity recognition systems need extensive gazetteers---lists of names of people, organisations, locations, and other named entities. Indeed, the compilation of such gazetteers is sometimes mentioned as a bottleneck in the design of Named Entity recognition systems.We report on a Named Entity recognition system which combines rule-based grammars with statistical (maximum entropy) models. We report on the system's performance with gazetteers of different types and different sizes, using test material from the MUC-7 competition. We show that, for the text type and task of this competition, it is sufficient to use relatively small gazetteers of well-known names, rather than large gazetteers of low-frequency names. We conclude with observations about the domain independence of the competition and of our experiments.
Building similarity graph...
Analyzing shared references across papers
Loading...
Andrei Mikheev
Marc Moens
Claire Grover
University of Edinburgh
Building similarity graph...
Analyzing shared references across papers
Loading...
Mikheev et al. (Fri,) studied this question.
www.synapsesocial.com/papers/69dff774b28b234044e9c14a — DOI: https://doi.org/10.3115/977035.977037