gui
gui

The Berkeley Natural Language Processing Group

gui
gui

Berkeley Coreference Resolution System

Overview

The Berkeley Coreference Resolution System is a state-of-the-art English coreference system described in the following papers:

Easy Victories and Uphill Battles in Coreference Resolution [PDF], [BibTeX]
Greg Durrett and Dan Klein.
EMNLP 2013.

Decentralized Entity-Level Modeling for Coreference Resolution [PDF], [BibTeX]
Greg Durrett, David Hall, and Dan Klein.
ACL 2013.

It takes as input text with annotations in the CoNLL format, then detects and resolves mentions in that text. The system is bundled with a preprocessor that can take raw text input, split it by sentences and tokens, and produce the necessary CoNLL annotation layers: POS tags, syntactic parses, and named entity chunks.

Downloads

The README contains more information. The system is licensed under the GPLv3.

Download the system here (16MB tgz). The source code is mostly Scala, but the download includes a pre-built runnable .jar file that can be run with a standard JRE.

Download the models here (~300MB tgz). This package includes pre-trained models for both preprocessing (sentence splitting, parsing, and NER) and coreference (SURFACE and FINAL models from the paper), with different coreference models for the CoNLL data and for running on raw text.

Old Versions

Version 0.9: code, models

Site designed by Jonathan K. Kummerfeld