A synthetic dataset of dialogs that we authored and annotated for referring expressions (pronouns, etc.). The dataset is described in the paper "MuDoCo: Corpus for Multidomain Coreference Resolution and Referring Expression Generation", which appeared at LREC 2020.