Please join us online for Digital Coptic 3 , the virtual workshop for DH project on Coptic!
Would like to have more data to work with? Check our LREC paper , where we present a freely available, genre-balanced English web corpus totaling 4M tokens and featuring a large number of high-quality automatic annotation layers, including dependency trees, non-named entity annotations, coreference resolution, and discourse trees in Rhetorical Structure Theory.
Shabnam and Amir's paper on Reddit part of speech tagging was accepted to WAC-XII .
Thoughts on how to treebank social media? Read our LREC paper