Thread by @srush_nlp, Thread: Last week, a list of 100 important NLP papers (https://github.com/mhagiwara/100-nlp-papers) went [...]

Thread: Last week, a list of 100 important NLP papers ( https://github.com/mhagiwara/100-nlp-papers)">https://github.com/mhagiwara... went viral. The list is okay, but it has almost *no* papers with female first authors.

NLP is rich with amazing female researchers and mentors. Here is one paper I like for each area on the list:

Area: Discourse

Modeling Local Coherence: An Entity-Based Approach

Regina Barzilay and Mirella Lapata

https://people.csail.mit.edu/regina/my_papers/coherence.pdf">https://people.csail.mit.edu/regina/my...

Area: Topic Models

Topic Modeling: Beyond Bag-of-Words

Hanna M. Wallach

http://dirichlet.net/pdf/wallach06topic.pdf">https://dirichlet.net/pdf/walla...

Area: Text Classification + Machine Learning

Thumbs up? Sentiment classification using machine learning techniques

Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan

https://www.aclweb.org/anthology/W02-1011/">https://www.aclweb.org/anthology...

Area: Automatic Text Summarization

(Many good options! But I personally like the one of the list)

Get To The Point: Summarization with Pointer-Generator Networks

Abigail See et al.

https://www.aclweb.org/anthology/P17-1099/">https://www.aclweb.org/anthology...

Area: Parsing and Syntax

CCGbank: A Corpus of CCG Derivations and Dependency Structures Extracted from the Penn Treebank

Julia Hockenmaier, Mark Steedman

https://www.aclweb.org/anthology/J07-3004/">https://www.aclweb.org/anthology...

Area: Segmentation and Tagging

(A million good options, but this one was super impactful)

Feature-rich part-of-speech tagging with a cyclic dependency network

Kristina Toutanova et al

Area: Question Answering and Machine Comprehension

(I took this to mean the modern deep-learning centric form)

A thorough examination of the CNN/Daily Mail reading comprehension task

Danqi Chen et al

https://arxiv.org/abs/1606.02858 ">https://arxiv.org/abs/1606....

Area: Sequential Labeling & Information Extraction

Empirical Methods in Information Extraction

Claire Cardie

https://aaai.org/ojs/index.php/aimagazine/article/view/1322/1223">https://aaai.org/ojs/index...

Area: Machine Translation

(Many to pick from, I like how this one foreshadows some unsupervised MT)

A Statistical View on Bilingual Lexicon Extraction: From Parallel Corpora to Non-Parallel Corpora

Pascale Fung https://link.springer.com/chapter/10.1007/3-540-49478-2_1">https://link.springer.com/chapter/1...

A Statistical View on Bilingual Lexicon Extraction: From Parallel Corp

We present two problems for statistically extracting bilingual lexicon: (1) How can noisy parallel corpora be used? (2) How can non-parallel yet comparable corpora be used? We describe our own work...

https://link.springer.com/chapter/10.1007/3-540-49478-2_1

Area: Coreference Resolution

(This is a bit niche, but I really found this work helpful when I worked in this area.)

The life and death of discourse entities: Identifying singleton mentions

M Recasens, MC de Marneffe, C Potts https://www.aclweb.org/anthology/N13-1071/">https://www.aclweb.org/anthology...

The Life and Death of Discourse Entities: Identifying Singleton Mentions

Marta Recasens, Marie-Catherine de Marneffe, Christopher Potts. Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language...

https://www.aclweb.org/anthology/N13-1071/

Area: Clustering and Word/Sentence Embeddings

(The paper from the list seems like a good choice.)

Skip-Thought Vectors

Jamie Kiros et al

https://arxiv.org/abs/1506.06726 ">https://arxiv.org/abs/1506....

Area: Generation

(Generation has a ton of papers to pick from. Here& #39;s a classic)

Discourse strategies for generating natural-language text

Kathleen R.McKeown

https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.100.8521&rep=rep1&type=pdf">https://citeseerx.ist.psu.edu/viewdoc/d...

Area: Language Modeling

(I couldn& #39;t find a classic here

https://abs.twimg.com/emoji/v2/... draggable="false" alt="😞" title="Enttäuschtes Gesicht" aria-label="Emoji: Enttäuschtes Gesicht"> Here& #39;s a recent interesting one that I would love to see on transformers...)

Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context

Urvashi Khandelwal, He He, Peng Qi, Dan Jurafsky

Finally, read Yejin Choi. I tried to add one of her papers, but her work really transcends these categories.

Latest Threads Unrolled: