Thread: Last week, a list of 100 important NLP papers ( https://github.com/mhagiwara/100-nlp-papers) went viral. The list is okay, but it has almost *no* papers with female first authors.
NLP is rich with amazing female researchers and mentors. Here is one paper I like for each area on the list:
NLP is rich with amazing female researchers and mentors. Here is one paper I like for each area on the list:
Area: Discourse
Modeling Local Coherence: An Entity-Based Approach
Regina Barzilay and Mirella Lapata
https://people.csail.mit.edu/regina/my_papers/coherence.pdf
Modeling Local Coherence: An Entity-Based Approach
Regina Barzilay and Mirella Lapata
https://people.csail.mit.edu/regina/my_papers/coherence.pdf
Area: Topic Models
Topic Modeling: Beyond Bag-of-Words
Hanna M. Wallach
http://dirichlet.net/pdf/wallach06topic.pdf
Topic Modeling: Beyond Bag-of-Words
Hanna M. Wallach
http://dirichlet.net/pdf/wallach06topic.pdf
Area: Text Classification + Machine Learning
Thumbs up? Sentiment classification using machine learning techniques
Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan
https://www.aclweb.org/anthology/W02-1011/
Thumbs up? Sentiment classification using machine learning techniques
Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan
https://www.aclweb.org/anthology/W02-1011/
Area: Automatic Text Summarization
(Many good options! But I personally like the one of the list)
Get To The Point: Summarization with Pointer-Generator Networks
Abigail See et al.
https://www.aclweb.org/anthology/P17-1099/
(Many good options! But I personally like the one of the list)
Get To The Point: Summarization with Pointer-Generator Networks
Abigail See et al.
https://www.aclweb.org/anthology/P17-1099/
Area: Parsing and Syntax
CCGbank: A Corpus of CCG Derivations and Dependency Structures Extracted from the Penn Treebank
Julia Hockenmaier, Mark Steedman
https://www.aclweb.org/anthology/J07-3004/
CCGbank: A Corpus of CCG Derivations and Dependency Structures Extracted from the Penn Treebank
Julia Hockenmaier, Mark Steedman
https://www.aclweb.org/anthology/J07-3004/
Area: Segmentation and Tagging
(A million good options, but this one was super impactful)
Feature-rich part-of-speech tagging with a cyclic dependency network
Kristina Toutanova et al
(A million good options, but this one was super impactful)
Feature-rich part-of-speech tagging with a cyclic dependency network
Kristina Toutanova et al
Area: Question Answering and Machine Comprehension
(I took this to mean the modern deep-learning centric form)
A thorough examination of the CNN/Daily Mail reading comprehension task
Danqi Chen et al
https://arxiv.org/abs/1606.02858
(I took this to mean the modern deep-learning centric form)
A thorough examination of the CNN/Daily Mail reading comprehension task
Danqi Chen et al
https://arxiv.org/abs/1606.02858
Area: Sequential Labeling & Information Extraction
Empirical Methods in Information Extraction
Claire Cardie
https://aaai.org/ojs/index.php/aimagazine/article/view/1322/1223
Empirical Methods in Information Extraction
Claire Cardie
https://aaai.org/ojs/index.php/aimagazine/article/view/1322/1223
Area: Machine Translation
(Many to pick from, I like how this one foreshadows some unsupervised MT)
A Statistical View on Bilingual Lexicon Extraction: From Parallel Corpora to Non-Parallel Corpora
Pascale Fung https://link.springer.com/chapter/10.1007/3-540-49478-2_1
(Many to pick from, I like how this one foreshadows some unsupervised MT)
A Statistical View on Bilingual Lexicon Extraction: From Parallel Corpora to Non-Parallel Corpora
Pascale Fung https://link.springer.com/chapter/10.1007/3-540-49478-2_1
Area: Coreference Resolution
(This is a bit niche, but I really found this work helpful when I worked in this area.)
The life and death of discourse entities: Identifying singleton mentions
M Recasens, MC de Marneffe, C Potts https://www.aclweb.org/anthology/N13-1071/
(This is a bit niche, but I really found this work helpful when I worked in this area.)
The life and death of discourse entities: Identifying singleton mentions
M Recasens, MC de Marneffe, C Potts https://www.aclweb.org/anthology/N13-1071/
Area: Clustering and Word/Sentence Embeddings
(The paper from the list seems like a good choice.)
Skip-Thought Vectors
Jamie Kiros et al
https://arxiv.org/abs/1506.06726
(The paper from the list seems like a good choice.)
Skip-Thought Vectors
Jamie Kiros et al
https://arxiv.org/abs/1506.06726
Area: Generation
(Generation has a ton of papers to pick from. Here's a classic)
Discourse strategies for generating natural-language text
Kathleen R.McKeown
https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.100.8521&rep=rep1&type=pdf
(Generation has a ton of papers to pick from. Here's a classic)
Discourse strategies for generating natural-language text
Kathleen R.McKeown
https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.100.8521&rep=rep1&type=pdf
Area: Language Modeling
(I couldn't find a classic here
Here's a recent interesting one that I would love to see on transformers...)
Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context
Urvashi Khandelwal, He He, Peng Qi, Dan Jurafsky
(I couldn't find a classic here

Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context
Urvashi Khandelwal, He He, Peng Qi, Dan Jurafsky
Finally, read Yejin Choi. I tried to add one of her papers, but her work really transcends these categories.