acl155

acl 2021 论文列表

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL/IJCNLP 2021, (Volume 1: Long Papers), Virtual Event, August 1-6, 2021.

Vocabulary Learning via Optimal Transport for Neural Machine Translation.
Including Signed Languages in Natural Language Processing.
UnNatural Language Inference.
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning.
Neural Machine Translation with Monolingual Translation Memory.
Scientific Credibility of Machine Translation Research: A Meta-Evaluation of 769 Papers.
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text.
Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering.
Discriminative Reranking for Neural Machine Translation.
Beyond Noise: Mitigating the Impact of Fine-grained Semantic Divergences on Neural Machine Translation.
Can Sequence-to-Sequence Models Crack Substitution Ciphers?
Language Embeddings for Typology and Cross-lingual Transfer Learning.
StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling.
SpanNER: Named Entity Re-/Recognition as Span Prediction.
Transition-based Bubble Parsing: Improvements on Coordination Structure Prediction.
Hate Speech Detection Based on Sentiment Knowledge Sharing.
Conditional Generation of Temporally-ordered Event Sequences.
ReadOnce Transformers: Reusable Representations of Text for Transformers.
ADEPT: An Adjective-Dependent Plausibility Task.
Improving Paraphrase Detection with the Adversarial Paraphrasing Task.
ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic.
RAW-C: Relatedness of Ambiguous Words in Context (A New Lexical Resource for English).
TIMEDIAL: Temporal Commonsense Reasoning in Dialog.
Cross-replication Reliability - An Empirical Approach to Interpreting Inter-rater Reliability.
DialogueCRN: Contextual Reasoning Networks for Emotion Recognition in Conversations.
Space Efficient Context Encoding for Non-Task-Oriented Dialogue Generation with Graph Attention Transformer.
Using Meta-Knowledge Mined from Identifiers to Improve Intent Recognition in Conversational Systems.
The R-U-A-Robot Dataset: Helping Avoid Chatbot Deception by Detecting User Questions About Human or Non-Human Identity.
Lexical Semantic Change Discovery.
Dynamic Contextualized Word Embeddings.
Verb Knowledge Injection for Multilingual Event Processing.
Learning Prototypical Functions for Physical Artifacts.
Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution.
Cross-Lingual Abstractive Summarization with Limited Parallel Resources.
EmailSum: Abstractive Email Thread Summarization.
Improving Factual Consistency of Abstractive Summarization via Question Answering.
ConvoSumm: Conversation Summarization Benchmark and Improved Abstractive Summarization with Argument Mining.
Are Missing Links Predictable? An Inferential Benchmark for Knowledge Graph Completion.
The statistical advantage of automatic NLG metrics at the system level.
Privacy at Scale: Introducing the PrivaSeer Corpus of Web Privacy Policies.
Neural semi-Markov CRF for Monolingual Word Alignment.
Evaluation of Thematic Coherence in Microblogs.
Joint Verification and Reranking for Open Fact Checking Over Tables.
Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning.
Mid-Air Hand Gestures for Post-Editing of Machine Translation.
Multimodal Multi-Speaker Merger & Acquisition Financial Modeling: A New Task, Dataset, and Neural Baselines.
Learning Latent Structures for Cross Action Phrase Relations in Wet Lab Protocols.
Metaphor Generation with Conceptual Mappings.
Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models.
DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts.
Language Model Augmented Relevance Score.
Question Answering Over Temporal Knowledge Graphs.
End-to-End Training of Neural Retrievers for Open-Domain Question Answering.
Learning Dense Representations of Phrases at Scale.
On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study.
Detecting Propaganda Techniques in Memes.
Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing Regressions In NLP Model Updates.
Accelerating Text Communication via Abbreviated Sentence Input.
Multi-hop Graph Convolutional Network with High-order Chebyshev Approximation for Text Reasoning.
Determinantal Beam Search.
A Novel Estimator of Mutual Information for Learning to Disentangle Textual Representations.
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization.
GhostBERT: Generate More Features with Cheap Operations for BERT.
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search.
CCMatrix: Mining Billions of High-Quality Parallel Sentences on the Web.
Beyond Offline Mapping: Learning Cross-lingual Word Embeddings through Context Anchoring.
Measuring and Increasing Context Usage in Context-Aware Machine Translation.
Selective Knowledge Distillation for Neural Machine Translation.
BERTGen: Multi-task Generation through BERT.
Controllable Open-ended Question Generation with A New Question Type Ontology.
DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Text Generation.
OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics.
Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence.
Keep It Simple: Unsupervised Simplification of Multi-Paragraph Text.
A Neural Transition-based Model for Argumentation Mining.
Argument Pair Extraction via Attention-guided Multi-Layer Multi-Cross Encoding.
Multi-Label Few-Shot Learning for Aspect Category Detection.
Dual Graph Convolutional Networks for Aspect-based Sentiment Analysis.
StructuralLM: Structural Pre-training for Form Understanding.
Document-level Event Extraction via Parallel Prediction Networks.
CLEVE: Contrastive Pre-training for Event Extraction.
Unleash GPT-2 Power for Event Detection.
Fine-grained Information Extraction from Biomedical Literature based on Knowledge-enriched Abstract Meaning Representation.
Joint Biomedical Entity and Relation Extraction with Knowledge-Enhanced Collective Inference.
Learning from Miscellaneous Other-Class Words for Few-shot Named Entity Recognition.
PRGC: Potential Relation and Global Correspondence Based Joint Relational Triple Extraction.
An End-to-End Progressive Multi-Task Learning Framework for Medical Named Entity Recognition and Normalization.
SENT: Sentence-level Distant Relation Extraction via Negative Training.
CIL: Contrastive Instance Learning Framework for Distantly Supervised Relation Extraction.
BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition.
Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering.
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation.
PhotoChat: A Human-Human Dialogue Dataset With Photo Sharing Behavior For Joint Image-Text Modeling.
Learn to Resolve Conversational Dependency: A Consistency Training Framework for Conversational Question Answering.
xMoCo: Cross Momentum Contrastive Learning for Open-Domain Question Answering.
Robustifying Multi-hop QA through Pseudo-Evidentiality Training.
Generating Query Focused Summaries from Query-Free Resources.
Focus Attention: Promoting Faithfulness and Diversity in Summarization.
Capturing Relations between Scientific Papers: An Abstractive Model for Related Work Section Generation.
BASS: Boosting Abstractive Summarization with Unified Semantic Graph.
RepSum: Unsupervised Dialogue Summarization based on Replacement Strategy.
Long-Span Summarization via Local Attention and Content Selection.
TGEA: An Error-Annotated Dataset and Benchmark Tasks for TextGeneration from Pretrained Language Models.
Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation.
POS-Constrained Parallel Decoding for Non-autoregressive Generation.
Improving Encoder by Auxiliary Supervision Tasks for Table-to-Text Generation.
Guiding the Growth: Difficulty-Controllable Question Generation through Step-by-Step Rewriting.
PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check.
Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism.
Instantaneous Grammatical Error Correction with Shallow Aggressive Decoding.
Multi-perspective Coherent Reasoning for Helpfulness Prediction of Multimodal Reviews.
Controversy and Conformity: from Generalized to Personalized Aggressiveness Detection.
Cross-modal Memory Networks for Radiology Report Generation.
What is Your Article Based On? Inferring Fine-grained Provenance.
SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining.
Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks.
Math Word Problem Solving with Explicit Numerical Values.
Lexicon Enhanced Chinese Sequence Labeling Using BERT Adapter.
MulDA: A Multilingual Data Augmentation Framework for Low-Resource Cross-Lingual NER.
An In-depth Study on Internal Structure of Chinese Words.
A Unified Generative Framework for Various NER Subtasks.
A Conditional Splitting Framework for Efficient Constituency Parsing.
Adapting Unsupervised Syntactic Parsing Methodology for Discourse Dependency Parsing.
Coreference Reasoning in Machine Reading Comprehension.
A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters.
Transfer Learning for Sequence Generation: from Single-source to Multi-source.
Importance-based Neuron Allocation for Multilingual Neural Machine Translation.
Modeling Bilingual Conversational Characteristics for Neural Chat Translation.
Rewriter-Evaluator Architecture for Neural Machine Translation.
CoSQA: 20, 000+ Web Queries for Code Search and Question Answering.
DynaEval: Unifying Turn and Dialogue Level Evaluation.
MMGCN: Multimodal Fusion via Deep Graph Convolution Network for Emotion Recognition in Conversation.
DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue.
Learning to Ask Conversational Questions by Optimizing Levenshtein Distance.
Generating Relevant and Coherent Dialogue Responses using Self-Separated Conditional Variational AutoEncoders.
A Human-machine Collaborative Framework for Evaluating Malevolence in Dialogues.
Maria: A Visual Experience Powered Conversational Agent.
Learning to Perturb Word Embeddings for Out-of-distribution QA.
Exploring Distantly-Labeled Rationales in Neural Network Models.
Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition.
Rethinking Stealthiness of Backdoor Attack against NLP Models.
De-Confounded Variational Encoder-Decoder for Logical Table-to-Text Generation.
Unified Interpretation of Softmax Cross-Entropy and Negative Sampling: With Case Study for Knowledge Graph Embedding.
BanditMTL: Bandit-based Multi-task Learning for Text Classification.
Shortformer: Better Language Modeling using Shorter Inputs.
Defense against Synonym Substitution-based Adversarial Attacks via Dirichlet Neighborhood Ensemble.
Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked Claims.
PP-Rec: News Recommendation with Personalized User Interest and Time-aware News Popularity.
HieRec: Hierarchical User Interest Modeling for Personalized News Recommendation.
Counterfactual Inference for Text Classification Debiasing.
Matching Distributions between Model and Data: Cross-domain Knowledge Distillation for Unsupervised Domain Adaptation.
Syntax-Enhanced Pre-trained Model.
On Sample Based Explanation Methods for NLP: Faithfulness, Efficiency and Semantic Evaluation.
Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators.
Alignment Rationale for Natural Language Inference.
StereoSet: Measuring stereotypical bias in pretrained language models.
Learning to Explain: Generating Stable Explanations Fast.
Language Model Evaluation Beyond Perplexity.
Positional Artefacts Propagate Through Masked Language Model Embeddings.
CTFN: Hierarchical Learning for Multimodal Sentiment Analysis Using Coupled-Translation Fusion Network.
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units.
LexFit: Lexical Fine-Tuning of Pretrained Language Models.
Meta-Learning with Variational Semantic Memory for Word Sense Disambiguation.
Obtaining Better Static Word Embeddings Using Contextual Embedding Models.
A Knowledge-Guided Framework for Frame Identification.
Word Sense Disambiguation: Towards Interactive Context Exploitation from Both Word and Sense Perspectives.
Lower Perplexity is Not Always Human-Like.
A Cognitive Regularizer for Language Modeling.
Learning Event Graph Knowledge for Abductive Reasoning.
Bootstrapped Unsupervised Sentence Representation Learning.
Data Augmentation with Adversarial Training for Cross-Lingual NLI.
AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models.
Structural Pre-training for Dialogue Comprehension.
Pre-training Universal Language Representation.
From Paraphrasing to Semantic Parsing: Unsupervised Semantic Parsing via Synchronous Semantic Decoding.
Reasoning over Entity-Action-Location Graph for Procedural Text Understanding.
COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion.
Exploring Dynamic Selection of Branch Expansion Orders for Code Generation.
ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer.
Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval.
Semi-Supervised Text Classification with Balanced Deep Representation Distributions.
Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision.
VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted Bag-of-words.
Concept-Based Label Embedding via Dynamic Routing for Hierarchical Text Classification.
Writing by Memorizing: Hierarchical Retrieval-based Medical Report Generation.
Early Detection of Sexual Predators in Chats.
Tail-to-Tail Non-Autoregressive Sequence Prediction for Chinese Grammatical Error Correction.
Generating SOAP Notes from Doctor-Patient Conversations Using Modular Summarization Techniques.
Personalized Transformer for Explainable Recommendation.
Lexicon Learning for Few Shot Sequence Modeling.
WARP: Word-level Adversarial ReProgramming.
Risk Minimization for Zero-shot Sequence Labeling.
R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling.
Parameter-Efficient Transfer Learning with Diff Pruning.
Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution.
Knowledge-Enriched Event Causality Identification via Latent Structure Induction Networks.
StereoRel: Relational Triple Extraction from a Stereoscopic Perspective.
Exploiting Document Structures and Cluster Consistencies for Event Coreference Resolution.
MLBiNet: A Cross-Sentence Collective Event Detection Network.
A Span-Based Model for Joint Overlapped and Discontinuous Named Entity Recognition.
De-biasing Distantly Supervised Named Entity Recognition via Causal Intervention.
GWLAN: General Word-Level AutocompletioN for Computer-Aided Translation.
Mask-Align: Self-Supervised Neural Word Alignment.
On Compositional Generalization of Neural Machine Translation.
Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction.
Employing Argumentation Knowledge Graphs for Neural Argument Generation.
Search from History and Reason for Future: Two-stage Reasoning on Temporal Knowledge Graphs.
Benchmarking Scalable Methods for Streaming Cross Document Entity Coreference.
CoRI: Collective Relation Integration with Data Augmentation for Open Information Extraction.
AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding.
Element Intervention for Open Relation Extraction.
Trigger is Not Sufficient: Exploiting Frame-aware Knowledge for Implicit Event Argument Extraction.
How Knowledge Graph and Attention Help? A Qualitative Analysis into Bag-level Relation Extraction.
Recursive Tree-Structured Self-Attention for Answer Sentence Selection.
ForecastQA: A Question Answering Challenge for Event Forecasting with Temporal Text Data.
TWAG: A Topic-Guided Wikipedia Abstract Generator.
Continuous Language Generative Flow.
One2Set: Generating Diverse Keyphrases as a Set.
Prefix-Tuning: Optimizing Continuous Prompts for Generation.
Weakly Supervised Named Entity Tagging with Learnable Logical Rules.
How to Adapt Your Pretrained Multilingual Model to 1600 Languages.
Syntax-augmented Multilingual BERT for Cross-lingual Transfer.
Energy-Based Reranking: Improving Neural Machine Translation Using Energy-Based Models.
SemFace: Pre-training Encoder and Decoder with a Semantic Interface for Neural Machine Translation.
Claim Matching Beyond English to Scale Global Fact-Checking.
Evaluation Examples are not Equally Informative: How should that change NLP Leaderboards?
Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP.
Dependency-driven Relation Extraction with Attentive Graph Convolutional Networks.
A Pre-training Strategy for Zero-Resource Response Selection in Knowledge-Grounded Conversations.
Semantic Representation for Dialogue Modeling.
RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems.
Intent Classification and Slot Filling for Privacy Policies.
Neural Stylistic Response Generation with Disentangled Latent Variables.
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalizability.
Hierarchy-aware Label Semantics Matching Network for Hierarchical Text Classification.
PairRE: Knowledge Graph Embeddings via Paired Relation Vectors.
Are Pretrained Convolutions Better than Pretrained Transformers?
BinaryBERT: Pushing the Limit of BERT Quantization.
Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.
Subsequence Based Deep Active Learning for Named Entity Recognition.
Reservoir Transformers.
Societal Biases in Language Generation: Progress and Challenges.
Probing Toxic Content in Large Pre-Trained Language Models.
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task.
Verb Metaphor Detection via Contextual Relation Learning.
Psycholinguistic Tripartite Graph Network for Personality Detection.
How is BERT surprised? Layerwise detection of linguistic anomalies.
End-to-End AMR Corefencence Resolution.
Anonymisation Models for Text Data: State of the art, Challenges and Future Directions.
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data.
Reliability Testing for Natural Language Processing Systems.
Supporting Land Reuse of Former Open Pit Mining Sites using Text Classification and Active Learning.
Breaking Down Walls of Text: How Can NLP Benefit Consumer Privacy?
A Mutual Information Maximization Approach for the Spurious Solution Problem in Weakly Supervised Question Answering.
Check It Again: Progressive Visual Question Answering via Visual Entailment.
Generation-Augmented Retrieval for Open-Domain Question Answering.
Dual Reader-Parser on Hybrid Textual and Tabular Evidence for Open Domain Question Answering.
Supporting Cognitive and Emotional Empathic Writing of Students.
ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation.
Handling Extreme Class Imbalance in Technical Logbook Datasets.
End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages.
Towards User-Driven Neural Machine Translation.
A unified approach to sentence segmentation of punctuated text in many languages.
VECO: Variable and Flexible Cross-lingual Pre-training for Language Understanding and Generation.
Point, Disambiguate and Copy: Incorporating Bilingual Dictionaries for Neural Machine Translation.
Exploring Discourse Structures for Argument Impact Classification.
Adversarial Learning for Discourse Rhetorical Structure Parsing.
Which Linguist Invented the Lightbulb? Presupposition Verification for Question-Answering.
ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences.
W-RST: Towards a Weighted RST-style Discourse Framework.
A Neural Model for Joint Document and Snippet Ranking in Question Answering for Large Document Collections.
Cross-language Sentence Selection via Data Augmentation and Rationale Training.
TAN-NTM: Topic Attention Networks for Neural Topic Modeling.
Label-Specific Dual Graph Neural Network for Multi-Label Text Classification.
Towards Propagation Uncertainty: Edge-enhanced Bayesian Graph Convolutional Networks for Rumor Detection.
A Sweet Rabbit Hole by DARCY: Using Honeypots to Detect Universal Trigger's Adversarial Attacks.
Making Pre-trained Language Models Better Few-shot Learners.
H-Transformer-1D: Fast One-Dimensional Hierarchical Attention for Sequences.
TextSETTR: Few-Shot Text Style Extraction and Tunable Targeted Restyling.
Self-Attention Networks Can Process Bounded Hierarchical Languages.
CogAlign: Learning to Align Textual Neural Representations to Cognitive Language Processing Signals.
Surprisal Estimators for Human Reading Times Need Character Models.
Structural Guidance for Transformer Language Models.
CDRNN: Discovering Complex Dynamics in Human Language Processing.
NeuralWOZ: Learning to Collect Task-Oriented Dialogue via Model-Based Simulation.
Best of Both Worlds: Making High Accuracy Non-incremental Transformer-based Disfluency Detection Incremental.
MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation Understanding.
Value-Agnostic Conversational Semantic Parsing.
HERALD: An Annotation Efficient Method to Detect User Disengagement in Social Conversations.
Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach.
Exploring the Representation of Word Meanings in Context: A Case Study on Homonymy and Synonymy.
BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies?
Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words.
Knowing the No-match: Entity Alignment with Dangling Cases.
Revisiting the Negative Data of Distantly Supervised Relation Extraction.
LearnDA: Learnable Knowledge-Guided Data Augmentation for Event Causality Identification.
Nested Named Entity Recognition via Explicitly Excluding the Influence of the Best Path.
Document-level Event Extraction via Heterogeneous Graph-based Interaction Model with a Tracker.
Out-of-Scope Intent Detection with Self-Supervision and Discriminative Training.
Diversifying Dialog Generation via Adaptive Label Smoothing.
GTM: A Generative Triple-wise Model for Conversational Question Generation.
Novel Slot Detection: A Benchmark for Discovering Unknown Slot Types in the Task-Oriented Dialogue System.
Towards Emotional Support Dialog Systems.
Prevent the Language Model from being Overconfident in Neural Machine Translation.
G-Transformer for Document-Level Machine Translation.
Rejuvenating Low-Frequency Words: Making the Most of Parallel Data in Non-Autoregressive Translation.
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment.
Consistency Regularization for Cross-Lingual Fine-Tuning.
Structured Sentiment Analysis as Dependency Graph Parsing.
Every Bite Is an Experience: Key Point Analysis of Business Reviews.
Position Bias Mitigation: A Knowledge-Aware Graph Model for Emotion Cause Extraction.
ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning.
Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation.
Meta-Learning to Compositionally Generalize.
Probabilistic, Structure-Aware Algorithms for Improved Variety, Accuracy, and Coverage of AMR Alignments.
Evidence-based Factual Error Correction.
Modeling Transitions of Focal Entities for Conversational Knowledge Base Question Answering.
TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance.
Answering Ambiguous Questions through Generative Evidence Fusion and Round-Trip Prediction.
Joint Models for Answer Verification in Question Answering Systems.
Can Generative Pre-trained Language Models Serve As Knowledge Bases for Closed-book QA?
Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate Speech.
MultiMET: A Multimodal Dataset for Metaphor Understanding.
Few-NERD: A Few-shot Named Entity Recognition Dataset.
Annotating Online Misogyny.
Fast and Accurate Neural Machine Translation with Translation Memory.
From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text.
Evaluating morphological typology in zero-shot cross-lingual transfer.
How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models.
Online Learning Meets Machine Translation Evaluation: Finding the Best Systems with the Least Human Effort.
Database reasoning over text.
UnitedQA: A Hybrid Approach for Open Domain Question Answering.
Few-Shot Question Answering by Pretraining Span Selection.
Explanations for CommonsenseQA: New Dataset and Models.
A Semantic-based Method for Unsupervised Commonsense Question Answering.
Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains.
Learning Syntactic Dense Embedding with Correlation Graph for Automatic Readability Assessment.
Competence-based Multimodal Curriculum Learning for Medical Report Generation.
PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction.
Unsupervised Extractive Summarization-Based Representations for Accurate and Explainable Collaborative Filtering.
LeeBERT: Learned Early Exit for BERT with cross-level optimization.
EnsLM: Ensemble Language Model for Data Diversity by Semantic Clustering.
Rational LAMOL: A Rationale-based Lifelong Learning Framework.
Marginal Utility Diminishes: Exploring the Minimum Knowledge for BERT Knowledge Distillation.
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer.
Lightweight Cross-Lingual Sentence Representation Learning.
Unsupervised Neural Machine Translation for Low-Resource Domains via Meta-Learning.
Cascade versus Direct Speech Translation: Do the Differences Still Make a Difference?
Guiding Teacher Forcing with Seer Forcing for Neural Machine Translation.
Breaking the Corpus Bottleneck for Context-Aware Neural Machine Translation with Cross-Task Pre-training.
Self-Training Sampling with Monolingual Data Uncertainty for Neural Machine Translation.
OntoED: Low-resource Event Detection with Ontology Embedding.
A Neural Transition-based Joint Model for Disease Named Entity Recognition and Normalization.
A Large-Scale Chinese Multimodal NER Dataset with Speech Clues.
Text2Event: Controllable Sequence-to-Structure Generation for End-to-end Event Extraction.
Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition.
Interpretable and Low-Resource Entity Matching via Decoupling Feature Learning from Decision Making.
Evaluating Evaluation Measures for Ordinal Classification and Ordinal Quantification.
Factoring Statutory Reasoning as Language Understanding Challenges.
Assessing the Representations of Idiomaticity in Vector Models with a Noun Compound Dataset Labeled at Type and Token Levels.
Towards Quantifiable Dialogue Coherence Evaluation.
Ruddit: Norms of Offensiveness for English Reddit Comments.
Neural Bi-Lexicalized PCFG Induction.
The Limitations of Limited Context for Constituency Parsing.
Multi-View Cross-Lingual Structured Prediction with Minimum Supervision.
Automated Concatenation of Embeddings for Structured Prediction.
N-ary Constituent Tree Parsing with Recursive Semi-Markov Model.
Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders.
Missing Modality Imagination Network for Emotion Recognition with Uncertain Missing Modalities.
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning.
LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding.
Beyond Sentence-Level End-to-End Speech Translation: Context Helps.
Multi-stage Pre-training over Simplified Multimodal Pre-training Models.
LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations.
Self-Guided Contrastive Learning for BERT Sentence Representations.
KACE: Generating Knowledge Aware Contrastive Explanations for Natural Language Inference.
Towards Robustness of Text-to-SQL Models against Synonym Substitution.
OTTers: One-turn Topic Transitions for Open-Domain Dialogue.
Comprehensive Study: How the Context Information of Different Granularity Affects Dialogue State Tracking?
Robustness Testing of Language Understanding in Task-Oriented Dialog.
ProtAugment: Intent Detection Meta-Learning through Unsupervised Diverse Paraphrasing.
Enhancing the generalization for Intent Classification and Out-of-Domain Detection in SLU.
Discovering Dialogue Slots with Weak Supervision.
A Unified Generative Framework for Aspect-based Sentiment Analysis.
A Hierarchical VAE for Calibrating Attributes while Generating Text using Normalizing Flow.
DynaSent: A Dynamic Benchmark for Sentiment Analysis.
Style is NOT a single variable: Case Studies for Cross-Stylistic Language Understanding.
Distributed Representations of Emotion Categories in Emotion Space.
ExCAR: Event Graph Knowledge Enhanced Explainable Causal Reasoning.
Tree-Structured Topic Modeling with Nonparametric Neural Variational Inference.
CLINE: Contrastive Learning with Semantic Negative Examples for Natural Language Understanding.
Chase: A Large-Scale and Pragmatic Chinese Dataset for Cross-Database Context-Dependent Text-to-SQL.
Better than Average: Paired Evaluation of NLP systems.
An Empirical Study on Hyperparameter Optimization for Fine-Tuning Pre-trained Language Models.
QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus.
KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers.
SMURF: SeMantic and linguistic UndeRstanding Fusion for Caption Evaluation via Typicality Analysis.
Integrating Semantics and Neighborhood Information with Graph-Driven Generative Models for Document Retrieval.
Data Augmentation for Text Generation Without Any Augmented Data.
On the Effectiveness of Adapter-based Tuning for Pretrained Language Model Adaptation.
EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets.
Changing the World by Changing the Data.
PlotCoder: Hierarchical Decoding for Synthesizing Visualization Code in Programmatic Context.
Mitigating Bias in Session-based Cyberbullying Detection: A Non-Compromising Approach.
IrEne: Interpretable Energy Prediction for Transformers.
Explaining Relationships Between Scientific Documents.
COVID-Fact: Fact Extraction and Verification of Real-World Claims on COVID-19 Pandemic.
BERTAC: Enhancing Transformer-based Language Models with Adversarially Pretrained Convolutional Neural Networks.
Optimizing Deeper Transformers on Small Datasets.
Weight Distillation: Transferring the Knowledge in Neural Network Parameters.
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information.
Modeling Fine-Grained Entity Types with Box Embeddings.
PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World.
Edited Media Understanding Frames: Reasoning About the Intent and Implications of Visual Misinformation.
Control Image Captioning Spatially and Temporally.
Hierarchical Context-aware Network for Dense Video Event Captioning.
Glancing Transformer for Non-Autoregressive Neural Machine Translation.
UXLA: A Robust Unsupervised Data Augmentation Framework for Zero-Resource Cross-Lingual NLP.
Crafting Adversarial Examples for Neural Machine Translation.
Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks.
RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models.
Intrinsic Bias Metrics Do Not Correlate with Application Bias.
A Survey of Race, Racism, and Anti-Racism in NLP.
Bad Seeds: Evaluating Lexical Methods for Bias Measurement.
Poisoning Knowledge Graph Embeddings via Relation Inference Patterns.
Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases.
Bird's Eye: Probing for Linguistic Graph Structures with a Simple Information-Theoretic Approach.
Causal Analysis of Syntactic Agreement Mechanisms in Neural Language Models.
Implicit Representations of Meaning in Neural Language Models.
Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning.
Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model.
Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data.
A Systematic Investigation of KB-Text Embedding Alignment at Scale.
A Joint Model for Dropped Pronoun Recovery and Conversational Discourse Parsing in Chinese Conversational Speech.
Dialogue Response Selection with Hierarchical Curriculum Learning.
Discovering Dialog Structure Graph for Coherent Dialog Generation.
A Sequence-to-Sequence Approach to Dialogue State Tracking.
I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling.
InfoSurgeon: Cross-Media Fine-grained Information Consistency Checking for Fake News Detection.
Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection.
A Survey of Code-switching: Linguistic and Social Perspectives for Language Technologies.
Measuring Conversational Uptake: A Case Study on Student-Teacher Interactions.
Changes in European Solidarity Before and During COVID-19: Evidence from a Large Crowd- and Expert-Annotated Twitter Dataset.
Topic-Aware Evidence Reasoning and Stance-Aware Aggregation for Fact Verification.
Stance Detection in COVID-19 Tweets.
Syntopical Graphs for Computational Argumentation Tasks.
Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection.
Improving Formality Style Transfer with Context-Aware Rule Injection.
Directed Acyclic Graph Network for Conversational Emotion Recognition.
Factuality Assessment as Modal Dependency Parsing.
MECT: Multi-Metadata Embedding based Cross-Transformer for Chinese Named Entity Recognition.
Leveraging Type Descriptions for Zero-shot Named Entity Recognition and Classification.
A Gradually Soft Multi-Task and Data-Augmented Approach to Medical Question Understanding.
Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval.
Language Model as an Annotator: Exploring DialoGPT for Dialogue Summarization.
BACO: A Background Knowledge- and Content-Based Framework for Citing Sentence Generation.
Towards Table-to-Text Generation with Numerical Reasoning.
Reflective Decoding: Beyond Unidirectional Generation with Off-the-Shelf Language Models.
AggGen: Ordering and Aggregating while Generating.
Factorising Meaning and Form for Intent-Preserving Paraphrasing.
Select, Extract and Generate: Neural Keyphrase Generation with Layer-wise Coverage Attention.
Assessing Emoji Use in Modern Text Processing Tools.
CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes.
Automated Generation of Storytelling Vocabulary from Photographs for use in AAC.
Towards Argument Mining for Social Good: A Survey.
On Finding the K-best Non-projective Dependency Trees.
Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages Study.
Diverse Pretrained Context Encodings Improve Document Translation.
Attention Calibration for Transformer in Neural Machine Translation.
Common Sense Beyond English: Evaluating and Improving Multilingual Language Models for Commonsense Reasoning.
Improving Zero-Shot Translation by Disentangling Positional Information.
Measure and Evaluation of Semantic Divergence across Two Languages.
Align Voting Behavior with Public Statements for Legislator Representation Learning.
What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks?
A Dataset and Baselines for Multilingual Reply Suggestion.
Can vectors read minds better than experts? Comparing data augmentation strategies for the automated scoring of children's mindreading ability.
AugNLG: Few-shot Natural Language Generation using Self-trained Data Augmentation.
More Identifiable yet Equally Performant Transformers for Text Classification.
Uncovering Constraint-Based Behavior in Neural Models via Targeted Fine-Tuning.
Comparing Test Sets with Item Response Theory.
Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation.
When Do You Need Billions of Words of Pretraining Data?
Multi-Task Retrieval for Knowledge-Intensive Tasks.
Explainable Prediction of Text Complexity: The Missing Preliminaries for Text Simplification.
Selecting Informative Contexts Improves Language Model Fine-tuning.
MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation.
Unsupervised Out-of-Domain Detection via Pre-trained Transformers.
The Art of Abstention: Selective Prediction and Error Regularization for Natural Language Processing.
A DQN-based Approach to Finding Precise Evidences for Fact Verification.
Robust Knowledge Graph Completion with Stacked Convolutions and a Student Re-Ranking Network.
Stereotyping Norwegian Salmon: An Inventory of Pitfalls in Fairness Benchmark Datasets.
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation.
Prosodic segmentation for parsing spoken dialogue.
To POS Tag or Not to POS Tag: The Impact of POS Tags on Morphological Learning in Low-Resource Settings.
The Possible, the Plausible, and the Desirable: Event-Based Modality Detection for Language Processing.
A Targeted Assessment of Incremental Processing in Neural Language Models and Humans.
Compositional Generalization and Natural Language Variation: Can a Semantic Parsing Approach Handle Both?
Span-based Semantic Parsing for Compositional Generalization.
XLPT-AMR: Cross-Lingual Pre-Training via Multi-Task Learning for Zero-Shot AMR Parsing and Text Generation.
DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations.
Integrated Directional Gradients: Feature Interaction Attribution for Neural NLP Models.
What Context Features Can Transformer Language Models Use?
Learning Faithful Representations of Causal Graphs.
Multilingual Speech Translation from Efficient Finetuning of Pretrained Models.
Bilingual Lexicon Induction via Unsupervised Bitext Construction and Word Alignment.
Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data.
Do Context-Aware Translation Models Pay the Right Attention?
LNN-EL: A Neuro-Symbolic Approach to Short-text Entity Linking.
Discontinuous Named Entity Recognition as Maximal Clique Discovery.
Compare to The Knowledge: Graph Neural Fake News Detection with External Knowledge.
AdvPicker: Effectively Leveraging Unlabeled Data via Adversarial Discriminator for Cross-Lingual NER.
From Discourse to Narrative: Knowledge Projection for Event Relation Extraction.
CitationIE: Leveraging the Citation Graph for Scientific Information Extraction.
Increasing Faithfulness in Knowledge-Grounded Dialogue with Controllable Features.
Learning from Perturbations: Diverse and Informative Dialogue Generation with Inverse Adversarial Training.
Improving Dialog Systems for Negotiation with Personality Modeling.
TicketTalk: Toward human-level performance with end-to-end, transaction-based dialog systems.
SocAoG: Incremental Graph Parsing for Social Relation Inference in Dialogues.
Breaking Down the Invisible Wall of Informal Fallacies in Online Discussions.
Modeling Language Usage and Listener Engagement in Podcasts.
Structurizing Misinformation Stories via Rationalizing Fact-Checks.
Understanding and Countering Stereotypes: A Computational Approach to the Stereotype Content Model.
OoMMix: Out-of-manifold Regularization in Contextual Embedding Space for Text Classification.
COSY: COunterfactual SYntax for Cross-Lingual Understanding.
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks.
Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor.
Cascaded Head-colliding Attention.
KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation.
Learning Relation Alignment for Calibrated Cross-modal Retrieval.
E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning.
Generating Landmark Navigation Instructions from Maps as a Graph-to-Text Problem.
Improving the Faithfulness of Attention-based Explanations with Task-specific Information for Text Classification.
Explaining Contextualization in Language Models using Visual Analytics.
Examining the Inductive Bias of Neural Language Models with Artificial Languages.
Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger.
Introducing Orthogonal Constraint in Structural Probes.
DESCGEN: A Distantly Supervised Datasetfor Generating Entity Descriptions.
A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance and Self-referenced Redundancy.
Self-Supervised Multimodal Opinion Summarization.
Multi-TimeLine Summarization (MTLS): Improving Timeline Summarization by Generating Multiple Summaries.
Deep Differential Amplifier for Extractive Summarization.
PASS: Perturb-and-Select Summarizer for Product Reviews.
Aspect-Category-Opinion-Sentiment Quadruple Extraction with Implicit Aspects and Opinions.
Multimodal Sentiment Detection Based on Multi-channel Graph Neural Networks.
Bridge-Based Active Domain Adaptation for Aspect Term Extraction.
Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis.
Learning Language Specific Sub-network for Multilingual Machine Translation.
A Bidirectional Transformer Based Alignment Model for Unsupervised Word Alignment.
Multi-Head Highly Parallelized LSTM Decoder for Neural Machine Translation.
Understanding the Properties of Minimum Bayes Risk Decoding in Neural Machine Translation.
Contrastive Learning for Many-to-many Multilingual Neural Machine Translation.
Refining Sample Embeddings with Relation Prototypes to Enhance Continual Relation Extraction.
UniRE: A Unified Label Space for Entity Relation Extraction.
Capturing Event Argument Interaction via A Bi-Directional Entity-Level Recurrent Decoder.
Modularized Interaction Network for Named Entity Recognition.
Accelerating BERT Inference for Sequence Labeling via Early-Exit.
GL-GIN: Fast and Accurate Non-Autoregressive Model for Joint Multiple Intent Detection and Slot Filling.
BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data.
Transferable Dialogue Systems and User Simulators.
Dual Slot Selector via Local Reliability Verification for Dialogue State Tracking.
Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances.
Generalising Multilingual Concept-to-Text NLG with Language Agnostic Delexicalisation.
Mention Flags (MF): Constraining Transformer-based Text Generators.
Enhancing Content Preservation in Text Style Transfer Using Reverse Attention and Conditional Layer Normalization.
PENS: A Dataset and Generic Framework for Personalized News Headline Generation.
DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling.
Unified Dual-view Cognitive Model for Interpretable Claim Verification.
HateCheck: Functional Tests for Hate Speech Detection Models.
Engage the Public: Poll Question Generation for Social Media Posts.
How Did This Get Funded?! Automatically Identifying Quirky Scientific Achievements.
Investigating label suggestions for opinion mining in German Covid-19 social media.
Frontmatter.