刁宇峰同学的论文被EMNLP2018录取为长文
新闻来源:IR实验室       发布时间:2018/8/17 15:33:24

  近日ENNLP2018程序委员会发布了录用情况,博士生刁宇峰的论文“WECA: A WordNet-Encoded Collocation-Attention Network for Homographic Pun Recognition”,其研究主题关于双关语的识别。主要内容如下:

Abstract:

Homographic puns have a long history in human writing, widely used in written and spoken literature, which usually occur in a certain syntactic or stylistic structure. How to recognize homographic puns is an important research. However, homographic pun recognition does not solve very well in existing work.

In this work, we first use WordNet to understand and expand word embedding for settling the polysemy of homographic puns, and then propose a WordNet-Encoded CollocationAttention network model (WECA) which combined with the context weights for recognizing the puns. Our experiments on the SemEval2017 Task7 and Pun of the Day demonstrate that the proposed model is able to distinguish between homographic pun and nonhomographic pun texts. We show the effectiveness of the model to present the capability of choosing qualitatively informative words.

The results show that our model achieves the state-of-the-art performance on homographic puns recognition.

中文摘要:

语义双关语在人类写作中有悠久的 ,包含一定的句法或体裁结构,广泛应用于写作等文学作品中。如何识别语义双关语是一项重要的研究。但是,现有的工作未能很好的解决这一课题。

在这项工作中,我们首先使用WordNet来理解和扩展词嵌入模型,用于解决语义双关语的歧义问题。然后,我们提出一种WordNet-Encoded CollocationAttention (WECA)网络,融合了上下文权重来识别双关语。我们分别在SemEval2017 Task7和Pun of the Day两个数据集上进行实验,实验结果表明我们的模型可以有效的区分语义双关语,该性能已达到国际先进水平。


EMNLP2018将于2018年10月31日到11月4日在比利时布鲁塞尔举行,期待与各国学术同行进行交流。