1

BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies?

Analogies play a central role in human commonsense reasoning. The ability to recognize analogies such as {``}eye is to seeing what ear …

Asahi Ushio, Luis Espinosa-Anke, Steven Schockaert, Jose Camacho-Collados

COVID-19 and Misinformation: A Large-Scale Lexical Analysis on Twitter

Social media is often used by individuals and organisations as a platform to spread misinformation. With the recent coronavirus …

Dimosthenis Antypas, Jose Camacho-Collados, Alun Preece, David Rogers

Deriving Word Vectors from Contextualized Language Models using Topic-Aware Mention Selection

One of the long-standing challenges in lexical semantics consists in learning representations of words which reflect their semantic …

Yixiao Wang, Zied Bouraoui, Luis Espinosa-Anke, Steven Schockaert

Probing Pre-Trained Language Models for Disease Knowledge

Pre-trained language models such as ClinicalBERT have achieved impressive results on tasks such as medical Natural Language Inference. …

Israa Alghanmi, Luis Espinosa-Anke, Steven Schockaert

Evaluating Language Models for the Retrieval and Categorization of Lexical Collocations

Lexical collocations are idiosyncratic combinations of two syntactically bound lexical items (e.g., ‘‘heavy …

Luis Espinosa-Anke, Joan Codina-Filba, Leo Wanner

T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition

Language model (LM) pretraining has led to consistent improvements in many NLP downstream tasks, including named entity recognition …

Asahi Ushio, Jose Camacho-Collados

WiC-TSV: An Evaluation Benchmark for Target Sense Verification of Words in Context

We present WiC-TSV, a new multi-domain evaluation benchmark for Word Sense Disambiguation. More specifically, we introduce a framework …

Anna Breit, Artem Revenko, Kiamehr Rezaee, Mohammad Taher Pilehvar, Jose Camacho-Collados

A Mixture-of-Experts Model for Learning Multi-Facet Entity Embeddings

Various methods have already been proposed for learning entity embeddings from text descriptions. Such embeddings are commonly used for …

Rana Alshaikh, Zied Bouraoui, Shelan Jeawak, Steven Schockaert

Cardiff University at SemEval-2020 Task 6: Fine-tuning BERT for Domain-Specific Definition Classification

We describe the system submitted to SemEval-2020 Task 6, Subtask 1. The aim of this subtask is to predict whether a given sentence …

Shelan Jeawak, Luis Espinosa-Anke, Steven Schockaert

CollFrEn: Rich Bilingual English--French Collocation Resource

Collocations in the sense of idiosyncratic lexical co-occurrences of two syntactically bound words traditionally pose a challenge to …

Beatriz Fisas, Luis Espinosa-Anke, Joan Codina-Filbá, Leo Wanner