BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies?
Analogies play a central role in human commonsense reasoning. The ability to recognize analogies such as “eye is to seeing what ear …
Asahi Ushio, Luis Espinosa-Anke, Steven Schockaert, Jose Camacho-Collados
COVID-19 and Misinformation: A Large-Scale Lexical Analysis on Twitter
Social media is often used by individuals and organisations as a platform to spread misinformation. With the recent coronavirus …
Dimosthenis Antypas, Jose Camacho-Collados, Alun Preece, David Rogers
Deriving Word Vectors from Contextualized Language Models using Topic-Aware Mention Selection
One of the long-standing challenges in lexical semantics consists in learning representations of words which reflect their semantic …
Yixiao Wang, Zied Bouraoui, Luis Espinosa-Anke, Steven Schockaert
Probing Pre-Trained Language Models for Disease Knowledge
Pre-trained language models such as ClinicalBERT have achieved impressive results on tasks such as medical Natural Language Inference. …
Israa Alghanmi, Luis Espinosa-Anke, Steven Schockaert
Evaluating Language Models for the Retrieval and Categorization of Lexical Collocations
Lexical collocations are idiosyncratic combinations of two syntactically bound lexical items (e.g., “heavy …
Luis Espinosa-Anke, Joan Codina-Filba, Leo Wanner
T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition
Language model (LM) pretraining has led to consistent improvements in many NLP downstream tasks, including named entity recognition …
Asahi Ushio, Jose Camacho-Collados
WiC-TSV: An Evaluation Benchmark for Target Sense Verification of Words in Context
We present WiC-TSV, a new multi-domain evaluation benchmark for Word Sense Disambiguation. More specifically, we introduce a framework …
Anna Breit, Artem Revenko, Kiamehr Rezaee, Mohammad Taher Pilehvar, Jose Camacho-Collados
A Mixture-of-Experts Model for Learning Multi-Facet Entity Embeddings
Various methods have already been proposed for learning entity embeddings from text descriptions. Such embeddings are commonly used for …
Rana Alshaikh, Zied Bouraoui, Shelan Jeawak, Steven Schockaert
Cardiff University at SemEval-2020 Task 6: Fine-tuning BERT for Domain-Specific Definition Classification
We describe the system submitted to SemEval-2020 Task 6, Subtask 1. The aim of this subtask is to predict whether a given sentence …
Shelan Jeawak, Luis Espinosa-Anke, Steven Schockaert
CollFrEn: Rich Bilingual English–French Collocation Resource
Collocations in the sense of idiosyncratic lexical co-occurrences of two syntactically bound words traditionally pose a challenge to …
Beatriz Fisas, Luis Espinosa-Anke, Joan Codina-Filbá, Leo Wanner