Publications

(2023). Negativity spreads faster: A large-scale multilingual Twitter analysis on the role of sentiment in political communication. Online Social Networks and Media.

(2023). Enabling Early Health Care Intervention by Detecting Depression in Users of Web-Based Forums using Language Models: Longitudinal Analysis and Evaluation. JMIR AI.

(2022). TweetNLP: Cutting-Edge Natural Language Processing for Social Media. EMNLP 2022: System Demonstrations.

(2022). Probing Relational Knowledge in Language Models via Word Analogies. Findings EMNLP 2022.

(2022). Improving Embeddings Representations for Comparing Higher Education Curricula: A Use Case in Computing. EMNLP 2022.

(2022). Guiding Generative Language Models for Data Augmentation in Few-Shot Text Classification. DaSH 2022.

(2022). Generative Language Models for Paragraph-Level Question Generation. EMNLP 2022.

(2022). A Benchmark for Neural Readability Assessment of Texts in Spanish. TSAR 2022.

(2022). Neural Readability Pairwise Ranking for Sentences in Italian Administrative Language. AACL-IJCNLP 2022.

(2022). Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts. AACL-IJCNLP 2022.

(2022). Twitter Topic Classification. COLING 2022.

(2022). TempoWiC: An Evaluation Benchmark for Detecting Meaning Shift in Social Media. COLING 2022.

(2022). XLM-T: Multilingual Language Models in Twitter for Sentiment Analysis and Beyond. LREC 2022.

(2022). Towards Readability-Controlled Machine Translation of COVID-19 Texts. EAMT 2022.

(2022). Simple TICO-19: A Dataset for Joint Translation and Simplification of COVID-19 Texts. LREC 2022.

(2022). PeruSIL: A Framework to Build a Continuous Peruvian Sign Language Interpretation Dataset. SignLang 2022.

(2022). TimeLMs: Diachronic Language Models from Twitter. ACL 2022: System Demonstrations.

(2022). LMMS reloaded: Transformer-based sense embeddings for disambiguation and beyond. Artificial Intelligence.

(2021). On the Cross-lingual Transferability of Contextualized Sense Embeddings. MLR 2021.

(2021). Distilling Relation Embeddings from Pretrained Language Models. EMNLP 2021.

(2021). Probing Pre-Trained Language Models for Disease Knowledge. Findings ACL-IJCNLP 2021.

(2021). Deriving Word Vectors from Contextualized Language Models using Topic-Aware Mention Selection. RepL4NLP-2021.

(2021). COVID-19 and Misinformation: A Large-Scale Lexical Analysis on Twitter. ACL-IJCNLP SRW 2021.

(2021). BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies? ACL-IJCNLP 2021.

(2021). Analysis and Evaluation of Language Models for Word Sense Disambiguation. Computational Linguistics.

(2021). WiC-TSV: An Evaluation Benchmark for Target Sense Verification of Words in Context. EACL 2021.

(2021). T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition. EACL 2021: System Demonstrations.

(2020). Towards Preemptive Detection of Depression and Anxiety in Twitter. #SMM4H 2020.

(2020). Go Simple and Pre-Train on Domain-Specific Corpora: On the Role of Training Data for Text Classification. COLING 2020.

(2020). Embeddings in Natural Language Processing. COLING 2020: Tutorial Abstracts.

(2020). Definition Extraction Feature Analysis: From Canonical to Naturally-Occurring Definitions. CogALex 2020.

(2020). CollFrEn: Rich Bilingual English–French Collocation Resource. MWE-LEX 2020.

(2020). A Mixture-of-Experts Model for Learning Multi-Facet Entity Embeddings. COLING 2020.

(2020). XL-WiC: A Multilingual Benchmark for Evaluating Semantic Contextualization. EMNLP 2020.

(2020). Understanding the Source of Semantic Regularities in Word Embeddings. CoNLL 2020.

(2020). TweetEval: Unified Benchmark and Comparative Evaluation for Tweet Classification. Findings EMNLP 2020.

(2020). Combining BERT with Static Word Embeddings for Categorizing Social Media. W-NUT 2020.

(2020). Learning Company Embeddings from Annual Reports for Fine-grained Industry Characterization. FinNLP 2020.

(2020). On the Robustness of Unsupervised and Semi-supervised Cross-lingual Word Embedding Learning. LREC 2020.

(2020). A Short Survey on Sense-Annotated Corpora. LREC 2020.
