Embeddings in Natural Language Processing

Jose Camacho-Collados, Mohammad Taher Pilehvar

December 2020

Abstract

Embeddings have been one of the most important topics of interest in NLP for the past decade. Representing knowledge through a low-dimensional vector which is easily integrable in modern machine learning models has played a central role in the development of the field. Embedding techniques initially focused on words but the attention soon started to shift to other forms. This tutorial will provide a high-level synthesis of the main embedding techniques in NLP, in the broad sense. We will start by conventional word embeddings (e.g., Word2Vec and GloVe) and then move to other types of embeddings, such as sense-specific and graph alternatives. We will finalize with an overview of the trending contextualized representations (e.g., ELMo and BERT) and explain their potential and impact in NLP.

Type

Publication

Proceedings of the 28th International Conference on Computational Linguistics: Tutorial Abstracts

Embeddings in Natural Language Processing

Abstract

Jose Camacho-Collados

Professor & UKRI Future Leaders Fellow