Encoder-Decoder Methods

Mihai Surdeanu; Marco Antonio Valenzuela-Escárcega

doi:10.1017/9781009026222.015

14 - Encoder-Decoder Methods

Published online by Cambridge University Press: 01 February 2024


Affiliations:
Mihai Surdeanu: University of Arizona
Marco Antonio Valenzuela-Escárcega: University of Arizona


Summary

In Chapters 10 and 12, we focused on two common usages of recurrent neural networks and transformer networks: acceptors and transducers. In this chapter, we discuss a third architecture for both kinds of network: encoder-decoder methods. We introduce three encoder-decoder architectures, which enable important NLP applications such as machine translation. First, we discuss the sequence-to-sequence method of Sutskever et al. (2014), which couples an encoder long short-term memory (LSTM) with a decoder LSTM. We then turn to the approach of Bahdanau et al. (2015), which extends that decoder with an attention component that produces a different encoding of the source text for each decoded word. Last, we introduce the complete encoder-decoder transformer network, which relies on three attention mechanisms: one within the encoder (discussed in Chapter 12), a similar one that operates over the decoded words, and, importantly, one that connects the input words with the decoded ones.
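
To make the idea of "a different encoding of the source text for each decoded word" concrete, here is a minimal sketch in plain Python. It is not the chapter's code. It assumes dot-product scoring, which is simpler than the additive scoring of Bahdanau et al. (2015), and it uses hand-picked toy vectors in place of learned LSTM or transformer states. The names `softmax`, `attend`, `encoder_states`, and `decoder_states` are hypothetical.

```python
import math


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(decoder_state, encoder_states):
    """Return (weights, context) for a single decoding step.

    Assumption: each source position is scored by the dot product between
    its encoder state and the current decoder state.
    """
    scores = [
        sum(d * e for d, e in zip(decoder_state, enc))
        for enc in encoder_states
    ]
    weights = softmax(scores)
    dim = len(encoder_states[0])
    # The context vector is the attention-weighted average of encoder states.
    context = [
        sum(w * enc[i] for w, enc in zip(weights, encoder_states))
        for i in range(dim)
    ]
    return weights, context


# Toy, hand-picked vectors (hypothetical; a real model would learn these).
encoder_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # one per source word
decoder_states = [[2.0, 0.0], [0.0, 2.0]]              # one per decoded word

contexts = []
for step, dec in enumerate(decoder_states):
    weights, context = attend(dec, encoder_states)
    contexts.append(context)
    print(step, [round(w, 3) for w in weights], [round(c, 3) for c in context])
```

Each decoding step yields its own weight distribution over the source words, and therefore its own context vector. A plain sequence-to-sequence model, by contrast, hands the decoder a single fixed encoding of the whole source text.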

Keywords

encoder-decoder, sequence-to-sequence, attention, transformer networks

Type: Chapter
Information: Deep Learning for Natural Language Processing: A Gentle Introduction, pp. 216-228

DOI: https://doi.org/10.1017/9781009026222.015

Publisher: Cambridge University Press

Print publication year: 2024
