Deep learning that scales: leveraging compute and data

Campos Camúñez, Víctor

dc.contributor

Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors

dc.contributor.author

Campos Camúñez, Víctor

dc.date.accessioned

2021-01-15T12:38:20Z

dc.date.available

2021-01-15T12:38:20Z

dc.date.issued

2020-12-22

dc.identifier.uri

http://hdl.handle.net/10803/670372

dc.description.abstract

Deep learning has revolutionized the field of artificial intelligence in the past decade. Although the development of these techniques spans several years, the recent advent of deep learning is explained by the increased availability of data and compute, which has unlocked the potential of deep neural networks. These networks have become ubiquitous in domains such as natural language processing, computer vision, speech processing, and control, where enough training data is available. Recent years have seen continuous progress driven by ever-growing neural networks that benefit from large amounts of data and computing power. This thesis is motivated by the observation that scale is one of the key factors driving progress in deep learning research, and aims to devise deep learning methods that scale gracefully with the available data and compute. We narrow this scope down to two main research directions. The first is concerned with designing hardware-aware methods that make the most of the computing resources in current high-performance computing facilities. The second studies the bottlenecks that prevent existing methods from scaling up as more data becomes available, and provides solutions that contribute towards enabling the training of more complex models. This dissertation studies these research questions for two different learning paradigms, each with its own algorithmic and computational characteristics. The first part of the thesis studies the paradigm in which the model needs to learn from a collection of examples, extracting as much information as possible from the given data. The second part is concerned with training agents that learn by interacting with a simulated environment, which introduces unique challenges such as efficient exploration and simulation.

dc.format.extent

156 p.

dc.format.mimetype

application/pdf

dc.language.iso

eng

dc.publisher

Universitat Politècnica de Catalunya

dc.rights.license

Access to the contents of this thesis is subject to acceptance of the terms of use established by the following Creative Commons license: http://creativecommons.org/licenses/by-nc/4.0/

dc.rights.uri

http://creativecommons.org/licenses/by-nc/4.0/

dc.source

TDX (Tesis Doctorals en Xarxa)

dc.subject.other

UPC subject areas::Computer science

dc.title

Deep learning that scales: leveraging compute and data

dc.type

info:eu-repo/semantics/doctoralThesis

dc.type

info:eu-repo/semantics/publishedVersion

dc.subject.udc

004

dc.contributor.director

Torres, Jordi (Torres Viñals)

dc.contributor.codirector

Giró i Nieto, Xavier

dc.embargo.terms

none

dc.rights.accessLevel

info:eu-repo/semantics/openAccess

Documents

TVCC1de1.pdf

8.551 MB PDF

This item appears in the following Collection(s)

Departament d'Arquitectura de Computadors [364]