A deep learning perspective on linguistic ambiguity

Aina, Laura

A deep learning perspective on linguistic ambiguity

dc.contributor

Universitat Pompeu Fabra. Departament de Traducció i Ciències del llenguatge

dc.contributor.author

Aina, Laura

dc.date.accessioned

2022-05-09T14:28:04Z

dc.date.available

2022-05-09T14:28:04Z

dc.date.issued

2022-03-25

dc.identifier.uri

http://hdl.handle.net/10803/674202

dc.description.abstract

This thesis studies the information that an expression and its context contribute to ambiguity resolution, focusing on the syntactic, lexical, and referential levels, and on the English language. I adopt computational linguistics, in particular deep learning methods, as my research framework, by introducing methodologies to 1) analyze deep learning models, and 2) use them for linguistic analysis. In a subset of studies, I investigate how neural language models – trained from unlabeled text corpora – process ambiguities, analyzing both their internal representations and their predictions. Other experiments employ deep learning models to estimate different kinds of linguistic information, with the goal of scaling the testing of linguistic hypotheses. Concretely, I study how an expression and its context interact during both interpretation and production. The thesis provides a comprehensive perspective on linguistic ambiguity, contributing to the understanding of the mechanisms that humans and artificial systems adopt to deal with this phenomenon.

en_US

dc.description.abstract

Aquesta tesi estudia la informació que una expressió i el seu context contribueixen a la resolució d’ambigüitats, centrant-se en els nivells sintàctic, lèxic i referencial, i en la llengua anglesa. Adopto la lingüística computacional, en particular els mètodes d’aprenentatge profund, com a marc de recerca, introduint metodologies per a 1) analitzar models d’aprenentatge profund i 2) utilitzar-los per a l’anàlisi lingüística. En un subconjunt d’estudis, investigo com els models neuronals del llenguatge, que aprenent a partir de corpus de text sense etiquetar, processen les ambigüitats, analitzant tant les seves representacions internes com les seves prediccions. Altres experiments fan servir models d’aprenentatge profund per estimar diferents tipus d’informació lingüística, amb l’objectiu d’escalar l’avaluació d’hipòtesis lingüístiques. Concretament, estudio com interactuen una expressió i el seu context tant durant la interpretació com durant la producció. La tesi proporciona una àmplia perspectiva sobre l’ambigüitat lingüística, contribuint a la comprensió dels mecanismes que els humans i els sistemes artificials adopten per respondre a aquest fenomen.

en_US

dc.description.abstract

Esta tesis estudia la información que una expresión y su contexto contribuyen a la resolución de la ambigüedad, en los niveles sintáctico, léxico y referencial, y en inglés. Adopto la lingüística computacional, y en particular métodos de aprendizaje profundo, como mi marco de investigación, introduciendo metodologías para 1) analizar modelos de aprendizaje profundo y 2) usarlos para el análisis lingüístico. En algunos estudios, investigo cómo los modelos del lenguaje, entrenados a partir de corpus de texto sin etiquetar, procesan las ambigüedades, analizando tanto sus representaciones internas como sus predicciones. En otros experimentos, los uso para estimar diferentes tipos de información lingüística, con el objetivo de escalar la evaluación de hipótesis lingüísticas. En concreto, estudio cómo una expresión y su contexto interactúan tanto durante la interpretación como durante la producción. La tesis proporciona una perspectiva amplia sobre la ambigüedad lingüística, contribuyendo a la comprensión de los mecanismos que adoptan los seres humanos y los sistemas artificiales para responder a este fenómeno.

en_US

dc.format.extent

184 p.

en_US

dc.format.mimetype

application/pdf

dc.language.iso

eng

en_US

dc.publisher

Universitat Pompeu Fabra

dc.rights.license

L'accés als continguts d'aquesta tesi queda condicionat a l'acceptació de les condicions d'ús establertes per la següent llicència Creative Commons: http://creativecommons.org/licenses/by/4.0/

dc.rights.uri

http://creativecommons.org/licenses/by/4.0/

dc.source

TDX (Tesis Doctorals en Xarxa)

dc.subject

Linguistic ambiguity

en_US

dc.title

A deep learning perspective on linguistic ambiguity

en_US

dc.type

info:eu-repo/semantics/doctoralThesis

dc.type

info:eu-repo/semantics/publishedVersion

dc.subject.udc

en_US

dc.contributor.director

Boleda, Gemma

dc.embargo.terms

cap

en_US

dc.rights.accessLevel

info:eu-repo/semantics/openAccess

dc.description.degree

Programa de doctorat en Traducció i Ciències del Llenguatge

Documentos

tladef.pdf

2.436Mb PDF

Este ítem aparece en la(s) siguiente(s) colección(ones)

Programa de Doctorat en Traducció i Ciències del Llenguatge [321]

Àrea de contingut