Completed FELLOWSHIP UKRI Gateway to Research

Unsupervised Background Knowledge for Language Understanding

£11.23M GBP

Funder	UK Research and Innovation Future Leaders Fellowship
Recipient Organization	Cardiff University
Country	United Kingdom
Start Date	Feb 01, 2021
End Date	Jan 31, 2025
Duration	1,460 days
Number of Grantees	2
Roles	Fellow; Award Holder
Data Source	UKRI Gateway to Research
Grant ID	`MR/T042001/1`

Grant Description

Significant progress in Artificial Intelligence (AI) has been made in recent years, and this has resulted in a huge expectation on what this technology can offer us in the future. However, there are still many challenges that must be addressed before this promise can be turned into a reality, and one of these challenges is Natural Language Processing (NLP).

If a computer is ever to understand humans in a natural way and to demonstrate a level of intelligence that we would normally expect, then the problem of Language Understanding must be solved.

Making computers to understand natural languages is a non-trivial task. Current approaches to language understanding rely on end-to-end supervised learning, exemplified by deep learning techniques in recent years. Typically, a corpus of relevant text is collected and then used to train the computer to perform a certain task.

However, this approach may have several problems, e.g., the words extracted and used to train a computer often have implicit meanings and can be ambiguous. Consider the following two sentences, for example: (1) We found many birds during our visit to the zoo: eagles, parrots, cranes... (2) The crane was hurt and could barely move.

A computer will not be able to understand from these training examples that there are in fact two types of crane (bird and machine) and the fact that only one type of crane (bird) can get hurt. It is widely recognised that handling word ambiguity and, more broadly, understanding what words mean, is a significant challenge in NLP. For instance, Google Translate, widely considered as the state-of-the-art in machine translation, fails to translate these two sentences correctly even to closely related languages such as Spanish.

Generally speaking, current techniques are hard to generalize across different tasks and domains, especially in applications requiring language understanding.

The proposed research intends to develop theories and novel solutions to bridge this gap by combining and leveraging lexical resources and unsupervised techniques for analysing text corpora, thereby learning the much-needed, but not-explicitly-available background knowledge. Our goal is then to seamlessly integrate this background knowledge into real-world applications for more accurate language understanding.

We will exploit these techniques in different languages, making them directly applicable in important multilingual NLP tasks, including lower-resourced languages such as Welsh, and in domains with direct societal impact such as social media and health care.

All Grantees

Cardiff University

Interested in applying for this grant?

Complete our application form to express your interest and we'll guide you through the process.

Apply for This Grant

Unsupervised Background Knowledge for Language Understanding

Grant Description

All Grantees

Interested in applying for this grant?

Quick Summary

Related Grants