Loading…
Loading grant details…
| Funder | UK Research and Innovation Future Leaders Fellowship |
|---|---|
| Recipient Organization | Cardiff University |
| Country | United Kingdom |
| Start Date | Feb 01, 2021 |
| End Date | Jan 31, 2025 |
| Duration | 1,460 days |
| Number of Grantees | 2 |
| Roles | Fellow; Award Holder |
| Data Source | UKRI Gateway to Research |
| Grant ID | MR/T042001/1 |
Significant progress in Artificial Intelligence (AI) has been made in recent years, and this has resulted in a huge expectation on what this technology can offer us in the future. However, there are still many challenges that must be addressed before this promise can be turned into a reality, and one of these challenges is Natural Language Processing (NLP).
If a computer is ever to understand humans in a natural way and to demonstrate a level of intelligence that we would normally expect, then the problem of Language Understanding must be solved.
Making computers to understand natural languages is a non-trivial task. Current approaches to language understanding rely on end-to-end supervised learning, exemplified by deep learning techniques in recent years. Typically, a corpus of relevant text is collected and then used to train the computer to perform a certain task.
However, this approach may have several problems, e.g., the words extracted and used to train a computer often have implicit meanings and can be ambiguous. Consider the following two sentences, for example: (1) We found many birds during our visit to the zoo: eagles, parrots, cranes... (2) The crane was hurt and could barely move.
A computer will not be able to understand from these training examples that there are in fact two types of crane (bird and machine) and the fact that only one type of crane (bird) can get hurt. It is widely recognised that handling word ambiguity and, more broadly, understanding what words mean, is a significant challenge in NLP. For instance, Google Translate, widely considered as the state-of-the-art in machine translation, fails to translate these two sentences correctly even to closely related languages such as Spanish.
Generally speaking, current techniques are hard to generalize across different tasks and domains, especially in applications requiring language understanding.
The proposed research intends to develop theories and novel solutions to bridge this gap by combining and leveraging lexical resources and unsupervised techniques for analysing text corpora, thereby learning the much-needed, but not-explicitly-available background knowledge. Our goal is then to seamlessly integrate this background knowledge into real-world applications for more accurate language understanding.
We will exploit these techniques in different languages, making them directly applicable in important multilingual NLP tasks, including lower-resourced languages such as Welsh, and in domains with direct societal impact such as social media and health care.
Cardiff University
Complete our application form to express your interest and we'll guide you through the process.
Apply for This Grant