Mihai Surdeanu

University of Arizona

Explainable Deep Learning for Natural Language Processing

Summary

While deep learning approaches to natural language processing (NLP) have had many successes, they can be difficult to understand, augment, or maintain as needs shift. In this talk I will discuss two recent efforts that aim to bring explainability back into deep learning methods for NLP.

In the first part of the talk, I will introduce an explainable approach for information extraction (IE), an important NLP task, which mitigates the tension between generalization and explainability by jointly training for the two goals. Our approach uses a multi-task learning architecture, which jointly trains a classifier for information extraction, and a sequence model that labels words in the context of the relation that explain the decisions of the relation classifier. This sequence model is trained using a hybrid strategy: supervised, when supervision from pre-existing patterns is available, and semi-supervised otherwise. In the latter situation, we treat the sequence model’s labels as latent variables, and learn the best assignment that maximizes the performance of the extractor. We show that, even with minimal guidance for what makes a good explanation, i.e., 5 rules per relation type to be extracted, the sequence model provides labels that serve as accurate explanations. Further, we show that the joint training generally improves the performance of the IE classifier.

In the second part of the talk, we adapt recent advances from the adjacent field of program synthesis to information extraction, synthesizing extraction rules directly from a few provided examples. We use a transformer-based architecture to guide an enumerative search, and show that this reduces the number of steps that need to be explored before a rule is found. Further, we show that without training the synthesis algorithm on the specific domain, our synthesized rules achieve state-of-the-art performance in a 1-shot IE task, i.e., when only 1 example is provided for each class to be learned.

Short bio

Dr. Surdeanu works on NLP systems that process and extract meaning from natural language texts such as question answering (answering natural language questions), information extraction (converting free text into structured relations and events), and textual entailment. He focuses mostly on interpretable models, i.e., approaches where the computer can explain in human understandable terms why it made a decision, and machine reasoning, i.e., methods that approximate the human capacity to understand bigger things from knowing smaller facts. He published more than 100 peer-reviewed articles, including four articles that were among the top three most cited articles at their respective venues that year. His work has been cited more than 15 thousand times, and has a current h-index of 41. Dr. Surdeanu’s work was funded by several United States government organizations (DARPA, NIH, NSF), as well as private foundations (the Allen Institute for Artificial Intelligence, the Bill Melinda Gates Foundation).

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertisement".
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
PHPSESSID	session	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
_ga	2 years	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_gat_gtag_UA_74880351_9	1 minute	This cookie is set by Google and is used to distinguish users.
_gid	1 day	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visted in an anonymous form.

Mihai Surdeanu

Summary

Short bio

Other Courses