Machine Learning Faculty Publications

Towards Visual Question Answering on Pathology Images

Xuehai He, University of California, San Diego
Zhuo Cai, Tsinghua University
Wenlan Wei, Wuhan University
Yichen Zhang, University of California, San DiegoFollow
Luntian Mou, Beijing University of Technology
Eric Xing, Mohamed bin Zayed University of Artificial IntelligenceFollow
Pengtao Xie, University of California, San Diego

Document Type

Book

Publication Title

ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference

Abstract

Pathology imaging is broadly used for identifying the causes and effects of diseases or injuries. Given a pathology image, being able to answer questions about the clinical findings contained in the image is very important for medical decision making. In this paper, we aim to develop a pathological visual question answering framework to analyze pathology images and answer medical questions related to these images. To build such a framework, we create PathVQA, a pathology VQA dataset with 32,795 questions asked from 4,998 pathology images. We also propose a three-level optimization framework which performs self-supervised pretraining and VQA finetuning end-to-end to learn powerful visual and textual representations jointly and automatically identifies and excludes noisy self-supervised examples from pretraining. We perform experiments on our created PathVQA dataset and the results demonstrate the effectiveness of our proposed methods. The datasets and code are available at https://github.com/UCSD-AI4H/PathVQA.

First Page

708

Last Page

718

DOI

10.18653/v1/2021.acl-short.90

Publication Date

8-1-2021

Comments

IR deposit conditions: none described

OA version available on ACL Anthology

Recommended Citation

X. He et al., “Towards visual question answering on pathology images,” ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference, vol. 2, pp. 708–718, 2021, doi: 10.18653/V1/2021.ACL-SHORT.90.

Link to Full Text

COinS

Machine Learning Faculty Publications

Towards Visual Question Answering on Pathology Images

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Comments

Recommended Citation

Browse

Contribute

Links

Machine Learning Faculty Publications

Towards Visual Question Answering on Pathology Images

Authors

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Comments

Recommended Citation

Share

Browse

Contribute

Links