Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening.

Title: Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening.
Publication Type: Journal Article
Year of Publication: 2020
Authors: Wu N, Phang J, Park J, Shen Y, Huang Z, Zorin M, Jastrzebski S, Fevry T, Katsnelson J, Kim E, Wolfson S, Parikh U, Gaddam S, Lin LLeng Young, Ho K, Weinstein JD, Reig B, Gao Y, Toth H, Pysarenko K, Lewin A, Lee J, Airola K, Mema E, Chung S, Hwang E, Samreen N, S Kim G, Heacock L, Moy L, Cho K, Geras KJ
Journal: IEEE Trans Med Imaging
Volume: 39
Issue: 4
Pagination: 1184-1194
Date Published: 2020 04
ISSN: 1558-254X
Keywords: Breast, Breast Neoplasms, Deep Learning, Early Detection of Cancer, Female, Humans, Image Interpretation, Computer-Assisted, Mammography, Radiologists
Abstract

We present a deep convolutional neural network for breast cancer screening exam classification, trained and evaluated on over 200,000 exams (over 1,000,000 images). Our network achieves an AUC of 0.895 in predicting the presence of cancer in the breast, when tested on the screening population. We attribute the high accuracy to a few technical advances. 1) Our network's novel two-stage architecture and training procedure, which allows us to use a high-capacity patch-level network to learn from pixel-level labels alongside a network learning from macroscopic breast-level labels. 2) A custom ResNet-based network used as a building block of our model, whose balance of depth and width is optimized for high-resolution medical images. 3) Pretraining the network on screening BI-RADS classification, a related task with noisier labels. 4) Combining multiple input views in an optimal way among a number of possible choices. To validate our model, we conducted a reader study with 14 readers, each reading 720 screening mammogram exams, and show that our model is as accurate as experienced radiologists when presented with the same data. We also show that a hybrid model, averaging the probability of malignancy predicted by a radiologist with a prediction of our neural network, is more accurate than either of the two separately. To further understand our results, we conduct a thorough analysis of our network's performance on different subpopulations of the screening population, the model's design, training procedure, errors, and properties of its internal representations. Our best models are publicly available at https://github.com/nyukat/breast_cancer_classifier.
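
The hybrid model described in the abstract can be illustrated with a minimal sketch, assuming per-exam malignancy probabilities are available from both a radiologist and the network. The function names `hybrid_prediction` and `auc`, and the `weight` parameter, are hypothetical and not taken from the authors' repository; `weight=0.5` corresponds to the plain averaging the abstract mentions, and the pairwise AUC computation is a generic textbook formulation, not necessarily the evaluation code the authors used.

```python
from typing import List, Sequence


def hybrid_prediction(
    radiologist_probs: Sequence[float],
    model_probs: Sequence[float],
    weight: float = 0.5,
) -> List[float]:
    """Blend radiologist and network malignancy probabilities per exam.

    weight is the share given to the radiologist (hypothetical parameter);
    the default 0.5 is a simple average of the two predictions.
    """
    if len(radiologist_probs) != len(model_probs):
        raise ValueError("both inputs must cover the same exams")
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return [
        weight * r + (1.0 - weight) * m
        for r, m in zip(radiologist_probs, model_probs)
    ]


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise comparison.

    Counts the fraction of (positive, negative) pairs in which the positive
    exam receives the higher score; ties count as half.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Made-up toy values for illustration only; not data from the study.
    toy_labels = [1, 1, 0, 0, 0]
    toy_radiologist = [0.9, 0.3, 0.4, 0.1, 0.2]
    toy_model = [0.4, 0.8, 0.5, 0.3, 0.1]
    toy_hybrid = hybrid_prediction(toy_radiologist, toy_model)
    print("radiologist AUC:", auc(toy_labels, toy_radiologist))
    print("model AUC:", auc(toy_labels, toy_model))
    print("hybrid AUC:", auc(toy_labels, toy_hybrid))
```

In this toy setup the two readers make different mistakes, so averaging their scores can rank the exams better than either alone; whether that happens on real data depends on how correlated their errors are.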

DOI: 10.1109/TMI.2019.2945514
Alternate Journal: IEEE Trans Med Imaging
PubMed ID: 31603772
PubMed Central ID: PMC7427471
Grant List: P41 EB017183 / EB / NIBIB NIH HHS / United States
R21 CA225175 / CA / NCI NIH HHS / United States
Related Institute: MRI Research Institute (MRIRI)
