A deep learning artificial intelligence (AI) model that was developed using only mammogram image biomarkers accurately predicted both ductal carcinoma in situ (DCIS) and invasive carcinoma, according to research presented at the annual meeting of the Radiological Society of North America (RSNA). Additionally, the model showed no bias across multiple races.

Traditional breast cancer risk assessment models use information obtained from patient questionnaires, such as medical and reproductive history, to calculate a patient’s future risk of developing breast cancer. 

“In the domain of precision medicine, risk-based screening has been elusive because we have not been able to accurately evaluate a woman’s risk of developing breast cancer,” says study lead author Leslie R. Lamb, MD, MSc, a breast radiologist at Massachusetts General Hospital (MGH) in Boston. “Even the best existing traditional risk models do not perform well on the individual level.”

Traditional risk models have also demonstrated poor performance across different patient races, most likely due to the data used to develop the model. “Traditional models likely have racial biases due to the populations on which they were developed,” Lamb says. “Several of the commonly used models were developed on predominantly European Caucasian populations.” 

According to the American Cancer Society, Black women demonstrate the lowest five-year relative survival rate for breast cancer among all racial and ethnic groups. This translates to a persistent 6% to 8% disparity in five-year survival rates between Black and white women across all breast cancer types. 

To accurately determine breast cancer risk, foster early detection and improve patient survival rates, it is important that risk models are developed that are applicable across different populations. 

A deep learning AI risk assessment model developed using mammographic images alone can outperform traditional risk assessment models in future breast cancer development while also mitigating the racial biases seen in traditional models.

In the first study of its kind, Lamb and colleagues sought to assess the performance of an image-based deep learning risk assessment model in predicting both future invasive breast cancer and DCIS across multiple races. 

The model’s performance was assessed by comparing areas under the receiver operating characteristic curve (AUC) with the DeLong test. The AUC score measures the predictive rate of the model on a scale of from 0 to 1. Multiple prior studies have estimated traditional risk model performance measured by AUC in the range of 0.59-0.62 for white women, with much lower performance in women of other races.

The multisite study included 129,340 routine bilateral screening mammograms performed in 71,479 women between 2009 to 2018 with five-year follow-up data. Patient demographics were obtained from electronic medical records, and instances of cancer were identified from the regional tumor registry. 

The racial makeup of the study group included white (106,839 exams), Black (6,154 exams), Asian (6,435 exams), self-reported other races (6,257 exams) and unknown (3,655 exams). The mean age of the women was 59 years old. The deep learning model consistently outperformed traditional risk models in predicting a woman’s risk of developing DCIS, which is early-stage breast cancer, and invasive breast cancer, which is cancer that has potential to spread. 

“The model is able to translate the full diversity of subtle imaging biomarkers in the mammogram, beyond what the naked eye can see, that can predict a woman’s future risk of both DCIS and invasive breast cancer,” Lamb says. “The deep learning image-only risk model can provide increased access to more accurate, equitable and less costly risk assessment.” 

The predictive rate of both DCIS and invasive cancer was 0.71 across all races. The AUC in predicting DCIS was 0.77 in non-white patients and 0.71 in white patients. The AUC in predicting invasive cancer was 0.72 in non-white patients and 0.71 in white patients. 

“This is a particularly exciting domain for AI, as it demonstrates the opportunity to apply ‘AI for good’—to reduce well-known racial disparities in risk assessment,” says senior author Constance D. Lehman, MD, PhD, a breast radiologist at MGH. “We are now poised to translate these findings into improved clinical care for our patients.”