Image source: Karen Arnott/EMBL-EBI

News • Stratification approach

Using national health data to predict cancer risk

A new large-scale study uses data from Danish health registries to predict individual risks of developing 20 different types of cancer.

This statistical study is a proof of concept, but the analysis suggests the model could be adapted and transferred to other healthcare systems. It could help to identify people with a high risk of developing cancer, for whom early cancer screening programmes could be trialled.

The researchers published their insights in the journal The Lancet Digital Health.

Detecting cancer early gives patients more treatment options and generally results in better clinical outcomes. Current screening programmes focus only on specific cancer types, for example bowel or cervical cancer, although new blood tests are being trialled that could detect multiple cancer types. If there were a simple way to use health data to calculate an individual’s risk of developing cancer, this could further inform cancer screening.

This large, systematic study used comprehensive data from the Danish health register, in which all clinical diagnoses of the population are stored. The researchers systematically analysed health records, family history, and lifestyle data. While the analysis does not allow an exact prediction of which person will develop cancer, it does determine the individual risk and enables a comparison with people of a similar age.

The prediction model was first trained on data collected between 1995 and 2014 from 6.7 million adults. The training dataset included more than 90 million diagnoses spanning over 1,000 different diseases. The model was then validated on datasets from the same registry, collected between 2015 and 2018, and covering 4.7 million Danes. The agreement between the model’s predictions and the time when individuals developed cancer, if any, was 81%. The model had high accuracy for cancers of the digestive system, as well as for thyroid, kidney, and uterine cancer. 
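The article does not name the metric behind the 81% figure. One common way to score agreement between predicted risk and time to diagnosis is a concordance index: among pairs of people where it is known who was diagnosed first, the share in which the model gave that person the higher risk. The sketch below assumes that metric; the function name and the toy numbers are hypothetical and are not taken from the study.

```python
from itertools import combinations


def concordance_index(times, events, risks):
    """Share of comparable pairs in which the person diagnosed earlier
    was also given the higher predicted risk.

    times  -- follow-up time until diagnosis or end of observation
    events -- True if a diagnosis was observed at that time, False if not
    risks  -- predicted risk score (higher means more at risk)
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Let `a` be the person with the shorter follow-up time.
        a, b = (i, j) if times[i] <= times[j] else (j, i)
        # A pair is comparable only if the shorter time ends in an
        # observed diagnosis and the two times differ.
        if not events[a] or times[a] == times[b]:
            continue
        comparable += 1
        if risks[a] > risks[b]:
            concordant += 1.0
        elif risks[a] == risks[b]:
            concordant += 0.5  # tied scores count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Made-up toy data for four people (illustration only).
toy_times = [2, 5, 3, 8]
toy_events = [True, False, True, True]
toy_risks = [0.9, 0.2, 0.1, 0.4]
print(concordance_index(toy_times, toy_events, toy_risks))  # 0.6
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering, so under this assumed reading 81% would mean the model ranked roughly four out of five comparable pairs correctly.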

To test whether this model would work with health data from other countries, the researchers used data from the UK Biobank and achieved comparable levels of accuracy. “Such models are not perfect, but they can provide valuable information for new risk-adapted cancer screening programmes,” said Moritz Gerstung, Division Head at DKFZ and Visiting Research Group Leader at EMBL-EBI. “But the only way this works is by having a system for capturing and leveraging comprehensive health data. The Danish health data ecosystem is unique because it holds digital data for the entire population and spans decades. Only a few European countries offer something similar, including Finland, Iceland and Sweden, or special research cohorts in the UK.”

The work confirmed well-known factors that are associated with cancer, such as smoking and alcohol consumption. The researchers also found that while family history is most informative before the age of 45, at an older age, an individual’s disease history is more informative for their cancer risk. The work also suggests that individuals who previously suffered from multiple different diseases can be at a higher risk of developing cancer. “A typical pattern is that, sadly, diseases, including cancer, often come in clusters. This doesn’t mean that preceding diseases cause cancer, and the true reason might be a different one. But it’s a correlation that could be taken into account when calculating cancer risk,” said Gerstung. 

“The novelty of the study lies in the volumes and richness of data we used, and the work we did to scale up well-established statistical models,” said Alexander Jung, Postdoctoral Researcher at the University of Copenhagen and visiting scientist at EMBL-EBI. “Our model covers more than 1,000 factors that can contribute to a person’s risk of developing cancer, and this is huge compared to previous models which only took a few factors into consideration.” 


Source: European Bioinformatics Institute

25.05.2024
