New machine learning method offers better predictions of future disease risk

Predictions for developing common diseases were more accurate than before.
Cardiovascular Machine Learning
Image: Aalto University / Matti Ahlgren

Researchers at Aalto University have developed a new machine learning method that improves the estimation of risk of developing complex diseases such as heart disease, diabetes, and liver conditions. The new tool, called survivalFM, looks not just at individual risk factors, such as cholesterol levels or age, but also at how these factors interact with each other to affect long-term health outcomes. 

‘Today’s health data is incredibly complex – and so is human health. Factors like age, lifestyle, and genetics rarely act alone; they also influence each other in subtle ways. We wanted to create a method that could capture some of these complex interdependencies and still be clear enough for researchers and clinicians to understand and use,’ says Heli Julkunen, the study’s lead author and a machine learning researcher at Aalto University. 

What makes this approach novel is how it handles the interactions between risk factors. Examining every possible pair of interacting risk factors one by one would be computationally expensive. The new method uses a mathematical technique to efficiently capture the underlying patterns of interaction, even in very large datasets.

The results, published in Nature Communications on July 18, suggest that this approach could lead to more accurate and personalized predictions of who is likely to develop certain diseases in the future.

‘For instance, software that uses our method could help clinicians gain a better understanding of how combinations of risk factors, such as high cholesterol and smoking together, affect disease risk. This, however, is only a simplified example, since the true novelty of the method lies in its ability to examine the simultaneous effects of many such risk factors,’ Julkunen says. 

Why this matters

Healthcare professionals use risk prediction models to estimate a person’s likelihood of developing a disease over time. These tools help guide decisions about prevention, screening, and treatment. For example, models like QRISK3 in the UK or FINRISKI in Finland are commonly used to estimate the future risk of cardiovascular disease in the next 10 years.

Traditional prediction models usually treat each risk factor on its own. But in reality, many factors affect each other. For instance, cholesterol levels can predict cardiovascular risk differently depending on age, genetics or lifestyle habits. By considering these interdependencies, the new machine learning method provides a more detailed picture of a person’s risk. 

Tested with real-world health data

The researchers tested the new method using data from the UK Biobank, a large health research database that includes medical records, lab tests, lifestyle information, and genetic data from around 500,000 people. 

The model was trained to predict the risk of developing ten common diseases over a ten-year period. Across most conditions, it outperformed standard prediction tools that treat risk factors independently. The new method was especially effective at improving individual risk estimates, in other words assigning higher predicted risks to those who actually went on to develop disease, and lower risks to those who remained healthy.

Designed to be interpretable

Unlike many machine learning and AI models that are difficult to interpret, this new method was designed to be transparent. That means researchers and users can understand how the model arrives at its predictions and see which combinations of risk factors influence the prediction.

‘We see an increasing interest for interpretability in machine learning and AI applications, particularly in sensitive areas like healthcare. This method allows us to look at the model and directly see why this person was flagged as high risk,’ says professor Juho Rousu from Aalto University. 

The method is broadly applicable to any outcome where timing matters. This includes not only medical research but also fields like engineering reliability studies and financial risk modelling.

The research was funded by the Research Council of Finland and the Technology Industries of Finland Centennial Foundation via the Aalto University House of AI centre.

This article was originally published on the Aalto University website on 26.08.2025

  • Updated:
  • Published:
Share
URL copied!

Read more news

anonymity of AI
AI, Artificial Intelligence, Computer Science Department, Highlight, Research, University of Helsinki Published:

How to ensure anonymity of AI systems?

When training artificial intelligence systems, developers need to use privacy-enhancing technologies to ensure that the subjects of the training data are not exposed, new study suggests.
Director at OKKA Foundation, Tuulikki Similä, Arto Hellas, and chairwoman of the board of directors of Nokia, Sari Baldauf.
Aalto University, Awards, Computer Science Department, Highlight Published:

Arto Hellas receives the Nokia Foundation teaching recognition award

Arto Hellas was awarded the inaugural Nokia-OKKA Educational Recognition Award for his long-term efforts in advancing ICT education.
Katsiaryna and Arash at ECAI 2025
AI, Computer Science Department, News from HIIT, Research, University of Helsinki Published:

GRADSTOP: Early Stopping of Gradient Descent via Posterior Sampling presented at ECAI 2025

HIIT Postdoc Katsiaryna Haitsiukewich presented a full paper at ECAI-2025
HIIT Open 2025
Aalto University, Community Outreach, Education, Foundations, News from HIIT Published:

HIIT Open 2025 programming contest

Two weeks ago, 16 teams and 41 contestants gathered in Otaniemi to compete on algorithmic problem solving.