New machine learning method offers better predictions of future disease risk

Published: 26.8.2025

Predictions for developing common diseases were more accurate than before.

Cardiovascular Machine Learning — Image: Aalto University / Matti Ahlgren

Researchers at Aalto University have developed a new machine learning method that improves the estimation of risk of developing complex diseases such as heart disease, diabetes, and liver conditions. The new tool, called survivalFM, looks not just at individual risk factors, such as cholesterol levels or age, but also at how these factors interact with each other to affect long-term health outcomes.

‘Today’s health data is incredibly complex – and so is human health. Factors like age, lifestyle, and genetics rarely act alone; they also influence each other in subtle ways. We wanted to create a method that could capture some of these complex interdependencies and still be clear enough for researchers and clinicians to understand and use,’ says Heli Julkunen, the study’s lead author and a machine learning researcher at Aalto University.

What makes this approach novel is how it handles the interactions between risk factors. Examining every possible pair of interacting risk factors one by one would be computationally expensive. The new method uses a mathematical technique to efficiently capture the underlying patterns of interaction, even in very large datasets.

The results, published in Nature Communications on July 18, suggest that this approach could lead to more accurate and personalized predictions of who is likely to develop certain diseases in the future.

‘For instance, software that uses our method could help clinicians gain a better understanding of how combinations of risk factors, such as high cholesterol and smoking together, affect disease risk. This, however, is only a simplified example, since the true novelty of the method lies in its ability to examine the simultaneous effects of many such risk factors,’ Julkunen says.

Why this matters

Healthcare professionals use risk prediction models to estimate a person’s likelihood of developing a disease over time. These tools help guide decisions about prevention, screening, and treatment. For example, models like QRISK3 in the UK or FINRISKI in Finland are commonly used to estimate the future risk of cardiovascular disease in the next 10 years.

Traditional prediction models usually treat each risk factor on its own. But in reality, many factors affect each other. For instance, cholesterol levels can predict cardiovascular risk differently depending on age, genetics or lifestyle habits. By considering these interdependencies, the new machine learning method provides a more detailed picture of a person’s risk.

Tested with real-world health data

The researchers tested the new method using data from the UK Biobank, a large health research database that includes medical records, lab tests, lifestyle information, and genetic data from around 500,000 people.

The model was trained to predict the risk of developing ten common diseases over a ten-year period. Across most conditions, it outperformed standard prediction tools that treat risk factors independently. The new method was especially effective at improving individual risk estimates, in other words assigning higher predicted risks to those who actually went on to develop disease, and lower risks to those who remained healthy.

Designed to be interpretable

Unlike many machine learning and AI models that are difficult to interpret, this new method was designed to be transparent. That means researchers and users can understand how the model arrives at its predictions and see which combinations of risk factors influence the prediction.

‘We see an increasing interest for interpretability in machine learning and AI applications, particularly in sensitive areas like healthcare. This method allows us to look at the model and directly see why this person was flagged as high risk,’ says professor Juho Rousu from Aalto University.

The method is broadly applicable to any outcome where timing matters. This includes not only medical research but also fields like engineering reliability studies and financial risk modelling.

The research was funded by the Research Council of Finland and the Technology Industries of Finland Centennial Foundation via the Aalto University House of AI centre.

This article was originally published on the Aalto University website on 26.08.2025

Updated: 9.10.2025
Published: 26.8.2025

AI, Computer Science Department, Highlight, University of Helsinki Published: 19.2.2026

LUMI AI Factory Hub opens in the Kumpula Campus in Helsinki

The Kumpula hub offers services and hosts training and hackathons for artificial intelligence researchers and students.

Aalto University, Computer Science Department, Highlight, Research Published: 10.2.2026

A survey on users' experiences of Mykanta in collaboration between Aalto University and Kela

Senior university lecturer Sari Kujala's research group is exploring, in collaboration with Kela, users' experiences with the Mykanta online patient portal and the MyKanta mobile application.

Aalto University, AI, Department of Information and Communications Engineering, Highlight, Research Published: 19.1.2026

Specialised AI models could be Finland's next global export

Finland has the potential to build AI solutions that are different from ChatGPT-like large language models. Aalto University's School of Electrical Engineering already has decades of experience in developing specialised, resource-efficient AI models. They could be a key component of future 6G networks, automation, and industrial systems – and the next competitive edge of our country.

Aalto University, Awards, Department of Information and Communications Engineering, Highlight Published: 19.1.2026

Professor Patric Östergård becomes a member of the Finnish Society of Sciences and Letters

Finnish Society of Sciences and Letters is Finland's oldest science academy. It promotes scientific discussion, publishes scientific literature, awards prizes and provides financial support for research.