Machine learning tool joins the battle against “long COVID”

Artificial intelligence software is being used to extract insights from health records and shed light on chronic, long-lasting symptoms plaguing the planet

Long COVID, a condition in which patients experience chronic symptoms from the initial COVID-19 infection, has become a pandemic within the pandemic, and researchers are using machine learning to discover why some people develop debilitating long-lasting symptoms.

A team of US researchers has developed a machine learning tool that analyses electronic health records (EHRs) to identify common symptoms and define subtypes of long COVID. The research, published in eBioMedicine, also found strong correlations between long COVID subtypes and pre-existing conditions like diabetes and hypertension.

According to Justin Reese, a Computer Research Scientist at Berkeley Lab's Biosciences Area, this research can improve our understanding of long COVID and enable more effective treatments by helping clinicians create tailored therapies for different groups. For example, the best treatment for patients experiencing nausea and abdominal pain might differ from treatment for those suffering from a persistent cough and other lung symptoms.

The team validated their software using EHR information from 6,469 long COVID patients who had confirmed COVID-19 infections.

“Basically, we found long COVID features in the EHR data for each long COVID patient, and then assessed patient-patient similarity using semantic similarity, which essentially allows ‘fuzzy matching’ between features – for example, ‘cough’ is not the same as ‘shortness of breath,’ but they are similar since they both involve lung problems,” says Reese. “We compare all symptoms for the pair of the patients in this way, and get a score of how similar the two long COVID patients are. We can then perform unsupervised machine learning on these scores to find different subtypes of long COVID.”

Machine learning adapts to different EHR systems

The researchers applied machine learning to patient-patient similarity scores to cluster patients into groups. These groups were characterised by analysing relationships between symptoms and pre-existing diseases, as well as other demographic features like age, gender, and race.

According to Reese and his colleagues, the tool will be useful for researchers because the machine learning approach is adaptable to different EHR systems, allowing researchers to gather data from various medical establishments.

This research builds on previous work to create the Human Phenotype Ontology, an open-access database and research tool that provides a standardised vocabulary of symptoms and features found in all human diseases. The latest work was funded by the National COVID Cohort Collaborative.


Featured Articles

Lenovo: Employees prefer mix of AI and human IT support

New Lenovo survey shows 91% of employees believe they would be more productive when their IT issues at work are resolved quickly and effectively

Kyndryl’s Data and AI Console to simplify data management

Data-driven solution expands and increases observability and insights, while enhanced data governance helps identify irregularities and threats

Deep neural networks still struggling to match human vision

New study by researchers in Canada finds artificial intelligence still can't match the powers of human vision despite deep learning's ability with big data

Metaverse destined to become an impossible, dangerous place


Clever coders lead the way as Microsoft launches 365 Copilot

AI Applications

Baidu’s ERNIE doesn’t want confrontation with United States

AI Applications