“Swarm Learning”

AI with swarm intelligence to analyse medical data

Communities benefit from sharing knowledge and experience among their members. Following a similar principle - called “swarm learning” - an international research team has trained artificial intelligence algorithms to detect blood cancer, lung diseases and Covid-19 in data stored in a decentralized fashion.

This approach has an advantage over conventional methods: privacy-preserving technology is built in, which facilitates cross-site analysis of scientific data. Swarm learning could thus significantly promote and accelerate collaboration and information exchange in research, especially in the field of medicine. Experts from the German Center for Neurodegenerative Diseases (DZNE), the University of Bonn, the information technology company Hewlett Packard Enterprise (HPE) and other research institutions report on this in the scientific journal “Nature”.

Prof. Joachim Schultze, Director of Systems Medicine at DZNE

Image source: DZNE / Frommann


Science and medicine are becoming increasingly digital. Analyzing the resulting volumes of information - known as “big data” - is considered a key to better treatment options. “Medical research data are a treasure. They can play a decisive role in developing personalized therapies that are tailored to each individual more precisely than conventional treatments,” said Joachim Schultze, Director of Systems Medicine at the DZNE and professor at the Life & Medical Sciences Institute (LIMES) at the University of Bonn. “It’s critical for science to be able to use such data as comprehensively and from as many sources as possible.”

However, the exchange of medical research data across different locations or even between countries is subject to data protection and data sovereignty regulations. In practice, these requirements can usually only be implemented with significant effort. In addition, there are technical barriers: For example, when huge amounts of data have to be transferred digitally, data lines can quickly reach their performance limits. In view of these conditions, many medical studies are locally confined and cannot utilize data that is available elsewhere.

In light of this, a research collaboration led by Joachim Schultze tested a novel approach for evaluating research data stored in a decentralized fashion. The basis for this was the still young “Swarm Learning” technology developed by HPE. In addition to the IT company, numerous research institutions from Greece, the Netherlands and Germany - including members of the “German Covid-19 OMICS Initiative” (DeCOI) - participated in this study.

Swarm Learning combines a special kind of information exchange across different nodes of a network with methods from the toolbox of “machine learning”, a branch of artificial intelligence (AI). At the heart of machine learning are algorithms that are trained on data to detect patterns in it and that consequently acquire the ability to recognize the learned patterns in other data as well. “Swarm Learning opens up new opportunities for collaboration in medical research, as well as in business. The key is that all participants can learn from each other without having to share confidential data,” said Dr. Eng Lim Goh, Senior Vice President and Chief Technology Officer for artificial intelligence at HPE. In fact, with Swarm Learning, all research data remains on site. Only algorithms and parameters are shared - in a sense, lessons learned. “Swarm Learning fulfills the requirements of data protection in a natural way,” Joachim Schultze emphasized.

Unlike “federated learning”, in which the data also remains local, there is no centralized command center, the Bonn scientist explained. “Swarm Learning happens in a cooperative way based on rules that all partners have agreed on in advance. This set of rules is captured in a blockchain.” This is a kind of digital protocol that regulates information exchange between the partners in a binding manner. It documents all events, and all parties have access to it. “The blockchain is the backbone of Swarm Learning,” Schultze said. “All members of the swarm have equal rights. There is no central power over what happens and over the results. So there is, in a sense, no spider controlling the data web.”
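To make the idea of a shared protocol that documents all events more concrete, here is a minimal Python sketch of a hash-chained log. It is an illustration only: the article does not describe how the blockchain used in Swarm Learning is built, the `Ledger` class, its fields and the example events are hypothetical, and a real blockchain would additionally need a consensus mechanism between the partners, which is omitted here.

```python
import hashlib
import json


class Ledger:
    """Append-only log; each entry stores the hash of the previous entry."""

    GENESIS = "0" * 64  # placeholder "previous hash" for the first entry

    def __init__(self):
        self.entries = []

    @staticmethod
    def _digest(index, event, prev_hash):
        # Serialize deterministically so every party computes the same hash.
        payload = json.dumps(
            {"index": index, "event": event, "prev": prev_hash}, sort_keys=True
        )
        return hashlib.sha256(payload.encode("utf-8")).hexdigest()

    def append(self, event):
        prev_hash = self.entries[-1]["hash"] if self.entries else self.GENESIS
        index = len(self.entries)
        entry = {
            "index": index,
            "event": event,
            "prev": prev_hash,
            "hash": self._digest(index, event, prev_hash),
        }
        self.entries.append(entry)
        return entry

    def verify(self):
        """Return True only if no entry was altered, removed or reordered."""
        prev_hash = self.GENESIS
        for i, entry in enumerate(self.entries):
            if entry["index"] != i or entry["prev"] != prev_hash:
                return False
            if entry["hash"] != self._digest(i, entry["event"], prev_hash):
                return False
            prev_hash = entry["hash"]
        return True


# Hypothetical events recorded by a small swarm.
ledger = Ledger()
ledger.append({"node": "A", "action": "joined"})
ledger.append({"node": "B", "action": "joined"})
ledger.append({"node": "A", "action": "shared parameters", "round": 1})
print(ledger.verify())  # True

# Any later change to a recorded event is detected by every party.
ledger.entries[1]["event"]["node"] = "X"
print(ledger.verify())  # False
```

Because each entry depends on the hash of its predecessor, every partner holding a copy can check the full history independently, without relying on a central authority.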

Thus, the AI algorithms learn locally, namely on the basis of the data available at each network node. The learning outcomes of each node are collected as parameters through the blockchain and smartly processed by the system. The outcome, i.e. the optimized parameters, is passed on to all parties. This process is repeated multiple times, gradually improving the algorithms’ ability to recognize patterns at each node of the network.
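The cycle of local learning, merging and redistribution can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the method used in the study: the one-dimensional linear model, the gradient-descent update, the unweighted averaging used as the merge step, the synthetic data and all names (`local_update`, `merge_parameters`, `swarm_round`) are hypothetical, and the blockchain coordination is left out.

```python
import random


def local_update(weights, data, lr=0.1, epochs=5):
    """Train y = w*x + b on one node's private data; only (w, b) leave the node."""
    w, b = weights
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in data:
            err = (w * x + b) - y
            grad_w += 2 * err * x / len(data)
            grad_b += 2 * err / len(data)
        w -= lr * grad_w
        b -= lr * grad_b
    return (w, b)


def merge_parameters(param_list):
    """Merge step (assumption: plain average of each parameter across nodes)."""
    n = len(param_list)
    return tuple(sum(p[i] for p in param_list) / n for i in range(len(param_list[0])))


def swarm_round(weights, node_datasets):
    """One round: every node learns locally, then the merged result is shared."""
    local_params = [local_update(weights, data) for data in node_datasets]
    return merge_parameters(local_params)


def make_node_data(rng, n=50):
    """Synthetic private data following y = 2x + 1 plus a little noise."""
    xs = [rng.uniform(-1, 1) for _ in range(n)]
    return [(x, 2 * x + 1 + rng.gauss(0, 0.05)) for x in xs]


rng = random.Random(0)
nodes = [make_node_data(rng) for _ in range(3)]  # three nodes, data never pooled

weights = (0.0, 0.0)
for _ in range(50):  # the process is repeated multiple times
    weights = swarm_round(weights, nodes)

print(weights)  # close to (2, 1), the relationship hidden in the data
```

Note that the raw `(x, y)` pairs never leave `local_update`; only the two parameters are exchanged between nodes.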

The illustration compares the different learning concepts: a) local learning with data and computation at different, disconnected locations; b) cloud-based machine learning; c) federated learning, with data being kept with the data contributor and computing performed at the site of local data storage and availability, but parameter settings orchestrated by a central parameter server; d) Swarm Learning without the need for a central custodian.

Image source: Warnat-Herresthal et al., Nature 2021 (CC BY 4.0)

The researchers are now providing practical proof of this approach through the analysis of X-ray images of the lungs and of transcriptomes: The latter are data on the gene activity of cells. In the current study, the focus was specifically on immune cells circulating in the blood - in other words, white blood cells. “Data on the gene activity of blood cells are like a molecular fingerprint. They hold important information about how the organism reacts to a disease,” Schultze said. “Transcriptomes are available in large numbers just like X-ray images, and they are highly complex. This is exactly the kind of information you need for artificial intelligence analysis. Such data is perfect for testing Swarm Learning.”

The research team addressed a total of four infectious and non-infectious diseases: two variants of blood cancer (acute myeloid leukemia and acute lymphoblastic leukemia), as well as tuberculosis and Covid-19. The data included a total of more than 16,000 transcriptomes. The swarm learning network over which the data were distributed typically consisted of at least three and up to 32 nodes. Independently of the transcriptomes, the researchers analyzed about 100,000 chest X-ray images. These were from patients with fluid accumulation in the lung or other pathological findings as well as from individuals without anomalies. These data were distributed across three different nodes.

The analysis of both the transcriptomes and the X-ray images followed the same principle: First, the researchers fed their algorithms with subsets of the respective data set. This included information about which of the samples came from patients and which from individuals without findings. The learned pattern recognition for “sick” or “healthy” was then used to classify further data, in other words it was used to sort the data into samples with or without disease. The accuracy, i.e. the ability of the algorithms to distinguish between healthy and diseased individuals, was around 90 percent on average for the transcriptomes (each of the four diseases was evaluated separately); in the case of the X-ray data, it ranged from 76 to 86 percent.
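As a reminder of what such an accuracy figure measures, the short Python sketch below computes it for a handful of labels. The `accuracy` helper and the example labels are made up for illustration and are not taken from the study.

```python
def accuracy(true_labels, predicted_labels):
    """Fraction of samples whose predicted label matches the known label."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("at least one sample is required")
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


# Hypothetical held-out samples: known status vs. the classifier's output.
known = ["sick", "sick", "healthy", "healthy", "sick",
         "healthy", "healthy", "sick", "healthy", "sick"]
predicted = ["sick", "sick", "healthy", "sick", "sick",
             "healthy", "healthy", "sick", "healthy", "healthy"]

print(accuracy(known, predicted))  # 0.8, i.e. 8 of 10 samples classified correctly
```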

“The methodology worked best in leukemia. In this disease, the signature of gene activity is particularly striking and thus easiest for artificial intelligence to detect. Infectious diseases are more variable. Nevertheless, the accuracy was also very high for tuberculosis and Covid-19. For X-ray data, the rate was somewhat lower, which is due to the lower data or image quality,” Schultze commented on the results. “Our study thus proves that Swarm Learning can be successfully applied to very different data. In principle, this applies to any type of information for which pattern recognition by means of artificial intelligence is useful. Be it genome data, X-ray images, data from brain imaging or other complex data.”

The study also found that Swarm Learning yielded significantly better results than when the nodes in the network learned separately. “Each node benefits from the experience of the other nodes, although only local data is ever available. The concept of Swarm Learning has thus passed the practical test,” Schultze said. “I am convinced that Swarm Learning can give a huge boost to medical research and other data-driven disciplines. The current study was just a test run. In the future, we intend to apply this technology to Alzheimer’s and other neurodegenerative diseases. Swarm Learning has the potential to be a real game changer and could help make the wealth of experience in medicine more accessible worldwide. Not only research institutions but also hospitals, for example, could join together to form such swarms and thus share information for mutual benefit.”


Source: German Center for Neurodegenerative Diseases

27.05.2021
