UGA researchers Xiangyu Deng (shown) and Shaokang Zhang led a team of...
UGA researchers Xiangyu Deng (shown) and Shaokang Zhang led a team of scientists who have trained an algorithm called Random Forest to predict certain animal sources of S. Typhimurium genomes.

Source: University of Georgia

News • Foodborne illnesses

Machine learning gets to the source of Salmonella

A team of scientists led by researchers at the University of Georgia Center for Food Safety in Griffin has developed a machine-learning approach that could lead to quicker identification of the animal source of certain Salmonella outbreaks.

In the research, published in Emerging Infectious Diseases, Xiangyu Deng and his colleagues used more than a thousand genomes to predict the animal sources, especially livestock, of Salmonella Typhimurium. Deng, an assistant professor of food microbiology at the center, and Shaokang Zhang, a postdoctoral associate with the center, led the project, which also included experts from the Centers for Disease Control and Prevention, the U.S. Food and Drug Administration, the Minnesota Department of Health and the Translational Genomics Research Institute.

Photo
Color-enhanced scanning electron micrograph showing Salmonella typhimurium (red) invading cultured human cells

Source: National Institutes of Health, SalmonellaNIAID, Details auf Wikimedia Commons

According to the Foodborne Disease Outbreak Surveillance System, close to 3,000 outbreaks of foodborne illness were reported in the U.S. from 2009 to 2015. Of those, 900 — or 30 percent — were caused by different serotypes of Salmonella, including Typhimurium, Deng said. “We had at least three outbreaks of Typhimuirum, or its close variant, in 2018. These outbreaks were linked to chicken, chicken salad and dried coconut,” he said. “There are more than 2,600 serotypes of Salmonella, and Typhimurium is just one of them, but since the 1960s, about a quarter of Salmonella isolates linked to outbreaks reported to U.S. national surveillance are Typhimurium.”

Into the 'random forest'

The researchers trained the “machine,” an algorithm called Random Forest, with more than 1,300 S. Typhimurium genomes with known sources. After the training, the “machine” learned how to predict certain animal sources of S. Typhimurium genomes. For this study, the scientists usedSalmonella Typhimurium genomes from three major surveillance and monitoring programs: the CDC’s PulseNet network; the FDA’s GenomeTrakr database of sources in the United States, Europe, South America, Asia and Africa; and retail meat isolates from the FDA arm of the National Antimicrobial Resistance Monitoring System. “With so many genomes, machine learning is a natural choice to deal with all these data. We used this big collection of Typhimurium genomes as the training set to build the classifier,” said Deng who was awarded the UGA Creative Research Medal  in 2017 for his work in this area. “The classifier predicts the source of the Typhimurium isolate by interrogating thousands of genetic features of its genome.”

Recommended article

Photo

Article • AI structure

Machine Learning: into the pumpkin patch

Standardised and well-structured data, as well as the definition of clear objectives, are indispensable prerequisites for artificial intelligence implementation into clinical processes. ‘Ask not what artificial intelligence can do for you but what you can do for artificial intelligence!’ This variation on John F Kennedy’s famous quote comes from Dr Ben Glocker, Senior Lecturer in Medical…

Overall, the system predicted the animal source of the S. Typhimurium with 83 percent accuracy.The classifier performed best in predicting poultry and swine sources, followed by bovine and wild bird sources. The machine also detects whether its prediction is precise or imprecise. When the prediction was precise, the machine was accurate about 92 percent of the time, Deng said. “We retrospectively analyzed eight of the major zoonotic outbreaks that occurred in the U.S. from 1998 to 2013,” he said. “The classifier attributed seven of them to the correct livestock source. I’d call this approach a proof of concept. It will get better as more genomes from various sources become available”.

In tweets about the study, Frank Yiannas, deputy director of the FDA, called the machine learning of whole genome sequences project “a new era of smarter food safety and epidemiology.” To the average person, the success of this project means strains of Salmonella Typhimurium could be traced back to the source faster. Identifying what causes a foodborne illness outbreak is key to stopping it and preventing further illnesses. “Using our method, investigators can better link cases of the same outbreak and better match isolates from food or food processing environments to isolates from sick people,” he said. “This will give investigators more confidence to implicate a specific source that is behind the outbreak.”


Source: University of Georgia

12.02.2019

Read all latest stories

Related articles

Photo

News • Infection control

Legionnaire's disease: genetic analysis key to understanding risk

Routine sampling of water supplies and genomic sequencing of Legionella bacteria could play a key role in identifying the source of Legionnaires’ disease outbreaks, research suggests.

Photo

News • Protective coating

Researchers develop spray-on 'Covid shield'

A sprayable coating that can prevent the surface spread of infection from bacteria and viruses – including Covid-19 – over a sustained period has been developed by Australian researchers.

Photo

News • Breakthrough against C. diff

New Clostridioides difficile vaccine on the horizon

Researchers at the University of Exeter first identified a gene in the 'hospital bug' Clostridioides difficile responsible for producing a protein that aids in binding the bacteria to the gut of its…

Related products

Subscribe to Newsletter