Skip to Main Content
 

Global Search Box

 
 
 

ETD Abstract Container

Abstract Header

New insights into the relationships between the rumen microbiome and animal production traits learned from bioinformatics and machine learning analyses – estimation of growth rate and development of new prediction models for methane emissions and milk production traits from meta-omic data

Abstract Details

, Doctor of Philosophy, Ohio State University, Animal Sciences.
Feed efficiency, greenhouse gas (GHG) emissions, and animal productivities are major challenges facing the animal industry, especially the ruminant livestock industry. The rumen microbiome has attracted tremendous research interests because it is the major producer of the nutrients that can be assimilated by ruminants and contributor of methane production, and because it affects both feed efficiency and product (meat and milk) quality. In the past decade, meta-omic studies of the rumen microbiome produced huge amount of data about rumen microbiome and its association with feed efficiency, animal productivity, and methane emissions. The newly available data provide new opportunities to address the above challenges facing the ruminant livestock industry. However, the usage of such data was limited mainly because of their high dimensions. The developments of methods and algorithms of bioinformatics and machine learning (ML) provide new tools to analyze the rumen microbiome data beyond correlation analyses. Thus, in the studies presented in this dissertation, we explored ML methods coupled with bioinformatics analysis in investigating the quantitative relationship between relative sequence abundance of members of the rumen microbiome and several animal productivity traits and gain new insights into the roles of rumen microbiome in affecting ruminant production, with a focus on methane emissions from sheep and milk production traits of Holstein cows. The overall objectives of our studies were to investigate the quantitative relationship of members of the rumen microbiome with methane emissions and animal productivity and identify potential biomarkers and prediction models of the above animal traits. The abundance of some rumen microbes has been found to be associated with methane emissions from ruminants. We hypothesized that the growth rates, which are directly determined by metabolic activities and physiological features, of some rumen microbes might also be associated with methane emissions. In the first study, we tested the above hypothesis by estimating the growth rates of rumen bacteria that were represented by peak-to-trough ratio, which is the ratio between the ori region and the ter region, of metagenome-assembled genomes (MAGs) and examining the correlation between growth rate and methane emissions. The growth rates of 21 bacterial species (represented by MAGs) taxonomically classified to 12 genera in two groups of sheep differing in methane yield (high vs. low). Faecalibacterium prausnitzii was the only bacterial species with significantly different growth rates between the two groups of sheep. Examining the association between the relative abundance of rumen bacteria and methane emissions, we found 79 genera or species of rumen bacteria whose relative abundances differed between the high- and low-methane yield sheep. Lactate-producing bacteria (i.e., Intestinibaculum porci) and lactate-utilizing bacteria (i.e., Megasphaera elsdenii) were enriched in the rumen of the low-methane yield sheep. Additionally, we compared the expression of the genes (the transcripts) that were mapped to the genomes of F. prausnitzii using metatranscriptomic data. The expressions of the genes involved in energy metabolism of F. prausnitzii were upregulated, including those involved in butyrate production, in the rumen of the low-methane yield sheep. This finding suggests that butyrate production by F. prausnitzii might be promoted by lactate-metabolizing bacteria. Thus, the growth rate and expression of the genes involved in energy metabolism of F. prausnitzii may be useful biomarkers of methane emissions from sheep. The results of this study suggest that growth rates of individual rumen bacteria can be estimated from metagenomic sequence data, and growth rates of some rumen bacteria may affect methane emissions from ruminants. We also examined the correlation between methane emissions and the relative abundance of MAGs or gene expressions of some lactate-metabolizing bacteria. We revealed negative associations between the growth rate of F. prausnitzii in the rumen and methane yield in sheep and between the growth rate of F. prausnitzii and the expression of its genes involved in energy metabolism including butyrate production. We also proposed a working model to explain the promoted butyrate productions and enriched lactate-metabolizing bacteria in the rumen of low-methane yield sheep. In the second study, we developed new prediction models to improve prediction of methane emissions from sheep by including metataxonomic data of the rumen microbiota, in addition to animal-related data and rumen fermentation characteristics data. To overcome high-dimensionality of the metataxonomics data, we used supervised ML methods, including random forest combined with boosting (RF-B), least absolute shrinkage and selection operator (LASSO), generalized linear mixed model with LASSO (glmmLasso), and smoothly clipped absolute deviation (SCAD), to make variable selection and model fitting. We successfully developed one prediction model each, which, beside animal predictor variables, included genera (known genera such as Megasphaera, Oribacterium Selenomonas, Moryella, Prevotella_7, and unclassified candidate genera) of microbes as predictor variables, for prediction of methane production (amount of methane per animal per day) and methane yield (amount pf methane per kg of dry matter intake). Evaluation showed that both the animal-based prediction model and dry matter intake-based prediction model had considerable decreases in root mean square prediction error (RMSPE), mean absolute error (MAE), and Bayesian information criterion (BIC), and an increase in Lin’s concordance correlation coefficient (CCC) when compared to the models that only contained animal predictor variables. The new models also resulted in improved correlation between the observed and the predicted methane emissions. Our results demonstrated that the inclusion of rumen microbes as predictor variables, besides animal- and feed-related variables, could improve the accuracy and robustness of methane prediction models, and rumen microbiome data should be included in prediction models of methane emissions from ruminants. The metatranscriptomic analysis revealed that the gene expressions of Megasphaera, including those involved in propionate and butyrate production, were upregulated in low methane sheep. All the gene expressions of Oribacterium and Selenomonas showed slight tendency in two groups of sheep. In summary, the second aim revealed the inclusion of the relative abundance of some rumen microbes at genera-level using ML improved prediction of methane emissions. The important biomarkers selected by ML such as Megasphaera were interpreted based on upregulated expression of energy-related genes. Besides methane emissions, milk production can also be associated with rumen microbiome because it produces many of the precursors of milk production and affects feed utilization efficiency. Thus, a third study was dedicated to developing new models to predict milk production traits by including rumen microbes as predictor variables. We collated data from one study involving 334 lactating Holstein cows fed the same diet, which contained animal data, rumen fermentation characteristics data, and metataxonomic data of the rumen microbiota. We evaluated multiple ML methods in developing models to predict four milk production traits (milk yield, milk fat content, milk protein content, and milk lactose content). Amplicon sequence variants (ASVs) of bacteria as predictor variables resulted in better models than known genera or all genera of bacteria. For each milk production trait, we obtained 47 models by fitting the data using the ML methods (14 in total). The best prediction model for each milk production trait was selected based on evaluation of the prediction performance and fitting performance. The best performing models included animal variables, rumen fermentation characteristics variables, and ASVs as predictor variables. As evaluated based on RMSPE, MAE, CCC, Akaike information criterion, and adjusted R-squared, these new models improved prediction accuracy and reduced biases compared to the models that contained only animal variables or animals and rumen fermentation characteristics variables. The predictions of metagenome-functions ASV were then made to explore their functions in rumen. By comparing with ASV negatively associated with milk yield, the abundances of genes with functions related to biosynthesis of products such as amino acid, B vitamins, and lipopolysaccharide were more abundant in ASV positively associated with milk yield. Also, ASV positively associated with milk lactose had more genes related to starch and sucrose metabolism than that in ASV negatively associated with milk lactose. Analysis of the predicted functions of the ASVs included in the prediction models showed that the rumen microbiome may affect the host performance by affecting sugar metabolism and biosynthesis of amino acids and B vitamins. The overall results of this dissertation provide more insights into the biomarkers in rumen microbiome and their functions related to ruminant productions than was previously studied. Future research is warranted to explore the associations of identified bacteria, especially with respect to their growth, gene abundance and gene transcriptions, with methane emissions or milk productions from ruminants.
Zhongtang Yu (Advisor)
Maurice Eastridge (Committee Member)
Jeffrey Firkins (Committee Member)
Shili Lin (Advisor)

Recommended Citations

Citations

  • Zhang, B. (2022). New insights into the relationships between the rumen microbiome and animal production traits learned from bioinformatics and machine learning analyses – estimation of growth rate and development of new prediction models for methane emissions and milk production traits from meta-omic data [Doctoral dissertation, Ohio State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=osu1650561854704611

    APA Style (7th edition)

  • Zhang, Boyang. New insights into the relationships between the rumen microbiome and animal production traits learned from bioinformatics and machine learning analyses – estimation of growth rate and development of new prediction models for methane emissions and milk production traits from meta-omic data. 2022. Ohio State University, Doctoral dissertation. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=osu1650561854704611.

    MLA Style (8th edition)

  • Zhang, Boyang. "New insights into the relationships between the rumen microbiome and animal production traits learned from bioinformatics and machine learning analyses – estimation of growth rate and development of new prediction models for methane emissions and milk production traits from meta-omic data." Doctoral dissertation, Ohio State University, 2022. http://rave.ohiolink.edu/etdc/view?acc_num=osu1650561854704611

    Chicago Manual of Style (17th edition)