A Match Made in Landfills?
Exploring the diversity and burden of antimicrobial resistance genes carried by white stork (Ciconia ciconia) throughout the breeding season in Madrid, Spain
The following contains a current draft of a data analysis project. The project repository can be found here.
Highlights
Our results suggest that landfills may not be as impactful as previously believed to the emergence and maintenance of AMR in this system.
Storks are most impacted by multi-drug resistance and ARG burden when anthropogenic waste is most heavily consumed later in the breeding season.
This species appears to be a good sentinel for anthropogenic impact on environment.
Future efforts in stork AMR research should focus on exploring the relationship between other anthropogenic environments (e.g., agricultural pastures) and health.
Abstract
Anthropogenic environments are hotspots for the emergence and maintenance of antimicrobial resistance. Agricultural pastures and landfills are of particular interest due to their complex microbial communities and abundant wildlife visitation, which could facilitate the exchange of antimicrobial resistance genes (ARGs) via horizontal transfer. Wild birds that occupy these environments may become both reservoirs and transporters of ARGs. The White Stork is a highly urbanized wading bird that has significantly changed its ecology due to shifts in Spanish waste management toward open-air landfills. This species now heavily forages at landfills, which provide abundant food and allow for improved reproductive success. Birds that are dependent on anthropogenic resources, such as storks, provide the ideal opportunity to understand the emergence and spread of AMR. We evaluated the diversity and quantity of ARGs in storks during three periods of the breeding season (defined by distinct foraging strategies). A total of 31 nests at Prado Herrero were sampled between March-July (2020-2021). Fresh feces were collected from 31 nests to evaluate the presence of 23 important ARGs affecting eight antibiotic classes via quantitative PCR. All nests carried multiple ARGs. Over 70% of nests had multi-drug resistance to at least 3 antibiotic classes during at least one time-period. Generalized Linear Mixed Models revealed that increased diversity in antibiotic class resistance, amount of ARGs present in a sample, and multi-drug resistance were associated with increased adult age and decreased landfill use. Our results suggest that landfills may not be contributing significantly to the emergence and maintenance of AMR in this system. Little literature exists on the relationship between stork habitat selection and health outside of landfill use in Spain. Future efforts in stork AMR research should focus on exploring the relationship between agricultural land use and health.
Introduction
General Background Information
Provide enough background on your topic that others can understand the why and how of your analysis
Description of data and data source
We have 126 observations taken over a period of 2 years. We are evaluating for the presence and burden of antimicrobial resistance genes in white stork feces. Codebook is a WIP located in raw data folder.
Questions/Hypotheses to be addressed
Does landfill use by storks increase the likelihood of carrying Antimicrobial resistance (AMR) genes and resistance gene burden?.
Stork that visit the landfill more often will have a higher likelihood of carrying AMR genes and have a higher burden of AMR genes.
Is nest success related to AMR gene burden and multi-drug resistance (i.e., resistance to 3 or more drug classes)?
Stork with more AMR gene burdens and multi-drug resistance will have lower nest success.
To cite other work (important everywhere, but likely happens first in introduction), make sure your references are in the bibtex file specified in the YAML header above (here dataanalysis_template_references.bib
) and have the right bibtex key. Then you can include like this:
Examples of reproducible research projects can for instance be found in [@mckay2020; @mckay2020a]
Methods
Data aquisition
Experimental Design
Study Area — This study took place in Prado Herrero, a private cattle ranch located northwest of metropolitan Madrid and is surrounded by agriculture (e.g. beef cattle, cereal grains, legumes, and forage plants (“Renta Agraria,” 2019)). Prado Herrero is located within a nationally protected area (Cuenca Alta del Manzanares Regional Park) and is just north of Santillana Reservoir. This reservoir that was declared an Important Bird Area by Regional Catalogue of Reservoirs and Wetlands of the Community of Madrid due to the numerous resident and migratory species that utilize this water source . This cattle ranch has supported a productive white stork rookery where storks have been banded and monitored by biologists at UCM for over 20 years <CITE: Aguirre and Atienza>. During the 2020 to 2021 breeding season, between the months of March to June, stork nests were identified, marked, and monitored for productivity. Of marked nests between the 2020 and 2021 breeding seasons, 31 with banded adults were used both years to lay eggs successfully. All 31 nests were located in ash trees (Fraxinus angustifolius ) found within the cattle pasture. Storks that breed within the Prado Herrero rookery are known to utilize Colmenar Viejo Landfill which is located approximately 12km southeast. Colmenar Viejo is an open-air landfill and it is second largest of it’s kind in the Madrid region <CITE: Alejandro 2022>.
Sample Collection — Between March to May 2020 and 2021, we collected feces from marked nests with banded adults in known breeding pairs at 3 points of the breeding season; (1) an adult sample during incubation, (2) an early juvenile sample during the first two weeks of the chicks life when adults are believed to forage on natural sources <CITE: >, and (3) a late juvenile sample after chicks were past two weeks of age when adults forage on anthropogenic resources. Nests were visited in the late mornings and approximately one gram of fresh feces was collected from the perimeter of the nest structure into a sterile Eppendorf tube (). Samples were maintained cold in a portable cooler with frozen gel packs and frozen in a −20°C freezer within 4 hours of collection and processed at a later date.
Ethics statement: All animal handling was authorized by Cumunidad de Madrid: Consejeria de Medio Ambiente, Administracion Local y Ordenacion de Territorio. The permit number is D.N.I. nº 07.239.972-D.
Molecular analysis of ARGs
We performed total DNA extraction directly from fecal samples, by using a pressure filtration technique (QuickGene DNA Tissue Kit S, Fujifilm, Japan) following the manufacturer’s instructions. The 16S rRNA gene was amplified in each DNA sample by real time PCR (rtPCR) in 10-fold dilutions of extracted samples, according to Jiang et al. (2013)
Data import and cleaning (Reference file and give details HERE. Reference a ReadME/doc here)
Write code that reads in the file and cleans it so it’s ready for analysis. Since this will be fairly long code for most datasets, it might be a good idea to have it in one or several R scripts. If that is the case, explain here briefly what kind of cleaning/processing you do, and provide more details and well documented code somewhere (e.g. as supplement in a paper). All materials, including files that contain code, should be commented well so everyone can follow along.
library(tidyverse)
-- Attaching core tidyverse packages ------------------------ tidyverse 2.0.0 --
v dplyr 1.1.2 v readr 2.1.4
v forcats 1.0.0 v stringr 1.5.0
v ggplot2 3.4.1 v tibble 3.2.1
v lubridate 1.9.2 v tidyr 1.3.0
v purrr 1.0.1
-- Conflicts ------------------------------------------ tidyverse_conflicts() --
x dplyr::filter() masks stats::filter()
x dplyr::lag() masks stats::lag()
i Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors
library(here)
Statistical analysis
Presence absence ARG results obtained from the fecal samples between 2020 to 2021 were used for simple summary statistics. Samples were classified as “multiresistant” if they were resistant to three or more of the 8 antibiotic classes that we evaluated for in this study (Blanco-Peña et al., 2017). In addition, we applied the following formula to estimate the percentage of bacteria harboring ARGs: x = 10[2+0.33(ct16S-ctARG)], where x individual percent gene burden in the sample (i.e., the estimated number of copies of the gene present per reaction). Results were expressed in log10, ranging from −8 ( zero to a negligible amount of the bacteria in the sample carried an ARG) to 2 (all or 100% of the bacteria in the sample carried an ARG). The inverse Log10 was then applied to results so they could be totaled and used to evaluate total gene burdens across sampling periods.
Several linear mixed models (LMM) were constructed to evaluate multi-resistance and ARG burden as response variables with nest as a random factor. Covariates considered with each response variable included adult age, adult mean land fill use index (LUI), sample period (as described above), and nest success. Landfill use was quantified by physically observing a banded stork at Colmenar Viejo during weekly visits from March to June in 2021. The LUI was calculated as the number of observations of one particular bird within the total number of visits to landfill per year <CITE Alejandro 2022>. If a banded adult was not seen at the landfill during the breeding season, they were assigned a LUI of 0, suggestive of no resource provisioning at the landfill. All covariates were evaluated for correlation, no covariates were correlated with all the Spearman’s correlation coefficients (rho) < 0.5 and the p > 0.05. All continuous variables (LUI, age, and nest success) were then standardized prior to analysis.
All models were constructed with only 2021 data, as the COVID-19 pandemic prohibited the collection of LUI data in 2020. Models were built and fitted to data using the statistical package tidymodels in Program R (R version 4.2.1, www.r-project.org).
(Note: still trying to figure out how to do this with the performance package) We evaluated model fit for all models using Akaike Information Criterion scores adjusted for small sample sizes (AICc: Burnham and Anderson, 2002). The best fit model was selected by having a ΔAICc < 2._
Results
Exploratory/Descriptive analysis (SUMMARY STATS IN EXPLORATION FILE TBD -> Burden/MDR associated with each class. summary table? with actual summary stats)
Use a combination of text/tables/figures to explore and describe your data. Show the most important descriptive results here. Additional ones should go in the supplement. Even more can be in the R and Quarto files that are part of your project.
Basic statistical analysis
To get some further insight into your data, if reasonable you could compute simple statistics (e.g. simple models with 1 predictor) to look for associations between your outcome(s) and each individual predictor variable. Though note that unless you pre-specified the outcome and main exposure, any “p<0.05 means statistical significance” interpretation is not valid.
Full analysis
Use one or several suitable statistical/machine learning methods to analyze your data and to produce meaningful figures, tables, etc. This might again be code that is best placed in one or several separate R scripts that need to be well documented. You want the code to produce figures and data ready for display as tables, and save those. Then you load them here.
Binomial generalized linear mixed models (GLMMs) were used to predict multi-drug resistance (MDR) in White Storks, with nest identification classified as a random effects parameter. Of these, the highest performing model contained the predictors landfill use index (LUI) and age (@Table1). In this model, an increase in the age of the bird was associated with a higher likelihood of MDR being present (@Table2), while an increase in LUI was found to be associated with a decreased likelihood of MDR presence. A second competitive model, with a 97.20% performance score, contained only age as the predictor.
Linear mixed models (LMMs) were used to predict total antimicrobial gene burden in White Storks, with nest identification again classified as a random effects parameter. The global model, containing the variables landfill use index, age, sample period, and nest success, was the highest performing model (@Table3). In this model, each variable was positively correlated with antimicrobial gene burden (@Table4). The next highest performing model contained age as a single predictor, with a model performance score of 86.39%.
Name | Model | AICc_wt | Performance_Score |
---|---|---|---|
glmer_fit_11 | _glmerMod | 0.3254465 | 1.0000000 |
glmer_fit_2 | _glmerMod | 0.3163397 | 0.9720177 |
glmer_fit_12 | _glmerMod | 0.1173421 | 0.3605572 |
glmer_fit_global | _glmerMod | 0.0962681 | 0.2958031 |
glmer_fit_10 | _glmerMod | 0.0561710 | 0.1725966 |
glmer_fit_9 | _glmerMod | 0.0256592 | 0.0788428 |
glmer_fit_8 | _glmerMod | 0.0251523 | 0.0772854 |
glmer_fit_1 | _glmerMod | 0.0192742 | 0.0592237 |
glmer_fit_5 | _glmerMod | 0.0112575 | 0.0345909 |
glmer_fit_6 | _glmerMod | 0.0070892 | 0.0217830 |
glmer_fit_null | _glmerMod | 0.0000001 | 0.0000002 |
glmer_fit_3 | _glmerMod | 0.0000001 | 0.0000001 |
glmer_fit_4 | _glmerMod | 0.0000000 | 0.0000000 |
glmer_fit_7 | _glmerMod | 0.0000000 | 0.0000000 |
effect | group | term | estimate | std.error | statistic | p.value |
---|---|---|---|---|---|---|
fixed | NA | (Intercept) | -0.0383609 | 0.3528408 | -0.1087202 | 0.9134244 |
fixed | NA | s.lui | -0.5784180 | 0.3953718 | -1.4629724 | 0.1434749 |
fixed | NA | s.age | 0.0439471 | 0.3558834 | 0.1234873 | 0.9017212 |
ran_pars | nes | sd__(Intercept) | 0.7088240 | NA | NA | NA |
Small sample adjusted Akaike's Information Criteria (AICc) model weights and performance scores for binomial generalized linear mixed models predicting multi-drug resistance in White Storks (Ciconia ciconia) in Madrid, Spain.
Name | Model | AICc_wt | Performance_Score |
---|---|---|---|
lmer_fit2_global | _lmerMod | 0.2643531 | 1.0000000 |
lmer_fit2_2 | _lmerMod | 0.2283685 | 0.8638768 |
lmer_fit2_11 | _lmerMod | 0.1340166 | 0.5069607 |
lmer_fit2_12 | _lmerMod | 0.1128365 | 0.4268403 |
lmer_fit2_9 | _lmerMod | 0.1112876 | 0.4209809 |
lmer_fit2_10 | _lmerMod | 0.0818244 | 0.3095268 |
lmer_fit2_8 | _lmerMod | 0.0673134 | 0.2546344 |
lmer_fit2_5 | _lmerMod | 0.0000000 | 0.0000000 |
lmer_fit2_1 | _lmerMod | 0.0000000 | 0.0000000 |
lmer_fit2_6 | _lmerMod | 0.0000000 | 0.0000000 |
lmer_fit2_null | _lmerMod | 0.0000000 | 0.0000000 |
lmer_fit2_3 | _lmerMod | 0.0000000 | 0.0000000 |
lmer_fit2_4 | _lmerMod | 0.0000000 | 0.0000000 |
lmer_fit2_7 | _lmerMod | 0.0000000 | 0.0000000 |
effect | group | term | estimate | std.error | statistic |
---|---|---|---|---|---|
fixed | NA | (Intercept) | -4.302130 | 14.817887 | -0.2903336 |
fixed | NA | s.lui | 4.962357 | 5.647220 | 0.8787258 |
fixed | NA | s.age | 11.437368 | 5.663061 | 2.0196440 |
fixed | NA | samp | 13.112677 | 6.721352 | 1.9508989 |
fixed | NA | s.nsuccess | 12.422550 | 8.636042 | 1.4384541 |
ran_pars | nes | sd__(Intercept) | 0.000000 | NA | NA |
ran_pars | Residual | sd__Observation | 38.049645 | NA | NA |
Small sample adjusted Akaike's Information Criteria (AICc) model weights and performance scores for linear mixed models predicting total antimicrobial gene burden in White Storks (Ciconia ciconia) in Madrid, Spain.
Discussion
Summary and Interpretation
As multi-drug resistance and class diversity increase (similar things I know) throughout the breeding season (compounding effect likely), mean LUI decreases and nest success decreases (makes sense because in your prior papers you have found increase nest success with increased LUI).
Resistance gene burden appears to increase as mean LUI use and age increase. o Most of the burden is due to blaTEM, a common resistance gene associated with anthropogenic impact. Sampling period does not appear to explain burden, but the top blaTEM model did show a trend in burden increasing from the 2nd sampling period to the 3rd sampling period and age.
Strengths and Limitations
- Not much statistically significant data, thus we may have to argue biological significance.
Conclusions
LUI appears to be correlated with higher levels of AMR gene burden in storks. As LUI increases thorough the breeding season (Bialas et al 2021) resistance gene burden also increases with beta lactam resistance contributing to the majority of the burden. However, multidrug resistance appears to decrease as LUI increases, thus it is likely that storks are being exposed to antimicrobial resistance genes at other foraging areas (urban centers, agricultural pastures, etc.). Our results suggest that landfills may not be contributing significantly to the emergence and maintenance of AMR in this system. Little literature exists on the relationship between stork habitat selection and health outside of landfill use in Spain. Future efforts in stork AMR research should focus on exploring the relationship between agricultural land use and health.
What are the main take-home messages?
Include citations in your Rmd file using bibtex, the list of references will automatically be placed at the end
This paper [@leek2015] discusses types of analyses.
These papers [@mckay2020; @mckay2020a] are good examples of papers published using a fully reproducible setup similar to the one shown in this template.
Note that this cited reference will show up at the end of the document, the reference formatting is determined by the CSL file specified in the YAML header. Many more style files for almost any journal are available. You also specify the location of your bibtex reference file in the YAML. You can call your reference file anything you like, I just used the generic word references.bib
but giving it a more descriptive name is probably better.