A Match Made in Landfills?

Exploring the diversity and burden of antimicrobial resistance genes carried by white stork (Ciconia ciconia) throughout the breeding season in Madrid, Spain

Author

Raquel Francisco and Seth Latter

Published

April 25, 2023

The following contains a current draft of a data analysis project. The project repository can be found here.

Highlights

  • Our results suggest that landfills may not be as impactful as previously believed to the emergence and maintenance of AMR in this system.

  • Storks are most impacted by multi-drug resistance and ARG burden when anthropogenic waste is most heavily consumed later in the breeding season.

  • This species appears to be a good sentinel for anthropogenic impact on environment.

  • Future efforts in stork AMR research should focus on exploring the relationship between other anthropogenic environments (e.g., agricultural pastures) and health.

Abstract

Anthropogenic environments are hotspots for the emergence and maintenance of antimicrobial resistance. Agricultural pastures and landfills are of particular interest due to their complex microbial communities and abundant wildlife visitation, which could facilitate the exchange of antimicrobial resistance genes (ARGs) via horizontal transfer. Wild birds that occupy these environments may become both reservoirs and transporters of ARGs. The White Stork is a highly urbanized wading bird that has significantly changed its ecology due to shifts in Spanish waste management toward open-air landfills. This species now heavily forages at landfills, which provide abundant food and allow for improved reproductive success. Birds that are dependent on anthropogenic resources, such as storks, provide the ideal opportunity to understand the emergence and spread of AMR. We evaluated the diversity and quantity of ARGs in storks during three periods of the breeding season (defined by distinct foraging strategies). A total of 31 nests at Prado Herrero were sampled between March-July (2020-2021). Fresh feces were collected from 31 nests to evaluate the presence of 23 important ARGs affecting eight antibiotic classes via quantitative PCR. All nests carried multiple ARGs. Over 70% of nests had multi-drug resistance to at least 3 antibiotic classes during at least one time-period. Generalized Linear Mixed Models revealed that increased diversity in antibiotic class resistance, amount of ARGs present in a sample, and multi-drug resistance were associated with increased adult age and decreased landfill use. Our results suggest that landfills may not be contributing significantly to the emergence and maintenance of AMR in this system. Little literature exists on the relationship between stork habitat selection and health outside of landfill use in Spain. Future efforts in stork AMR research should focus on exploring the relationship between agricultural land use and health.

Introduction

General Background Information

Provide enough background on your topic that others can understand the why and how of your analysis

Description of data and data source

We have 126 observations taken over a period of 2 years. We are evaluating for the presence and burden of antimicrobial resistance genes in white stork feces. Codebook is a WIP located in raw data folder.

Questions/Hypotheses to be addressed

Does landfill use by storks increase the likelihood of carrying Antimicrobial resistance (AMR) genes and resistance gene burden?.

Stork that visit the landfill more often will have a higher likelihood of carrying AMR genes and have a higher burden of AMR genes.

Is nest success related to AMR gene burden and multi-drug resistance (i.e., resistance to 3 or more drug classes)?

Stork with more AMR gene burdens and multi-drug resistance will have lower nest success.

To cite other work (important everywhere, but likely happens first in introduction), make sure your references are in the bibtex file specified in the YAML header above (here dataanalysis_template_references.bib) and have the right bibtex key. Then you can include like this:

Examples of reproducible research projects can for instance be found in [@mckay2020; @mckay2020a]

Methods

Data aquisition

Experimental Design

Study Area — This study took place in Prado Herrero, a private cattle ranch located northwest of metropolitan Madrid and is surrounded by agriculture (e.g. beef cattle, cereal grains, legumes, and forage plants (“Renta Agraria,” 2019)). Prado Herrero is located within a nationally protected area (Cuenca Alta del Manzanares Regional Park) and is just north of Santillana Reservoir. This reservoir that was declared an Important Bird Area by Regional Catalogue of Reservoirs and Wetlands of the Community of Madrid due to the numerous resident and migratory species that utilize this water source . This cattle ranch has supported a productive white stork rookery where storks have been banded and monitored by biologists at UCM for over 20 years <CITE: Aguirre and Atienza>. During the 2020 to 2021 breeding season, between the months of March to June, stork nests were identified, marked, and monitored for productivity. Of marked nests between the 2020 and 2021 breeding seasons, 31 with banded adults were used both years to lay eggs successfully. All 31 nests were located in ash trees (Fraxinus angustifolius ) found within the cattle pasture. Storks that breed within the Prado Herrero rookery are known to utilize Colmenar Viejo Landfill which is located approximately 12km southeast. Colmenar Viejo is an open-air landfill and it is second largest of it’s kind in the Madrid region <CITE: Alejandro 2022>.

Sample Collection — Between March to May 2020 and 2021, we collected feces from marked nests with banded adults in known breeding pairs at 3 points of the breeding season; (1) an adult sample during incubation, (2) an early juvenile sample during the first two weeks of the chicks life when adults are believed to forage on natural sources <CITE: >, and (3) a late juvenile sample after chicks were past two weeks of age when adults forage on anthropogenic resources. Nests were visited in the late mornings and approximately one gram of fresh feces was collected from the perimeter of the nest structure into a sterile Eppendorf tube (). Samples were maintained cold in a portable cooler with frozen gel packs and frozen in a −20°C freezer within 4 hours of collection and processed at a later date.

Ethics statement: All animal handling was authorized by Cumunidad de Madrid: Consejeria de Medio Ambiente, Administracion Local y Ordenacion de Territorio. The permit number is D.N.I. nº 07.239.972-D.

Molecular analysis of ARGs

We performed total DNA extraction directly from fecal samples, by using a pressure filtration technique (QuickGene DNA Tissue Kit S, Fujifilm, Japan) following the manufacturer’s instructions. The 16S rRNA gene was amplified in each DNA sample by real time PCR (rtPCR) in 10-fold dilutions of extracted samples, according to Jiang et al. (2013). A DNA sample was considered validated when a ten-fold dilution showed a cycle threshold (Ct) less than 25 (Esperón et al., 2018). Once validated, we analyzed DNA samples by with a panel of 21 different ARGs encoding resistance to eight different antimicrobial classes: tetracyclines (tet(A), tet(B), tet(Y), tet(K), tet(M), tet(Q), tet(S), and tet(W)), sulfonamides (sulI and sulII), aminoglycosides (str and aadA), phenicols (catI and catII), macrolides (ermB and ermF), quinolones (qnrS and qnrB), betalactams (blaTEM and mecA), and polymyxins (mcr-1). All rtPCR reactions utilized premade gelled format 96-well plates (Biotools, B &M Labs, S.A., Madrid, Spain), with the exception of blaTEM and mecA genes which used the Sybr GreenTM and TaqManTM probe, respectively. Our thermal cycle was the same for all the rtPCR reactions [6′ 95 °C, 40× (10″ 95 °C, 30″ 60 °C)], with alignment and extension in the same step, at constant temperature of 60 °C. A melting curve step was performed at the end of the qPCR reaction to validate the authenticity of the positive (Nieto-Claudin et al., 2019). We quantified the relative burden of each gene for each sample via the cycle threshold (Ct) for the 16S rRNA and the Ct value of the individual ARG using a previously published formula (Esperón et al., 2020).

Data import and cleaning (Reference file and give details HERE. Reference a ReadME/doc here)

Write code that reads in the file and cleans it so it’s ready for analysis. Since this will be fairly long code for most datasets, it might be a good idea to have it in one or several R scripts. If that is the case, explain here briefly what kind of cleaning/processing you do, and provide more details and well documented code somewhere (e.g. as supplement in a paper). All materials, including files that contain code, should be commented well so everyone can follow along.

library(tidyverse)
-- Attaching core tidyverse packages ------------------------ tidyverse 2.0.0 --
v dplyr     1.1.2     v readr     2.1.4
v forcats   1.0.0     v stringr   1.5.0
v ggplot2   3.4.1     v tibble    3.2.1
v lubridate 1.9.2     v tidyr     1.3.0
v purrr     1.0.1     
-- Conflicts ------------------------------------------ tidyverse_conflicts() --
x dplyr::filter() masks stats::filter()
x dplyr::lag()    masks stats::lag()
i Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors
library(here)

Statistical analysis

Presence absence ARG results obtained from the fecal samples between 2020 to 2021 were used for simple summary statistics. Samples were classified as “multiresistant” if they were resistant to three or more of the 8 antibiotic classes that we evaluated for in this study (Blanco-Peña et al., 2017). In addition, we applied the following formula to estimate the percentage of bacteria harboring ARGs: x = 10[2+0.33(ct16S-ctARG)], where x individual percent gene burden in the sample (i.e., the estimated number of copies of the gene present per reaction). Results were expressed in log10, ranging from −8 ( zero to a negligible amount of the bacteria in the sample carried an ARG) to 2 (all or 100% of the bacteria in the sample carried an ARG). The inverse Log10 was then applied to results so they could be totaled and used to evaluate total gene burdens across sampling periods.

Several linear mixed models (LMM) were constructed to evaluate multi-resistance and ARG burden as response variables with nest as a random factor. Covariates considered with each response variable included adult age, adult mean land fill use index (LUI), sample period (as described above), and nest success. Landfill use was quantified by physically observing a banded stork at Colmenar Viejo during weekly visits from March to June in 2021. The LUI was calculated as the number of observations of one particular bird within the total number of visits to landfill per year <CITE Alejandro 2022>. If a banded adult was not seen at the landfill during the breeding season, they were assigned a LUI of 0, suggestive of no resource provisioning at the landfill. All covariates were evaluated for correlation, no covariates were correlated with all the Spearman’s correlation coefficients (rho) < 0.5 and the p > 0.05. All continuous variables (LUI, age, and nest success) were then standardized prior to analysis.

All models were constructed with only 2021 data, as the COVID-19 pandemic prohibited the collection of LUI data in 2020. Models were built and fitted to data using the statistical package tidymodels in Program R (R version 4.2.1, www.r-project.org).

(Note: still trying to figure out how to do this with the performance package) We evaluated model fit for all models using Akaike Information Criterion scores adjusted for small sample sizes (AICc: Burnham and Anderson, 2002). The best fit model was selected by having a ΔAICc < 2._

Results

Exploratory/Descriptive analysis (SUMMARY STATS IN EXPLORATION FILE TBD -> Burden/MDR associated with each class. summary table? with actual summary stats)

Use a combination of text/tables/figures to explore and describe your data. Show the most important descriptive results here. Additional ones should go in the supplement. Even more can be in the R and Quarto files that are part of your project.

Basic statistical analysis

To get some further insight into your data, if reasonable you could compute simple statistics (e.g. simple models with 1 predictor) to look for associations between your outcome(s) and each individual predictor variable. Though note that unless you pre-specified the outcome and main exposure, any “p<0.05 means statistical significance” interpretation is not valid.

Total antimicrobial gene burden as predicted by the variables landfill use index, age, sampling period, and nest success.

Total antimicrobial gene burden as predicted by the variables landfill use index, age, sampling period, and nest success.

Full analysis

Use one or several suitable statistical/machine learning methods to analyze your data and to produce meaningful figures, tables, etc. This might again be code that is best placed in one or several separate R scripts that need to be well documented. You want the code to produce figures and data ready for display as tables, and save those. Then you load them here.

Binomial generalized linear mixed models (GLMMs) were used to predict multi-drug resistance (MDR) in White Storks, with nest identification classified as a random effects parameter. Of these, the highest performing model contained the predictors landfill use index (LUI) and age (@Table1). In this model, an increase in the age of the bird was associated with a higher likelihood of MDR being present (@Table2), while an increase in LUI was found to be associated with a decreased likelihood of MDR presence. A second competitive model, with a 97.20% performance score, contained only age as the predictor.

Linear mixed models (LMMs) were used to predict total antimicrobial gene burden in White Storks, with nest identification again classified as a random effects parameter. The global model, containing the variables landfill use index, age, sample period, and nest success, was the highest performing model (@Table3). In this model, each variable was positively correlated with antimicrobial gene burden (@Table4). The next highest performing model contained age as a single predictor, with a model performance score of 86.39%.

Name Model AICc_wt Performance_Score
glmer_fit_11 _glmerMod 0.3254465 1.0000000
glmer_fit_2 _glmerMod 0.3163397 0.9720177
glmer_fit_12 _glmerMod 0.1173421 0.3605572
glmer_fit_global _glmerMod 0.0962681 0.2958031
glmer_fit_10 _glmerMod 0.0561710 0.1725966
glmer_fit_9 _glmerMod 0.0256592 0.0788428
glmer_fit_8 _glmerMod 0.0251523 0.0772854
glmer_fit_1 _glmerMod 0.0192742 0.0592237
glmer_fit_5 _glmerMod 0.0112575 0.0345909
glmer_fit_6 _glmerMod 0.0070892 0.0217830
glmer_fit_null _glmerMod 0.0000001 0.0000002
glmer_fit_3 _glmerMod 0.0000001 0.0000001
glmer_fit_4 _glmerMod 0.0000000 0.0000000
glmer_fit_7 _glmerMod 0.0000000 0.0000000
effect group term estimate std.error statistic p.value
fixed NA (Intercept) -0.0383609 0.3528408 -0.1087202 0.9134244
fixed NA s.lui -0.5784180 0.3953718 -1.4629724 0.1434749
fixed NA s.age 0.0439471 0.3558834 0.1234873 0.9017212
ran_pars nes sd__(Intercept) 0.7088240 NA NA NA

Small sample adjusted Akaike's Information Criteria (AICc) model weights and performance scores for binomial generalized linear mixed models predicting multi-drug resistance in White Storks (Ciconia ciconia) in Madrid, Spain.

Name Model AICc_wt Performance_Score
lmer_fit2_global _lmerMod 0.2643531 1.0000000
lmer_fit2_2 _lmerMod 0.2283685 0.8638768
lmer_fit2_11 _lmerMod 0.1340166 0.5069607
lmer_fit2_12 _lmerMod 0.1128365 0.4268403
lmer_fit2_9 _lmerMod 0.1112876 0.4209809
lmer_fit2_10 _lmerMod 0.0818244 0.3095268
lmer_fit2_8 _lmerMod 0.0673134 0.2546344
lmer_fit2_5 _lmerMod 0.0000000 0.0000000
lmer_fit2_1 _lmerMod 0.0000000 0.0000000
lmer_fit2_6 _lmerMod 0.0000000 0.0000000
lmer_fit2_null _lmerMod 0.0000000 0.0000000
lmer_fit2_3 _lmerMod 0.0000000 0.0000000
lmer_fit2_4 _lmerMod 0.0000000 0.0000000
lmer_fit2_7 _lmerMod 0.0000000 0.0000000
effect group term estimate std.error statistic
fixed NA (Intercept) -4.302130 14.817887 -0.2903336
fixed NA s.lui 4.962357 5.647220 0.8787258
fixed NA s.age 11.437368 5.663061 2.0196440
fixed NA samp 13.112677 6.721352 1.9508989
fixed NA s.nsuccess 12.422550 8.636042 1.4384541
ran_pars nes sd__(Intercept) 0.000000 NA NA
ran_pars Residual sd__Observation 38.049645 NA NA

Small sample adjusted Akaike's Information Criteria (AICc) model weights and performance scores for linear mixed models predicting total antimicrobial gene burden in White Storks (Ciconia ciconia) in Madrid, Spain.

Discussion

Summary and Interpretation

  • As multi-drug resistance and class diversity increase (similar things I know) throughout the breeding season (compounding effect likely), mean LUI decreases and nest success decreases (makes sense because in your prior papers you have found increase nest success with increased LUI).

  • Resistance gene burden appears to increase as mean LUI use and age increase. o Most of the burden is due to blaTEM, a common resistance gene associated with anthropogenic impact. Sampling period does not appear to explain burden, but the top blaTEM model did show a trend in burden increasing from the 2nd sampling period to the 3rd sampling period and age.

Strengths and Limitations

  • Not much statistically significant data, thus we may have to argue biological significance.

Conclusions

LUI appears to be correlated with higher levels of AMR gene burden in storks. As LUI increases thorough the breeding season (Bialas et al 2021) resistance gene burden also increases with beta lactam resistance contributing to the majority of the burden. However, multidrug resistance appears to decrease as LUI increases, thus it is likely that storks are being exposed to antimicrobial resistance genes at other foraging areas (urban centers, agricultural pastures, etc.). Our results suggest that landfills may not be contributing significantly to the emergence and maintenance of AMR in this system. Little literature exists on the relationship between stork habitat selection and health outside of landfill use in Spain. Future efforts in stork AMR research should focus on exploring the relationship between agricultural land use and health.

What are the main take-home messages?

Include citations in your Rmd file using bibtex, the list of references will automatically be placed at the end

This paper [@leek2015] discusses types of analyses.

These papers [@mckay2020; @mckay2020a] are good examples of papers published using a fully reproducible setup similar to the one shown in this template.

Note that this cited reference will show up at the end of the document, the reference formatting is determined by the CSL file specified in the YAML header. Many more style files for almost any journal are available. You also specify the location of your bibtex reference file in the YAML. You can call your reference file anything you like, I just used the generic word references.bib but giving it a more descriptive name is probably better.

References