ABSTRACT: Aim: A key goal in ecology is to develop more effective ways to understand species’ distributions in order to facilitate both their study and conservation. Many species distribution modeling analyses have been performed to date, using either structured survey data or unstructured citizen science data; these two pools of data have tradeoffs in terms of data density, spatiotemporal coverage, and accuracy. Recent studies have shown that combining structured and unstructured survey data can greatly improve the accuracy of species distribution models for birds, but most of this work has focused on north temperate bird species and uses bird atlas data that is much more common in the temperate zone than elsewhere. We sought to adapt a data pooling approach from the literature on north temperate bird biology to create distribution models for a selection of secretive suboscine bird species that occur in a highly diverse region of the southwestern Amazon. We adapted a data pooling methodology which typically uses structured point count data and eBird observations by substituting the former with acoustic detections collected with autonomous monitoring devices. The pooled dataset was used to model encounter rate, assess habitat variable importance, and predict site occupancy probabilities. Models produced with the pooled dataset performed better than those produced with the eBird-only dataset and yielded occupancy patterns that better reflect the major habitat gradients in this region. Our result supports prior work on eBird data pooling and shows that using acoustic rather than human survey data is a scalable approach to modeling the distributions of other species in Amazonia as well as more broadly across the global tropics.
KEYWORDS: birds, neotropics, acoustic monitoring, species distribution modeling, Amazon, eBird
INTRODUCTION: Quantifying the environmental factors that influence organismal occurrence patterns has helped address key research questions in 21st century biology (Guisan & Thuiller, 2005; Brotons, Herrando, & Pla, 2007; Shirley et al., 2013). Some of these include describing life history strategies (Brambilla & Ficetola, 2012; Segal, Massaro, Carlile, & Whitsed, 2021), estimating the relative contributions of ecological partitioning and competitive interactions in community structuring (Freeman, Strimas-Mackey, & Miller, 2022), and predicting threats due to anthropogenic change (Velásquez-Tibatá, Salaman, & Graham, 2013).​​ A key research tool in this work is species distribution modeling, which relates information on species occurrence (presence, presence-absence, and/or abundance) to environmental covariate sets (Peele, Marra, Sillett, & Sherry, 2016; Stowell, Wood, Stylianou, & Glotin, 2016; Ahmed, Atzberger, & Zewdie, 2020; Stowell 2022). Most of this work has focused on north temperate species due to longstanding historical biases in biological research (Stutchbury & Morton, 2008). While applying species distribution modeling techniques to vastly more speciose tropical communities is becoming more feasible as data steadily accumulates over time and more efficient surveying techniques are developed (Fernández‐Arellano et al. 2021; Rumelt, Basto, & Mere Roncal, 2021), data density shortcomings still represent significant research impediments.
The Amazon rainforest is one of the most speciose regions for birds globally (Amadon, 1973; Haffer, 1978). Although much theoretical ecology work in this region has examined rivers and other habitat discontinuities (Hayes & Swelal, 2004; Fernandes, Cohn-Haft, Hrbek, & Farias, 2014; Oliveira, Vasconcelos, & Santos, 2017) or interspecific competitive interactions (Pearson, 1977; Brooks, 1998) as drivers of diversity, it is clear that they have also been shaped by local-scale spatial gradients and habitat heterogeneity over evolutionary timescales (Stratford & Stouffer, 2013; Castilheiro, Santos-Filho, & Oliveira, 2017; Fluck, Cáceres, Hendges, Brum, & Dambros, 2020; Maximiano et al., 2020). Efficiently gathering data on how these gradients mediate bird occurrence patterns has the potential to answer important questions about community assembly processes and habitat partitioning within clades of Amazonian birds.
Acoustic monitoring is a key technique for surveying tropical rainforest birds as it requires limited person-hours in the field, is more effective at detecting species living in heavily vegetated understory than camera traps or human observers, and can be used to survey for long periods of time, providing a potentially rapid solution to current data limitations in tropical regions (Alvarez-Berríos et al., 2016; Leach, Burwell, Ashton, Jones, & Kitching, 2016; Rumelt et al., 2021). Due to the a priori nature of acoustic survey design, this technique has high location specificity, low spatiotemporal bias, and targeted coverage of local-scale habitat gradients thought to be important to the study species (Pacifici et al., 2017; Robinson, Ruiz‐Gutierrez, & Fink, 2018; Miller, Pacifici, Sanderlin, & Reich, 2019). By repeatedly sampling survey sites, researchers can also use these data in occupancy models to jointly predict occurrence and detection processes (Mackenzie et al., 2002). As with other types of field surveys, however, acoustic monitoring projects are limited by cost, logistics, and scheduling concerns, and are typically only used to gather data on specific sites or areas and at specific times of year. An important research priority is to find effective ways to combine acoustic survey data with other datasets to increase generalizability over larger spatial areas.
Citizen science datasets have shown real promise as a means of filling spatial gaps in ecological data (Dickinson, Zuckerberg, & Bonter, 2010; Bonney et al., 2014; Bradter et al., 2018; Gouraguine et al. 2019) as they have high data densities, are free to use, and cover broad spatial scales. However, they also typically contain higher levels of spatiotemporal survey bias and greater unevenness in observer quality relative to traditional survey methods (Crall et al., 2011; Aceves-Bueno et al., 2017). A variety of research has therefore examined how to mitigate sources of bias in citizen science data (Pacifici et al., 2017; Miller et al., 2019, Steen, Elphick, & Tingley, 2019; Johnston et al., 2020, Feldman et al., 2021; Van Eupen et al., 2021). The largest citizen science project for birds is the eBird project (Sullivan et al. 2009; Wood, Sullivan, Iliff, Fink, & Kelling, 2011; Sullivan et al., 2014), with 70.5 million bird observation records submitted by casual observers around the world (eBird, 2022), and eBird-based species distribution modeling methods, typically incorporating the use of 500 m spatial resolution MODIS landcover variables to describe important factors mediating occurrence, have been in development for over a decade (Fink et al., 2010; Fink et al., 2014; Kelling et al., 2015; Fletcher et al., 2019, Johnson et al., 2020; Fink et al., 2020). Several recent publications have demonstrated that combining eBird data with small amounts of structured survey data collected from the same geographic area and time period (referred to in the literature as “data pooling;” Fithian, Elith, Hastie, & Keith, 2015) is a simple yet powerful way to increase the predictive power of eBird species distribution models (SDMs) (Johnson et al., 2020). This approach has the potential to address long-standing regional biases in ornithology and rapidly scale up to enhance distributional knowledge of large numbers of bird species. However, prior integrated modeling approaches have combined eBird data with structured survey data from The Nature Conservancy and the Breeding Bird Atlas (Robinson et al., 2020; Johnston et al., 2021), and similar large bodies of survey data are largely unavailable in the global tropics. An important step in scaling up the data pooling approach is therefore a rigorous test of this methodology’s generalizability to other forms of structured survey data, including acoustic monitoring data, that can serve as a replacement for human survey data in regions where the latter does not exist.
Here, we apply a framework that integrates data from acoustic surveys with eBird observations to model the occurrence of several species of secretive interior forest suboscine songbirds in a highly diverse region of primary forest in southwestern Amazonia. This is the first time an integrated modeling framework combining acoustic monitoring and citizen science data has been applied to Amazonian birds. We aim to test the impact of this data pooling approach, along with a high resolution land cover dataset of the primary ecological gradients in this region (finer spatial scale than the 500 m grid typically used for eBird species distribution modeling) on the classification accuracy of eBird-based SDMs produced for bird species whose life history characteristics make them less likely to be detected by human observers. This is a first test of this methodology prior to its potential application to other study systems in the Neotropics as well as other regions of the global tropics.
MATERIALS AND METHODS:
1. Survey site and study species All acoustic survey data were collected at Los Amigos Biological Station, which contains approximately 145,000 hectares of lowland primary rainforest in the province of Madre de Dios in southeastern Peru (Pitman 2010), on the station’s existing transect grid (Tobler, Carrillo‐Percastegui, Leite Pitman, & Powell, 2008). Habitat structure in this region is heterogeneous, being structured by successional gradients associated with increasing distance from the channels of large rivers (Gentry, 1993), and this habitat diversity occurs alongside high avian alpha diversity (~600 species: eBird, 2022). The two high-level forest habitat types present at the site are floodplain forest (flooded for at least part of the year) and terra firme(never flooded); differences in time since disturbance, soil moisture, and edge effects are responsible for creating distinct understory vegetation characteristics in these two habitats as well as the ecotone between them (Gentry, 1993; Nobre et al., 2011). Members of several avian clades regularly segregate spatially by habitat type, suggesting that these high habitat and species diversity metrics are related (Rosenberg, 1990; Harvey, Aleixo, Ribas, & Brumfield et al., 2017; Mere Roncal et al.; 2019).
We focused our analyses on a set of five suboscine songbirds in Furnariida, an extremely speciose and ecologically varied clade in Amazonian forests (Oliveiros et al., 2019; eBird, 2022): Formicarius analis, F. colma, Myrmothera campanisona , Akletos goeldii, and Oneillornis salvini (Table 1). We chose these species because they had the most occurrences in the acoustic dataset among the Furnariida present at Los Amigos. Although these five species vary with respect to their utilization of the key ecological gradients in this region, all are terrestrial or semi-terrestrial, cryptically-colored, and have strong affinities for intact forest with heavy understory vegetation (Marcondes & Brumfield, 2019), which makes them important indicators of forest disturbance (Laurence, Stouffer, & Laurence, 2004; Thornton, Branch, & Sunquist, 2012). This set of life history traits makes detecting our target species using in-person survey methods challenging; however, they all have loud, distinctive vocalizations that are easy to record with acoustic monitors.2. Acoustic data collectionAcoustic data were collected as part of an automated monitoring project using 10 SWIFT autonomous recording units (ARUs; Kahl et al., 2019) cycled across 34 survey points (Figure 1a) spanning the main habitat gradients at this site. One field season ran from late July to early October 2019, during the dry season in southeastern Peru; the second ran during the wet season from early January to late February 2020 (Pitman 2010). Recorder sampling frequency was set to 16kHz to cut acoustic frequencies greater than 8kHz (Landau 1967); this allowed us to reduce recording file size while retaining acoustic fidelity in the frequency bands in which our target species’ vocalizations occur (<3.5 kHz). Microphone gain was left at the default -33 dB. Recorders were placed ~1m above the ground with the microphone facing towards the ground, which is optimal for recording terrestrial birds. Although this survey methodology was designed for tinamous, these parameters are also appropriate for capturing the vocalizations of terrestrial suboscines as the two clades broadly overlap in forest strata and acoustic frequency bands. Recording during the 2019 field season took place continuously between 5:00 and 7:30 hrs and 16:00 and 18:30 hrs to capture the dawn and dusk choruses, respectively. A change in the scope of the tinamou project during the 2020 field season led us to increase our recording windows to continuously from 3:30 to 7:30 hrs and from 14:30 to 18:30 hrs, as well as 15 minutes every half-hour between 8:30 hrs and 14:00 hrs and between 19:30 and 24:00 hrs. Additional details of recording site selection, placement, and recording parameters for the 2019 field season are described in [removed for double-blind review].
3. Processing of acoustic data Survey audio was captured as a series of 30 minute WAV files (or 15 minutes outside of peak recording times during the 2020 season) spanning each day’s recording period. Processing was performed using the BirdNET-Analyzer platform, which is pre-trained on a list of ~3000 bird species that includes our five target species and much of the avifauna of Amazonian Peru (Kahl et al., 2021). The classifier examines 3-second moving windows spanning the source audio and returns possible detections with associated detection probabilities (see Table S1 for full classification parameters). A characteristic of all probability-based moving window classifiers is the need to set a minimum threshold for accepted detections. We lumped overlapping and adjacent detections for each species into a single acoustic event and calculated maximum and mean detection probability (pmaxand pmin) for each event. All events with pmax > 0.80 were retained; these parameters were chosen by examining a random sample of 300 detections for each species (or all detections if the number was less than 300) and selecting a cutoff that eliminated the false positives in each sample. Positive detections for each species were plotted on a map and outliers manually checked for accuracy. Finally, these data were joined to a database of all recordings made during the two field seasons as well as a lookup table containing date-time and spatial metadata; this had the effect of creating a presence-absence dataset in which 30-minute recordings containing the target species are labeled “presences” and those without “absences” (we hereafter refer to presence-absence datasets formatted in this manner as “zero-filled” as this is the term used in the literature on eBird SDM; Strimas-Mackey et al., 2020).
4. eBird data and landcover covariates Occurrence data for the five target species were acquired from the eBird data portal (Sullivan et al., 2014). Data were pre-processed using the ‘auk’ package in R (Strimas-Mackey, Miller, & Hochachka, 2018) to select for checklists containing our five target species that were submitted in Madre de Dios between 2010-2022 using semi-structured protocols (Stationary or Traveling). This list was further refined to retain complete checklists (those where the observer specifies that they reported all species they were able to identify) with survey duration less than five hours, effort distance less than 5 km (for traveling checklists), and number of observers less than 10 in order to increase sampling effort evenness within the dataset. As with the acoustic survey data, eBird presence data were joined to the complete database of eBird checklists from this region with the same filter parameters and zero-filled; this had the effect of creating a presence-absence dataset in which checklists containing the target species were reported as “presences” and those without as “absences.” The canonical eBird species distribution modeling framework (Strimas-Mackey et al., 2020) uses a MODIS land-use/land-cover dataset (Freidl et al., 2002) as the basis for analyses, with elevation as an auxiliary continuous predictor variable. As the MODIS land cover dataset lumps most Amazonian forest types into an “evergreen broadleaf forest” class, and is relatively low resolution (500 m), we felt it necessary to develop a land cover dataset that is resolved to a finer spatial scale and contains more nuanced land cover classes that are relevant for analyses in the Amazon. The base layer for our land cover classification is the Sentinel-2 10m Land Use/Land Cover Time Series (Kara et al., 2021), which contains 11 land cover classes. We augmented class four of the Sentinel-2 dataset (“Flooded Vegetation”) by superimposing two land cover classes (“occasionally open water” and “always inundated”) from a synthetic aperture radar dataset of forest inundation (Sheng & Alsdorf, 2005). In order to better capture the natural gradient that exists between terra firme and floodplain forest types in lowland Madre de Dios, we chose to evaluate areas covered by class two in the Sentinel-2 dataset (“Trees”) with three types of continuous data: the SRTM+ 30m Digital Elevation Model (DEM) (Farr et al., 2007), height above nearest drainage (HAND) (Nobre et al., 2011) calculated from the same DEM, and a Sentinel-2-derived canopy height layer (Lang, Jetz, Schindler, & Wegner, 2022). Although we produced our land cover dataset at the spatial extent of the Amazon basin, we clipped it to the outline of the Madre de Dios province for the purposes of this study. All raster values were resampled to 30m for improved processing efficiency. GIS layer manipulation was performed using QGIS version 3.26.0 (QGIS.org 2022).
5. Modeling framework Our modeling approach follows that used by Robinson et al. (2020), Strimas-Mackey et al. (2020), and Johnson et al. (2021), unless otherwise noted. Although Strimas-Mackey et al. (2020) discusses how to use eBird data to create encounter rate, single-species occupancy models, and GAM-based abundance models, our acoustic data do not capture abundance, and we are therefore only able to perform the first two types of analyses here. All model processing was performed in R version 4.2.0 (R Core Team, 2022).5a. Encounter rate Encounter rate is a frequently-used metric in the literature on eBird SDM modeling that is proportional to occupancy and uses survey date-time, duration, distance covered, and number of observers as covariates in order to reduce between-site variability in detection (Guillera-Arroita et al., 2015; Strimas-Mackey et al., 2020). To produce encounter rate models for our species and evaluate the benefit of including structured survey data from the acoustic dataset, we used a bootstrapping analysis to generate submodels either from eBird data alone or from the pooled eBird+acoustic dataset (n=25 per analysis; Johnston et al., 2021). In each bootstrap iteration, the eBird dataset for our target species was spatiotemporally subsampled by overlaying a 3 km hexagonal grid created using the ‘dggridr’ package (Barnes & Stahr, 2017) over Madre de Dios and sampling one observation per grid cell per week; positive and negative detections were sampled separately to reduce class bias (Robinson et al., 2018). Spatiotemporal subsampling was not performed on the acoustic survey data because our strategy of placing recorders at set intervals along transects inherently reduced spatial bias. For the set of iterations using pooled data, we combined the two datasets by treating each acoustic sampling event as a pseudo-checklist submitted under the Stationary protocol with number of observers = 1 and duration = 30 or 15 min (depending on the length of the audio file); this is an established strategy in the literature for combining structured survey data with eBird data for occupancy modeling (Robinson et al., 2020). Land cover summary data were extracted for all spatial points by setting a 300m buffer radius around each point and calculating percent coverage of each land cover class and mean of the continuous predictor datasets within the buffer (“pland_XX_” and “X_median”, respectively). A class-balanced, GAM-calibrated (Robinson et al., 2018) random forest model was produced from the subsampled eBird data using the ‘scam’ (Pya, 2021) and ‘ranger’ (Wright & Ziegler, 2017) packages to model encounter rate. We used an 80%-20% data partition to train and evaluate each model, respectively; the validation dataset for the combined dataset contained both eBird and acoustic data. Models were evaluated based on: area under the curve (AUC), which measures the degrees to which the model correctly predicts presence localities and minimizes false presence predictions across all binary cutoff thresholds; sensitivity, the probability that a given presence prediction represents a true presence; specificity, the probability that a given absence prediction represents a true absence, and Mean-Squared Error (MSE), which measures the mean of the squared deviations between predicted probability of presence and true presence-absence state. These metrics were as applied to the hold-out set for the model produced in each bootstrap iteration and used to assess changes in accuracy between the two data pooling approaches. Package ‘ranger’ also calculates relative predictor importance for each model using the Gini Score (Gini, 1936), where value of importance is equal to the mean decrease in model accuracy when the predictor is excluded during the fitting process (Strimas-Mackey et al., 2020), and we extracted the importance values from each model to determine whether relative predictor importance was different between the eBird and pooled bootstrap runs. Finally, we made two prediction surfaces for Madre de Dios, one by taking the mean of the output rasters of the 25 bootstrap runs using just eBird data and the other from the mean of the output rasters from all 25 runs using eBird+acoustic data.
5b. Occupancy modeling Occupancy modeling was used to jointly model occurrence and detection probability as separate processes. This modeling approach involves partitioning predictors into site- and observation-level subsets and selecting between several candidate models using subsets of these predictors. eBird data can be used for occupancy modeling, but it requires special processing to select a subset of the data that represents repeat visits by the same observer to the same site within closure periods (Mackenzie et al., 2002) to meet the standard set of occupancy model assumptions (Strimas-Mackey et al., 2020). We used observation start time, checklist duration, day of year, number of observers, and distance traveled as detection covariates for both datasets (number of observers and kilometers traveled set equal to 1 and 0, respectively, for all acoustic records). All predictors were standardized by subtracting the mean and dividing by the standard deviation, and any correlated predictors (r ≥ 0.75) were removed. Spatiotemporal subsampling was again applied to eBird data: we discovered that the additional constraint of only including repeat visits necessitated less stringent subsampling to ensure adequate sample sizes for analysis, so we randomly selected 10 samples per site per 3 km grid cell rather than one. As with the encounter rate model we did not spatially subsample the acoustic data.
We first created single-species occupancy models using the R package ‘unmarked’ (Fiske & Chandler, 2011) with eBird data. We evaluated our global models for goodness-of-fit using SSE, χ2, and Freeman-Tukey tests (n=1000; MacKenzie & Bailey, 2004). We then performed AICc-based model selection using the “dredge” function from MuMIn(Bartoń, 2022), which evaluates all possible models. We retained all models with ΔAICc ≤ 2.5 and used model averaging to produce a final occupancy model (Supplement). While we used the same procedure to produce pooled models, it was critical to address the issue of differing detection/non-detection ratios between the two component datasets. The nature of our acoustic survey design yielded long segments of audio containing zero bird detections, mainly in the middle of the day and after dark. By contrast, it is exceedingly rare for eBird observers to submit “null” checklists where zero species are reported. As a result, the number of sampling events was much higher for the acoustic dataset than for the eBird dataset, with a correspondingly lower detection/non-detection ratio. As working with data where this ratio is closer to one seems to be key for producing this type of occupancy model (Strimas-Mackey et al., 2020), we subsampled the acoustic detections so that the total number of sampling events and the ratio of positive to negative detections was the same as in the eBird dataset before creating the pooled dataset. The end result of each modeling attempt was a raster layer for Madre de Dios consisting of occurrence probabilities for the given species. AUC, sensitivity, and specificity metrics were collected from each occupancy model by comparing site detection/non-detection status with predicted occurrence probability and choosing a cutoff threshold that maximizes Cohen’s Kappa (Johnston et al., 2021).
RESULTS:
Our total sampling effort amounted to 5792.5 hours of collected audio, 1216.5 from the 2019 field season and 4575 from the 2020 field season, of which 66% came from terra firme and 34% from floodplain. Relative time allocation within terra firme and floodplain was approximately equal to the number of recorder-deployments within each habitat (42 vs. 23; 65% vs. 35%). We recorded 222 independent detections for O. salvini , 159 for F. analis , 100 forF. colma , 99 for A. goeldii , and 45 for M. campanisona . Filtering the eBird dataset with the chosen effort metrics yielded 1789 records for F. analis , 624 for A. goeldii , 279 for F. colma, 232 for M. campanisona , and 217 forO. salvini . In the encounter rate analysis, AUC values were higher for all species using the pooled models than using the eBird-only models, with mean AUC increasing from 0.82 to 0.96 (Figure 2). This pattern appears to have mainly been driven by increases in specificity (all species had higher or equivalent specificity values in the pooled models, versus one out of five with higher sensitivity, Figure 2). The habitat variables that were most important for the encounter rate models for all species, irrespective of data source, were elevation_median ,HAND_median , and canopy_cover_median (Figure 3). In the pooled models however, the absolute importance of all habitat predictors was higher than in the eBird-only models, and the importance of detection covariates relative to habitat predictors generally decreased. The covariate time_observations_started , which was the most important predictor variable in the eBird-only models for all species except M. campanisona , dropped in relative importance in the pooled models to between 2nd and 5th place in all but one species. Similarly, duration_minutes , which was among the top five predictors for all species in the eBird-only models, decreased in relative importance for all five species in the pooled models. By contrast, elev_median increased or maintained its relative importance in the pooled models for all five species, andhand_median increased or maintained its relative importance in the pooled models for four of the five species.
Occupancy model metrics showed mixed results with respect to overall model performance. When comparing the pooled occupancy models to the eBird-only models, AUC values were higher in two species, showed no appreciable difference in one species, and shifted lower in the other two species. However, breaking AUC into its component pieces suggested that specificity was much higher (with concomitant decreases in sensitivity) in the floodplain species’ pooled models, and vice versa for two of the three terra firme species (Figure 4). Inspection of the parameter estimates showed that in all but one species, adding acoustic survey data increased the significance of one or more of the three habitat variables that were a priori though to have the most influence on occurrence (Table 2) (the exception, M. campanisona , was likely due to sample size limitations as this was the species with the lowest number of total detections in the combined dataset). Visual inspection of occupancy model outputs (Figure 5) revealed a similar story: eBird-only models for all species predict essentially constant rates of occupancy across forested sites in Madre de Dios, while the pooled occupancy models conform much more closely to the terra firme -floodplain gradient and exhibited broadly realistic occurrence patterns (i.e., occupancy for terra firme species was higher interra firme , and vice versa for floodplain species). The exception was unrealistic occupancy values observed for the threeterra firme species in montane areas of Madre de Dios, which is likely the result of extrapolating the positive relationship between elevation and occupancy far outside the parameter space of the models. We found little evidence for lack of fit in most species-model combinations, though the pooled model for F. colma, eBird-only model for Myrmothera campanisona and both models for O. salvini each failed one of the three statistical tests (Table S2).
DISCUSSION:
Our results broadly support previous work showing citizen science data can be combined with small amounts of structured survey data collected over ecological gradients of known importance to target species to better model species distribution patterns (Pacifici et al., 2017; Miller et al., 2019, Robinson et al., 2020, Johnston et al., 2021). Including acoustic survey data led to significantly improved model accuracy and error metrics for encounter rate models. Improvements to the occupancy models was more modest, but they nonetheless yielded far more realistic habitat parameter estimates and occurrence maps than the eBird-only occupancy models
A potential explanatory factor for these results is that eBird checklists in Madre de Dios are non-randomly distributed, being most common near the major rivers that serve as the main means of transportation in this region (Figure 1b). We included traveling checklists with lengths <5 km in this analysis, and although this ensured we had access to an adequate amount of data for modeling, this distance is large relative to the length of the terra firme -floodplain gradient. Many eBird records in this region are submitted to station or lodge “hotspots” that represent a network of trails extending across a larger area, leading to hotspots containing a mixture of terra firme and floodplain data. The coordinate point used for a hotspot is therefore often not representative of the habitat in which data collection occurred, which makes using eBird data for inference over short, steep ecological gradients more challenging. It is therefore likely that eBird data more accurately reflects the habitat affinities of floodplain species than terra firme species, and the overall effect of adding acoustic data to the occupancy models is an improvement in the models’ ability to predict where terra firmespecies do occur, and where floodplain species do notoccur. The fact that key habitat variables increased in importance relative to detection covariates with the addition of structured survey data in the encounter rate analysis provides further evidence for this assertion by showing that the observed improvements are being driven in large part by higher weighting of habitat variables relative to detection covariates.
That integrated modeling can be used to improve SDM models for Amazonian birds has important implications for understanding the distribution of tropical species, as does the finding that these improvements can be realized with acoustic data. Despite being vastly more speciose than the north temperate zone, the Tropics have significantly lower participation rates in eBird (La Sorte & Somveille, 2020), yielding lower overall data densities. Although we are aware of a handful of recent studies that have used eBird data to perform SDM analyses specific to tropical birds (Girish & Srinivasan, 2022; Ramesh, Gupte, Tingley, Robin, & DeFries, 2022), these studies were both conducted in regions of South Asia that have higher eBird data densities than the Tropics as a whole (La Sorte & Somveille, 2020), and neither used an integrated modeling approach similar to ours or those of Robinson et al. (2020). An important characteristic of eBird observations is that they are non-randomly distributed, with preference given to areas near roads and known public recreation areas (Zhang 2020). This makes using eBird data for modeling particularly challenging in regions such as the Amazon that have high levels of beta diversity and relatively few, narrow transportation corridors relative to their land area. Low availability of eBird data away from these corridors can give rise to unrealistic model predictions, particularly when there is a known association between habitat and transportation corridor placement, i.e., major rivers being associated with both boat traffic and floodplain forest (Figures 1b and 5). Our results suggest that including a small quantity of structured survey data away from human movement corridors alleviates some of these problems.
Another advancement is the use of acoustic monitoring rather than in-person field surveys as the structured dataset. Prior integrated modeling approaches using eBird data have incorporated data from TNC and the Breeding Bird Atlas (Robinson et al., 2020, Johnston et al., 2021). Few tropical countries have a significant history of conducting similar bird atlas surveys (Horns, Adler, & Şekercioğlu, 2018); by contrast, an increasing number of biological stations have available acoustic survey data (Alvarez-Berríos et al., 2016; Do Nascimento, Campos-Cerqueira, & Beard, 2020; Blake, 2021; Campos et al., 2021; Rumelt et al., 2021). We argue that acoustic surveying techniques are well suited to addressing this data gap; providing a similar quality of data for highly vocal species as human observations (Blake, 2021), they have the added benefits of being non-invasive and easier to conduct for extended periods of time. They also have a proven history of providing critical information on poorly-known species (Pérez-Granados & Schuchmann, 2021; Zhong et al., 2021; Pérez-Granados & Schuchmann, 2022). In the past, the use of acoustic survey data for landscape-scale analysis has been hindered significantly by the need for manual processing of audio by trained listeners with intimate familiarity of the avian soundscape being examined (Sugai, Silva, Ribeiro, & Llusia, 2019); however, this barrier is rapidly being overcome withy automated detection techniques, either project-specific (Rumelt et al., 2021) or publicly-available classifiers such as BirdNet (Kahl et al., 2021).
It is important to note that future work incorporating acoustic monitoring data for occupancy modeling should involve careful consideration of survey design and the meaning of model outputs. We have used acoustic monitoring data as a stand-in for human survey data in this analysis and integrated it with a larger dataset of human observations, but we would be remiss for claiming that they offer exactly the same patterns of inference. Although acoustic monitoring has efficiency benefits over human surveys (Blake, 2021), human observers are capable of detecting non-vocal individuals by sight, and it is probable that two models generated for a less vocal species, one using human survey data and the other using acoustic monitoring data, would generate different conclusions about that species’ patterns of occupancy. Our analysis preferentially looks at a small set of secretive but highly vocal species for which acoustic monitoring is a highly suitable survey approach, but for other species or clades the considerations could be quite different. Furthermore, some species’ rates of vocalization may vary with habitat quality across their home ranges (Wood & Peery, 2022): for instance, lekking species that have particular habitat constraints on lek site selection (Durães, Loiselle, & Blake, 2008). Therefore, depending on the question being asked, it is important to consider whether the chosen survey design will yield “acoustic occupancy” rates that are comparable to the species’ true occupancy. A study that directly compared the use of point-count surveys and acoustic monitoring data in an integrated modeling framework like the one explored in this analysis would be fascinating and much appreciated. Generating a more complete picture of occupancy patterns of Amazonian birds is a critical research priority given the extent of anthropogenic change in this region. The Amazon at large currently experiences threats from deforestation (Silva-Junior et al., 2021) and fire (Silveira, Silva‐Junior, Anderson, & Aragão, 2022), and the area bordering the Ruta Interoceánica, a newly-constructed east-west road connecting Madre de Dios and the Peruvian Andean provinces with the Brazilian state of Acre, is a major deforestation hotspot within Peru (Sánchez et al., 2021). Much of this deforestation is intended to clear land for agriculture and is rapidly changing the biological characteristics of this region. Madre de Dios has also become a hotspot of illegal artisanal gold mining in recent years: this activity denudes floodplain areas near rivers and oxbow lakes of vegetative cover and releases large quantities of mercury into the environment (Diringer et al., 2019; Gerson et al., 2022). Understanding which species’ habitat use characteristics make them most vulnerable to anthropogenic pressures will be crucial for informing conservation efforts and acquisition of land for the establishment of protected areas.
CONFLICT(S) OF INTEREST:
The authors declare no conflict of interest.
REFERENCES:
Aceves-Bueno, E., Adeleye, A. S., Feraud, M., Huang, Y., Tao, M., Yang, Y., & Anderson, S. E. (2017). The accuracy of citizen science data: a quantitative review. Bulletin of the Ecological Society of America , 98(4), 278-290.
Ahmed, N., Atzberger, C., & Zewdie, W. (2020). Integration of remote sensing and bioclimatic data for prediction of invasive species distribution in data-poor regions: a review on challenges and opportunities. Environmental Systems Research , 9(1), 1-18.
Alvarez-Berríos, N., Campos-Cerqueira, M., Hernández-Serna, A., Amanda Delgado, C. J., Román-Dañobeytia, F., & Aide, T. M. (2016). Impacts of small-scale gold mining on birds and anurans near the Tambopata Natural Reserve, Peru, assessed using passive acoustic monitoring.Tropical Conservation Science , 9(2), 832-851.
Amadon, D. (1973). Birds of the Congo and Amazon forests: a comparison. Tropical forest ecosystems in Africa and South America: a comparative review. Smithsonian Institution Press , Washington, DC, 267-277.
Austad, A. M. (2021). Birds on camera: assessing avifaunal detections by camera traps in the Amazon (Master’s thesis, Norwegian University of Life Sciences, Ås ).
Barnes, R. & Sahr, K. (2017). dggridR: Discrete Global Grids for R. R package version 2.0.4. https://github.com/r-barnes/dggridR/. doi:10.5281/zenodo.1322866
Bartoń, K. (2022). MuMIn: Multi-Model Inference. R package version 1.46.0. https://CRAN.R-project.org/package=MuMIn.
Blake, J. G. (2021). Acoustic monitors and direct observations provide similar but distinct perspectives on bird assemblages in a lowland forest of eastern Ecuador. PeerJ , 9, e10565.
Bonney, R., Shirk, J. L., Phillips, T. B., Wiggins, A., Ballard, H. L., Miller-Rushing, A. J., & Parrish, J. K. (2014). Next steps for citizen science. Science, 343(6178), 1436-1437.
Bradter, U., Mair, L., Jönsson, M., Knape, J., Singer, A., & Snäll, T. (2018). Can opportunistically collected Citizen Science data fill a data gap for habitat suitability models of less common species?. Methods in Ecology and Evolution, 9(7), 1667-1678.
Brambilla, M., & Ficetola, G. F. (2012). Species distribution models as a tool to estimate reproductive parameters: a case study with a passerine bird species. Journal of Animal Ecology, 81(4), 781-787.
Brooks, D. M. (1998). Competition and coexistence in Neotropical birds: a latitudinal comparison. Texas A&M University.
Brotons, L., Herrando, S., & Pla, M. (2007). Updating bird species distribution at large spatial scales: applications of habitat modeling to data from long‐term monitoring programs. Diversity and distributions, 13(3), 276-288.
Callaghan, C. T., Martin, J. M., Major, R. E., & Kingsford, R. T. (2018). Avian monitoring–comparing structured and unstructured citizen science. Wildlife Research, 45(2), 176-184.
Campos, I. B., Fewster, R., Truskinger, A., Towsey, M., Roe, P., Vasques Filho, D., … & Gaskett, A. (2021). Assessing the potential of acoustic indices for protected area monitoring in the Serra do Cipó National Park, Brazil. Ecological Indicators120 , 106953.
Castilheiro, W. F. F., Santos-Filho, M. D., & Oliveira, R. F. D. (2017). Beta diversity of birds (Passeriformes, Linnaeus, 1758) in Southern Amazon. Ciência animal brasileira, 18.
Cerqueira, M. C., Cohn‐Haft, M., Vargas, C. F., Nader, C. E., Andretti, C. B., Costa, T. V., … & Ferraz, G. (2013). Rare or elusive? A test of expert knowledge about rarity of Amazon forest birds. Diversity and Distributions, 19(7), 710-721.
Chaves, W. A., Sieving, K. E., & Fletcher Jr, R. J. (2017). Avian responses to reduced-impact logging in the southwestern Brazilian Amazon. Forest ecology and management, 384, 147-156.
Cintra, R., & Cancelli, J. (2008). Effects of forest heterogeneity on occurrence and abundance of the scale-backed antbird, Hylophylax poecilinotus (Aves: Thamnophilidae), in the Amazon forest. Revista Brasileira de Zoologia, 25, 630-639.
Condit, R., Pitman, N., Leigh Jr, E. G., Chave, J., Terborgh, J., Foster, R. B., … & Hubbell, S. P. (2002). Beta-diversity in tropical forest trees. Science, 295(5555), 666-669.
Cooper, C. B., Dickinson, J., Phillips, T., & Bonney, R. (2007). Citizen science as a tool for conservation in residential ecosystems. Ecology and society, 12(2).
Crall, A. W., Newman, G. J., Stohlgren, T. J., Holfelder, K. A., Graham, J., & Waller, D. M. (2011). Assessing citizen science data quality: an invasive species case study. Conservation Letters, 4(6), 433-442.
Dickinson, J. L., Zuckerberg, B., & Bonter, D. N. (2010). Citizen science as an ecological research tool: challenges and benefits. Annual review of ecology, evolution, and systematics, 149-172.
Diringer, S. E., Berky, A. J., Marani, M., Ortiz, E. J., Karatum, O., Plata, D. L., … & Hsu-Kim, H. (2019). Deforestation due to artisanal and small-scale gold mining exacerbates soil and mercury mobilization in Madre de Dios, Peru. Environmental science & technology, 54(1), 286-296.
Do Nascimento, L. A., Campos-Cerqueira, M., & Beard, K. H. (2020). Acoustic metrics predict habitat type and vegetation structure in the Amazon. Ecological Indicators117 , 106679.
Durães, R., Loiselle, B. A., & Blake, J. G. (2008). Spatial and temporal dynamics at manakin leks: reconciling lek traditionality with male turnover. Behavioral Ecology and Sociobiology, 62(12), 1947-1957.
eBird. 2022. eBird: An online database of bird distribution and abundance [web application]. eBird, Cornell Lab of Ornithology, Ithaca, New York. Available: http://www.ebird.org. (Accessed: Date [e.g., September 7, 2022]).
Farr, T. G., Rosen, P. A., Caro, E., Crippen, R., Duren, R., Hensley, S., … & Alsdorf, D. (2007). The shuttle radar topography mission.Reviews of geophysics , 45 (2).
Feldman, M. J., Imbeau, L., Marchand, P., Mazerolle, M. J., Darveau, M., & Fenton, N. J. (2021). Trends and gaps in the use of citizen science derived data as input for species distribution models: A quantitative review. PLoS One, 16(3), e0234587.
Fernandes, A. M., Cohn-Haft, M., Hrbek, T., & Farias, I. P. (2014). Rivers acting as barriers for bird dispersal in the Amazon. Revista Brasileira de Ornitologia, 22(4), 363-373.
Fernández‐Arellano, G. J., Teixido, A. L., Bernardon, B., Bueno, E. R., Ferreira, T. V., Goncalves, S. R., … & Pinho, J. B. (2021). Knowledge gaps and biases in the Pantanal indicate future directions for ornithological research in large wetlands. Ibis, 163(3), 784-797.
Finer, M., Yupanqui, O., Suarez, D., & Novoa, S. (2021). Using Satellites to Detect Illegal Logging in Peruvian Amazon. MAAP, 139, 7.
Fink, D., Hochachka, W. M., Zuckerberg, B., Winkler, D. W., Shaby, B., Munson, M. A., … & Kelling, S. (2010). Spatiotemporal exploratory models for broad‐scale survey data. Ecological Applications, 20(8), 2131-2147.
Fink, D., Damoulas, T., Bruns, N. E., La Sorte, F. A., Hochachka, W. M., Gomes, C. P., & Kelling, S. (2014). Crowdsourcing meets ecology: hemisphere-wide spatiotemporal species distribution models. AI magazine, 35(2), 19-30.
Fink, D., Auer, T., Johnston, A., Ruiz‐Gutierrez, V., Hochachka, W. M., & Kelling, S. (2020). Modeling avian full annual cycle distribution and population trends with citizen science data. Ecological Applications, 30(3), e02056.
Fiske, I., & Chandler, R. (2011). Unmarked: an R package for fitting hierarchical models of wildlife occurrence and abundance. Journal of statistical software, 43, 1-23.
Fithian, W., Elith, J., Hastie, T., & Keith, D. A. (2015). Bias correction in species distribution models: pooling survey and collection data for multiple species. Methods in Ecology and Evolution, 6(4), 424-438.
Fletcher Jr, R. J., Hefley, T. J., Robertson, E. P., Zuckerberg, B., McCleery, R. A., & Dorazio, R. M. (2019). A practical guide for combining data to model species distributions. Ecology, 100(6), e02710.
Fluck, I. E., Cáceres, N., Hendges, C. D., Brum, M. D. N., & Dambros, C. S. (2020). Climate and geographic distance are more influential than rivers on the beta diversity of passerine birds in Amazonia. Ecography, 43(6), 860-868.
Freeman, B. G., Scholer, M. N., Ruiz-Gutierrez, V., & Fitzpatrick, J. W. (2018). Climate change causes upslope shifts and mountaintop extirpations in a tropical bird community. Proceedings of the National Academy of Sciences, 115(47), 11982-11987.
Freeman, B. G., Strimas-Mackey, M., & Miller, E. T. (2022). Interspecific competition limits bird species’ ranges in tropical mountains. Science, 377(6604), 416-420.
Friedl, M. A., McIver, D. K., Hodges, J. C., Zhang, X. Y., Muchoney, D., Strahler, A. H., … & Schaaf, C. (2002). Global land cover mapping from MODIS: algorithms and early results. Remote sensing of Environment, 83(1-2), 287-302.
Gentry, A. H. (Ed.). (1993). Four neotropical rainforests. Yale University Press.
Gerson, J. R., Szponar, N., Zambrano, A. A., Bergquist, B., Broadbent, E., Driscoll, C. T., … & Bernhardt, E. S. (2022). Amazon forests capture high levels of atmospheric mercury pollution from artisanal gold mining. Nature communications, 13(1), 1-10.
Gini, C. (1936). On the measure of concentration with special reference to income and statistics. Colorado College Publication, General Series, 208(1), 73-79.
Girish, K. S., & Srinivasan, U. (2022). Community science data provide evidence for upward elevational range shifts by Eastern Himalayan birds. Biotropica.
Gouraguine, A., Moranta, J., Ruiz-Frau, A., Hinz, H., Reñones, O., Ferse, S. C., … & Smith, D. J. (2019). Citizen science in data and resource-limited areas: A tool to detect long-term ecosystem changes. PloS one, 14(1), e0210007.
Greeney, H. F. (2020). Thrush-like Antpitta (Myrmothera campanisona ), version 1.0. In Birds of the World (S. M. Billerman, B. K. Keeney, P. G. Rodewald, and T. S. Schulenberg, Editors). Cornell Lab of Ornithology, Ithaca, NY, USA. https://doi.org/10.2173/bow.thlant2.01
Guillera‐Arroita, G., Lahoz‐Monfort, J. J., Elith, J., Gordon, A., Kujala, H., Lentini, P. E., … & Wintle, B. A. (2015). Is my species distribution model fit for purpose? Matching data and models to applications. Global Ecology and Biogeography, 24(3), 276-292.
Guisan, A., & Thuiller, W. (2005). Predicting species distribution: offering more than simple habitat models. Ecology letters, 8(9), 993-1009.
Haffer, J. (1978) Distribution of Amazon forest birds. Bonn. Zool. Beitr, 29, 38–78.
Harvey, M. G., Aleixo, A., Ribas, C. C., & Brumfield, R. T. (2017). Habitat association predicts genetic diversity and population divergence in Amazonian birds. The American Naturalist, 190(5), 631-648.
Hayes, F. E., & Sewlal, J. A. N. (2004). The Amazon River as a dispersal barrier to passerine birds: effects of river width, habitat and taxonomy. Journal of Biogeography, 31(11), 1809-1818.
Hochachka, W. M., Fink, D., Hutchinson, R. A., Sheldon, D., Wong, W. K., & Kelling, S. (2012). Data-intensive science applied to broad-scale citizen science. Trends in ecology & evolution, 27(2), 130-137.
Horns, J. J., Adler, F. R., & Şekercioğlu, Ç. H. (2018). Using opportunistic citizen science data to estimate avian population trends. Biological conservation, 221, 151-159.
Householder, J. E., Janovec, J. P., Tobler, M. W., Page, S., & Lähteenoja, O. (2012). Peatlands of the Madre de Dios River of Peru: distribution, geomorphology, and habitat diversity. Wetlands, 32(2), 359-368.
Johnston, A., Auer, T., Fink, D., Strimas‐Mackey, M., Iliff, M., Rosenberg, K. V., … & Kelling, S. (2020). Comparing abundance distributions and range maps in spatial conservation planning for migratory species. Ecological Applications, 30(3), e02058.
Johnston, A., Moran, N., Musgrove, A., Fink, D., & Baillie, S. R. (2020). Estimating species distributions from spatially biased citizen science data. Ecological Modeling, 422, 108927.
Johnston, A., Hochachka, W. M., Strimas‐Mackey, M. E., Ruiz Gutierrez, V., Robinson, O. J., Miller, E. T., … & Fink, D. (2021). Analytical guidelines to increase the value of community science data: An example using eBird data to estimate species distributions. Diversity and Distributions, 27(7), 1265-1277.
Kahl, S., Stöter, F. R., Goëau, H., Glotin, H., Planque, R., Vellinga, W. P., & Joly, A. (2019, September). Overview of BirdCLEF 2019: large-scale bird recognition in soundscapes. In CLEF 2019-Conference and Labs of the Evaluation Forum (Vol. 2380, No. 256). CEUR.
Kahl, S., Wood, C. M., Eibl, M., & Klinck, H. (2021). BirdNET: A deep learning solution for avian diversity monitoring. Ecological Informatics, 61, 101236.
Karra, K., Kontgis, C., Statman-Weil, Z., Mazzariello, J. C., Mathis, M., & Brumby, S. P. (2021, July). Global land use/land cover with Sentinel 2 and deep learning. In 2021 IEEE international geoscience and remote sensing symposium IGARSS (pp. 4704-4707). IEEE.
Kelling, S., Johnston, A., Hochachka, W. M., Iliff, M., Fink, D., Gerbracht, J., … & Yu, J. (2015). Can observation skills of citizen scientists be estimated using species accumulation curves?. PloS one, 10(10), e0139600.
Krabbe, N. and T. S. Schulenberg (2020). Rufous-capped Antthrush (Formicarius colma), version 1.0. In Birds of the World (J. del Hoyo, A. Elliott, J. Sargatal, D. A. Christie, and E. de Juana, Editors). Cornell Lab of Ornithology, Ithaca, NY, USA. https://doi.org/10.2173/bow.rucant2.01
La Sorte, F. A., & Somveille, M. (2020). Survey completeness of a global citizen‐science database of bird occurrence. Ecography, 43(1), 34-43.
Landau, H. J. (1967). Sampling, data transmission, and the Nyquist rate. Proceedings of the IEEE, 55(10), 1701–1706.
Lang, N., Jetz, W., Schindler, K., & Wegner, J. D. (2022). A high-resolution canopy height model of the Earth.arXiv preprint arXiv:2204.08322 .
Laurance, S. G., Stouffer, P. C., & Laurance, W. F. (2004). Effects of road clearings on movement patterns of understory rainforest birds in central Amazonia. Conservation biology, 18(4), 1099-1109.
Leach, E. C., Burwell, C. J., Ashton, L. A., Jones, D. N., & Kitching, R. L. (2016). Comparison of point counts and automated acoustic monitoring: detecting birds in a rainforest biodiversity survey. Emu, 116(3), 305-309.
MacKenzie, D. I., Nichols, J. D., Lachman, G. B., Droege, S., Andrew Royle, J., & Langtimm, C. A. (2002). Estimating site occupancy rates when detection probabilities are less than one. Ecology, 83(8), 2248-2255.
MacKenzie, D. I., & Bailey, L. L. (2004). Assessing the fit of site-occupancy models. Journal of Agricultural, Biological, and Environmental Statistics , 9 (3), 300-318.
Marcondes, R. S., & Brumfield, R. T. (2019). Fifty shades of brown: Macroevolution of plumage brightness in the Furnariida, a large clade of drab Neotropical passerines. Evolution, 73(4), 704-719.
Marra, P. P., & Remsen Jr, J. V. (1997). Insights into the maintenance of high species diversity in the Neotropics: habitat selection and foraging behavior in understory birds of tropical and temperate forests.Ornithological Monographs , 445-483.
Matutini, F., Baudry, J., Pain, G., Sineau, M., & Pithon, J. (2021). How citizen science could improve species distribution models and their independent assessment. Ecology and evolution, 11(7), 3028-3039.
Maximiano, M. F. D. A., d’Horta, F. M., Tuomisto, H., Zuquim, G., Van Doninck, J., & Ribas, C. C. (2020). The relative role of rivers, environmental heterogeneity and species traits in driving compositional changes in southeastern Amazonian bird assemblages. Biotropica, 52(5), 946-962.
Melo, A. S., Rangel, T. F. L., & Diniz‐Filho, J. A. F. (2009). Environmental drivers of beta‐diversity patterns in New‐World birds and mammals. Ecography, 32(2), 226-236.
Mere Roncal, C., Middendorf, E., Forsyth, A., Cáceres, A., Blake, J. G., Almeyda Zambrano, A. M., & Broadbent, E. N. (2019). Assemblage structure and dynamics of terrestrial birds in the southwest Amazon: a camera‐trap case study. Journal of Field Ornithology, 90(3), 203-214.
Miller, D. A., Pacifici, K., Sanderlin, J. S., & Reich, B. J. (2019). The recent past and promising future for data integration methods to estimate species’ distributions. Methods in Ecology and Evolution, 10(1), 22-37.
Nobre, A. D., Cuartas, L. A., Hodnett, M., Rennó, C. D., Rodrigues, G., Silveira, A., & Saleska, S. (2011). Height Above the Nearest Drainage–a hydrologically relevant new terrain model. Journal of Hydrology , 404 (1-2), 13-29.
Oliveira, U., Vasconcelos, M. F., & Santos, A. J. (2017). Biogeography of Amazon birds: rivers limit species composition, but not areas of endemism. Scientific Reports, 7(1), 1-11.
Oliveros, C. H., Field, D. J., Ksepka, D. T., Barker, F. K., Aleixo, A., Andersen, M. J., … & Faircloth, B. C. (2019). Earth history and the passerine superradiation. Proceedings of the National Academy of Sciences, 116(16), 7916-7925.
Pacifici, K., Reich, B. J., Miller, D. A., Gardner, B., Stauffer, G., Singh, S., … & Collazo, J. A. (2017). Integrating multiple data sources in species distribution modeling: a framework for data fusion. Ecology, 98(3), 840-850.
Patten, M. A., J. van Dort, and P. F. D. Boesman (2021). Black-faced Antthrush (Formicarius analis), version 1.1. In Birds of the World (T. S. Schulenberg, Editor). Cornell Lab of Ornithology, Ithaca, NY, USA. https://doi.org/10.2173/bow.blfant1.01.1
Pearson, D. L. (1977). Ecological relationships of small antbirds in Amazonian bird communities. The Auk, 94(2), 283-292.
Peele, A. M., Marra, P. M., Sillett, T. S., & Sherry, T. W. (2015). Combining survey methods to estimate abundance and transience of migratory birds among tropical nonbreeding habitats. The Auk: Ornithological Advances, 132(4), 926-937.
Pérez-Granados, C., & Schuchmann, K. L. (2021). Passive acoustic monitoring of Chaco Chachalaca (Ortalis canicollis) over a year: vocal activity pattern and monitoring recommendations. Tropical Conservation Science, 14, 19400829211058295.
Pérez-Granados, C., & Schuchmann, K. L. (2022). Automated signal recognition as a useful tool for monitoring little-studied species: The case of the Band-tailed Nighthawk. Ecological Informatics, 101861.
Pitman, N. C. A. (2010). An overview of the Los Amigos watershed, Madre de Dios, southeastern Peru. Amazon Conservation Association, Washington, DC.ß
Pya N (2021). scam: Shape Constrained Additive Models . R package version 1.2-12. https://CRAN.R-project.org/package=scam.
QGIS.org, 2022. QGIS Geographic Information System. QGIS Association. http://www.qgis.org
R Core Team (2022). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/.
Ramesh, V., Gupte, P. R., Tingley, M. W., Robin, V. V., & DeFries, R. (2022). Using citizen science to parse climatic and land cover influences on bird occupancy in a tropical biodiversity hotspot. Ecography, 2022(9), e06075.
Robinson, S. K., & Terborgh, J. (1995). Interspecific aggression and habitat selection by Amazonian birds. Journal of Animal Ecology, 1-11.
Robinson, O. J., Ruiz‐Gutierrez, V., & Fink, D. (2018). Correcting for bias in distribution modeling for rare species using citizen science data. Diversity and Distributions, 24(4), 460-472.
Robinson, O. J., Ruiz‐Gutierrez, V., Reynolds, M. D., Golet, G. H., Strimas‐Mackey, M., & Fink, D. (2020). Integrating citizen science data with expert surveys increases accuracy and spatial extent of species distribution models. Diversity and Distributions, 26(8), 976-986.
Rosenberg, G. H. (1990). Habitat specialization and foraging behavior by birds of Amazonian river islands in northeastern Peru. The Condor, 92(2), 427-443.
Rumelt, R. B., Basto, A., & Mere Roncal, C. (2021). Automated audio recording as a means of surveying tinamous (Tinamidae) in the Peruvian Amazon. Ecology and Evolution11 (19), 13518-13531.
Sánchez, A. C., Bandopadhyay, S., Briceño, N. B. R., Banerjee, P., Guzmán, C. T., & Oliva, M. (2021). Peruvian Amazon disappearing: Transformation of protected areas during the last two decades (2001–2019) and potential future deforestation modelling using cloud computing and MaxEnt approach. Journal for Nature Conservation, 64, 126081.
Schulenberg, T. S., Stotz, D. F., Lane, D. F., O’Neill, J. P., & Parker, T. A. (2010). Birds of peru. In Birds of Peru . Princeton University Press.
Segal, R. D., Massaro, M., Carlile, N., & Whitsed, R. (2021). Small‐scale species distribution model identifies restricted breeding habitat for an endemic island bird. Animal Conservation, 24(6), 959-969.
Sheng, Y., & Alsdorf, D. E. (2005). Automated georeferencing and orthorectification of Amazon basin-wide SAR mosaics using SRTM DEM data. IEEE transactions on geoscience and remote sensing, 43(8), 1929-1940.
Shirley, S. M., Yang, Z., Hutchinson, R. A., Alexander, J. D., McGarigal, K., & Betts, M. G. (2013). Species distribution modeling for the people: unclassified landsat TM imagery predicts bird occurrence at fine resolutions. Diversity and Distributions, 19(7), 855-866.
Silva-Junior, C. H., Pessoa, A., Carvalho, N. S., Reis, J. B., Anderson, L. O., & Aragão, L. E. (2021). The Brazilian Amazon deforestation rate in 2020 is the greatest of the decade. Nature Ecology & Evolution, 5(2), 144-145.
Silveira, M. V., Silva‐Junior, C. H., Anderson, L. O., & Aragão, L. E. (2022). Amazon fires in the 21st century: The year of 2020 in evidence. Global Ecology and Biogeography, 31(10), 2026-2040.
Steen, V. A., Elphick, C. S., & Tingley, M. W. (2019). An evaluation of stringent filtering to improve species distribution models from citizen science data. Diversity and Distributions, 25(12), 1857-1869.
Stevens, H. C., Metz, E. M., Saboya Del Castillo, P., Alván, J. D., & Bowler, M. T. (2019). Use of autonomous audio recordings for the rapid inventory of birds in the white‐sand forests of the Peruvian Amazon. Journal of Field Ornithology, 90(1), 70-79.
Stouffer, P. C. (1997). Interspecific aggression in Formicarius antthrushes? The view from central Amazonian Brazil. The Auk, 114(4), 780-785.
Stowell, D., Wood, M., Stylianou, Y., & Glotin, H. (2016, September). Bird detection in audio: a survey and a challenge. In 2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP) (pp. 1-6). IEEE.
Stowell, D. (2022). Computational bioacoustics with deep learning: a review and roadmap. PeerJ, 10, e13152.
Stratford, J. A., & Stouffer, P. C. (2013). Microhabitat associations of terrestrial insectivorous birds in Amazonian rainforest and second‐growth forests. Journal of Field Ornithology, 84(1), 1-12.
Strimas-Mackey, M., Miller, E., & Hochachka, W. (2018). auk: eBird Data Extraction and Processing with AWK. R package version 0.3. 0.
Strimas-Mackey, M., Hochachka, W. M., Ruiz-Gutierrez, V., Robinson, O. J., Miller, E. T., Auer, T., … & Johnston, A. (2020). Best practices for using eBird Data. Version 1.0. Ithaca, NY: Cornell Laboratory of Ornithology.
Stutchbury, B. J., & Morton, E. S. (2008). Recent advances in the behavioral ecology of tropical birds. The Wilson Journal of Ornithology, 120(1), 26-37.
Sugai, L. S. M., Silva, T. S. F., Ribeiro Jr, J. W., & Llusia, D. (2019). Terrestrial passive acoustic monitoring: review and perspectives. BioScience, 69(1), 15-25.studied species: The case of the Band-tailed Nighthawk. Ecological Informatics, 101861.
Sullivan, B. L., Wood, C. L., Iliff, M. J., Bonney, R. E., Fink, D., & Kelling, S. (2009). eBird: A citizen-based bird observation network in the biological sciences. Biological conservation, 142(10), 2282-2292.
Sullivan, B. L., Aycrigg, J. L., Barry, J. H., Bonney, R. E., Bruns, N., Cooper, C. B., … & Kelling, S. (2014). The eBird enterprise: An integrated approach to development and application of citizen science. Biological conservation, 169, 31-40.
Terborgh, J., Robinson, S. K., Parker III, T. A., Munn, C. A., & Pierpont, N. (1990). Structure and organization of an Amazonian forest bird community. Ecological Monographs, 60(2), 213-238.
Thornton, D. H., Branch, L. C., & Sunquist, M. E. (2012). Response of large galliforms and tinamous (Cracidae, Phasianidae, Tinamidae) to habitat loss and fragmentation in northern Guatemala. Oryx, 46(4), 567-576.
Tobler, M. W., Carrillo‐Percastegui, S. E., Leite Pitman, R., Mares, R., & Powell, G. (2008). An evaluation of camera traps for inventorying large‐and medium‐sized terrestrial rainforest mammals. Animal conservation, 11(3), 169-178.
Van Eupen, C., Maes, D., Herremans, M., Swinnen, K. R., Somers, B., & Luca, S. (2021). The impact of data quality filtering of opportunistic citizen science data on species distribution model performance. Ecological Modelling, 444, 109453.
Velásquez-Tibatá, J., Salaman, P., & Graham, C. H. (2013). Effects of climate change on species distribution, community structure, and conservation of birds in protected areas in Colombia. Regional Environmental Change, 13(2), 235-248.
Wilkinson, D. (2020). Fitting occupancy models with unmarked. https://doi90.github.io/lodestar/fitting-occupancy-models-with-unmarked.html
Winkler, D. W., S. M. Billerman, and I. J. Lovette (2020). Antthrushes (Formicariidae), version 1.0. In Birds of the World (S. M. Billerman, B. K. Keeney, P. G. Rodewald, and T. S. Schulenberg, Editors). Cornell Lab of Ornithology, Ithaca, NY, USA. https://doi.org/10.2173/bow.formic2.01
Wright, M.N., & Ziegler, A. (2017). ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R. Journal of Statistical Software, 77(1), 1-17. doi:10.18637/jss.v077.i01
Wood, C., Sullivan, B., Iliff, M., Fink, D., & Kelling, S. (2011). eBird: engaging birders in science and conservation. PLoS biology, 9(12), e1001220.
Wood, C.M. and Peery, M.Z. (2022), What does ‘occupancy’ mean in passive acoustic surveys?. Ibis, 164: 1295-1300. https://doi.org/10.1111/ibi.13092
Zhang, G. (2020). Spatial and temporal patterns in volunteer data contribution activities: A case study of eBird. ISPRS International Journal of Geo-Information, 9(10), 597.
Zhong, M., Taylor, R., Bates, N., Christey, D., Basnet, H., Flippin, J., … & Ferres, J. L. (2021). Acoustic detection of regionally rare bird species through deep convolutional neural networks. Ecological Informatics, 64, 101333.
Zimmer, K. and M.L. Isler (2020). Goeldi’s Antbird (Akletos goeldii), version 1.0. In Birds of the World (J. del Hoyo, A. Elliott, J. Sargatal, D. A. Christie, and E. de Juana, Editors). Cornell Lab of Ornithology, Ithaca, NY, USA. https://doi.org/10.2173/bow.goeant1.01
Zimmer, K. and M.L. Isler (2020). White-throated Antbird (Oneillornis salvini), version 1.0. In Birds of the World (J. del Hoyo, A. Elliott, J. Sargatal, D. A. Christie, and E. de Juana, Editors). Cornell Lab of Ornithology, Ithaca, NY, USA. https://doi.org/10.2173/bow.whtant1.01
Figure 1: (a) survey points at Los Amigos Biological Station and (b) eBird checklists within Madre de Dios. Most eBird checklist locations occur near riverine corridors, whereas structured survey points span the HAND gradient. Black and yellow box in (b) denotes extent of (a).