Introduction

Infectious agents (bacteria, viruses, fungi, and parasites; also referred to as pathogens throughout) are strong regulators of the abundance, distribution, and life histories of animals1. Animal movements and migrations may provide refuge from infections through a departure from infested locations but can also elevate infection risk by exposing hosts to new agents or aggregating stressed hosts2. The interplay of host movements, alternate host ranges, anthropogenic activities, and environmental conditions determines the spatiotemporal distribution of infected hosts, sometimes resulting in clusters of infected individuals known as “hotspots.” For infectious agents that are not well-studied in wild populations or are known to have population level impacts, identifying infection hotspots is an essential step in further understanding or mediating and managing problematic infections. For example, a recent hotspot analysis of the SARS-CoV-2 virus contributed to the epidemiological knowledge of the novel virus while also informing public health resource allocation3.

While many vertebrates experience some level of infection in their lifetimes, not all infections lead to disease and pathogen-induced mortality. However, sub-lethal impacts of infection (e.g. pathogen-induced changes to mobility or behavior) in wild organisms can lead to selective predation, reducing the likelihood of observing infected individuals4,5,6. Mobile aquatic organisms, already elusive, are rarely recovered after perishing in marine or riverine environments, often as a result of predation7,8. In these cases, documenting infectious agent presence independent of, or at least prior to, clinical disease is necessary for determining the distribution of individuals infected by or carrying a given pathogen. Such a task is well-suited to molecular screening methods. High-throughput quantitative polymerase chain reaction (HT-qPCR) platforms can provide datasets ideal for studying dozens of infectious agents simultaneously, across large numbers of test subjects5,9. These methods have revolutionized our ability to study the spatial distribution of infections, including those contributing to indirect mortality via sub-lethal effects, and identify infection hotspots.

Although Pacific salmon (Oncorhynchus spp.) hold great cultural and economic importance, few studies10,11,12 have documented infectious agent distributions (aside from sea lice) in wild salmon over large geographic scales. The majority of relevant knowledge regarding pathogen distributions in the marine environment comes from studies of domesticated fish in marine aquaculture netpens7. Because conditions in netpens differ substantially from the wild in terms of host species, density, predation risk, foraging opportunity, and freedom of movement, infections in aquaculture cannot accurately represent infectious agent distributions in nearby wild populations7,13. Meanwhile, the incidence of marine disease across a broad range of flora and fauna has increased in recent decades14,15,16, highlighting the potential importance of infectious agents for wild salmon during their marine phase. Populations of Sockeye (O. nerka), Coho (O. kisutch), and Chinook (O. tshawytscha) salmon in British Columbia have experienced declines in recent decades17,18,19. One explanation for these declines centers on poor early marine survival20,21,22,23, variously ascribed to predation24, starvation21,25, large-scale oceanic regime shifts26,27, and inter-species competition22,28. A recent study provided evidence that some infectious agents are associated with population-level survival in wild salmon29.

Differences in behavior, physiology and life history between Oncorhynchus spp. of the Northeast Pacific Ocean result in varying exposure and susceptibilty to infectious agents. For example, variations in freshwater rearing habitats have important implications for exposure to pathogens. Sockeye salmon usually rear in lakes for at least a year, whereas Coho salmon typically rear in small streams for a year, and Chinook salmon rear either in large streams for a year or move into the estuary within months of emergence (stream-type vs ocean-type, respectively)30. Freshwater-transmitted infections may carry over into the marine environment, and have measurable impact there11,12,29. Evolved immunological differences between Oncorhynchus spp. result in varied susceptibility to some pathogens31,32, which may manifest in spatial variation in prevalence between species. Migratory behavior in the ocean has important implications for marine infections and our ability to monitor them. Upon ocean entry, most stocks of Sockeye salmon in the Northeast Pacific Ocean migrate rapidly northward33. The majority of Chinook salmon remain within 200 km of their river estuary of origin for their first marine year, except for yearling Columbia River populations that behave more like Sockeye34,35. Coho salmon demonstrate more variability, with some stocks moving shorter distances than others in the first marine year, and larger-bodied individuals migrating further north36.

In this study we sought to identify infection hotspots for 56 infectious agents in Chinook, Sockeye, and Coho salmon during their first marine year. We used a spatial epidemiological tool37 to determine the locations of prevalence clusters for each infectious agent in each salmon host species. We overlaid clusters from all three salmon host species to identify locations in the study area with multiple hotspots across pathogen taxa. We qualitatively related cluster location to known aspects of host population of origin and pathogen biology. Our goal is to provide a resource to further the ecological knowledge regarding these pathogens, inform management activities (e.g., remediation), and prioritize locations for further research on select pathogens.

Results

A total of 41 infectious agent taxa were detected (Chinook = 38, Coho = 36, Sockeye = 37) out of a total of 56 assayed (Table 1). Individual fish had positive detections for an average of 4.0 to 4.8 pathogens (total pathogen taxa; range Chinook = 0–12, Coho = 0–10, Sockeye = 0–13), with little variability between species and seasons (Table 1). Infection cluster analysis revealed that most infectious agents had one or more areas with higher or lower infection prevalence than expected by chance. We present the marine-transmitted microsporidian, Loma salmonae, as an illustrative example (Fig. 1). Descriptions of cluster locations for each pathogen and corresponding maps can be found in Supplementary Material 1. Boxplots of pathogen loads divided by species and season are also available in Supplementary Material 1.

Table 1 Prevalence of infectious agents in Pacific salmon mixed-tissue marine samples (2008–2018) tested by HT-qRT-PCR (DFO Pacific Biological Station, Nanaimo, BC).
Figure 1
figure 1

Infection clusters for the marine-transmitted microsporidian, Loma salmonae, occurred for all host species but prevalence was greatest in Coho salmon. The color of 30 km hexagons indicates L. salmonae prevalence while empty cells indicate that samples were collected but the pathogen was not detected (0% prevalence). Red circles indicate positive clusters (significantly higher than random likelihood of infection or infection intensity). Blue circles indicate significantly lower than expected regions of infection. Similar maps for 40 other pathogen taxa are available in Supplementary Material 1. Basemap data are from the GSHHG (Global Self-consistent, Hierarchical, High-resolution Geography) Database38. The coordinate system for the data is WGS 1984 and the maps are projected in NAD 1983.

Genetic stock identification

Genetic stock identification revealed that the majority of Chinook and Sockeye salmon sampled along the west coast of Vancouver Island (WVI) in spring-summer originated in the Columbia River (Fig. 2, Tables S1, S3) whereas for Coho salmon, the largest population grouping found in WVI during the spring-summer was from Washington State, followed by the Columbia River (Table S2). In fall-winter, few Columbia River salmon remained in the study area and WVI samples were dominated by WVI origin Chinook and Washington origin Coho salmon. Relatively few Sockeye salmon were sampled in the WVI region during fall-winter. Year round, fish sampled on the east coast of Vancouver Island (EVI) were predominantly from East Vancouver Island rivers and the Fraser River. Chinook salmon sampled in the northern coastal areas tended to originate from northern watersheds whereas for Sockeye salmon, Fraser River origin fish represented the largest population grouping sampled in this region (Fig. 2, Tables S1, S3).

Figure 2
figure 2

Regional stock compositions, as determined by genetic stock identification, for Chinook salmon collected along the British Columbia coast, 2008–2018. Red outline in inset panel indicates study area extent. Pie charts represent the proportions of stock groupings sampled from regions adapted from DFO’s Marine Adaptive Zones39 (black borders, text in right panel). Total Chinook sampled in each zone provided. Total sampled per zone and stock percentages for all species are in Tables S1, S2, S3. Basemap data are from the GSHHG (Global Self-consistent, Hierarchical, High-resolution Geography) Database38. The coordinate system for the data is WGS 1984 and the maps are projected in NAD 1983.

Clusters summed across species and total pathogen taxa

In both seasons, a hexagon containing a high number of cluster centers (nine in spring-summer, six in fall-winter) occurred in southern EVI (Fig. 3, Supplementary Material 2). In spring-summer, this high density hexagon was found in the Gulf Islands, containing the Cowichan River estuary and Salt Spring Island (clusters for Chinook: Piscirickettsia salmonis, Renibacterium salmoninarum, Candidatus Syngnamydia salmonis, Paranucleospora theridion, Myxobolus arcticus, Ichthyophthirius multifiliis, Atlantic Salmon Calicivirus; Sockeye: P. theridion, M. arcticus). During the fall-winter, the most cluster centers fell in a hexagon northwest of the Fraser River estuary, encompassing the mouth of Howe Sound (clusters for Chinook: Rickettsia-like Organism (RLO), Tenacibaculum maritimum, Ceratonova shasta; Sockeye: RLO, M. arcticus, Parvicapsula minibicornis). High density clusters (7–8 cluster centers per 30 km hexagon) were found elsewhere along the EVI in the spring-summer, from the Strait of Georgia (SOG) through Johnstone Strait (Fig. 3, Supplementary Material 2).

Another group of hexagons with high density clusters was found along WVI in the spring-summer (Fig. 3). These clusters were primarily composed of Columbia River salmon (Chinook: 84–100%, Sockeye: 75–100%, Coho: 50–100%) and included C. shasta (n clusters per species: Chinook = 4, Sockeye = 5, Coho = 2), P. minibicornis (n: 4, 0, 0), Piscine orthoreovirus (PRV; n: 3, 2, 0), and Facilispora margolisi (n: 3, 0, 1). WVI was also home to the 30 km hexagon with the highest cluster count in the fall-winter (Fig. 3), a hexagon with seven clusters (five in Chinook, two in Coho) positioned over Quatsino Sound (clusters included F. margolisi, L. salmonae, P. theridion (also in Coho), Parvicapsula kabatai, Tetracapsuloides bryosalmonae, and Erythrocytic Necrosis Virus (ENV, in Coho)).

Visualization of mean total pathogen taxa (the number of pathogen taxa detected per individual) revealed lower counts at the upstream ends of mainland fjords relative to other regions (Supplementary Material 1). Ordinal cluster analysis of total pathogen taxa indicated that all host species had low clusters in the SOG during fall-winter. Coho salmon had consistent (in both spring-summer and fall-winter) positioning of a low cluster in the southern SOG and a high cluster in Quatsino Sound. Chinook had multiple high clusters along the SOG and WVI in spring-summer and a single high cluster centered on Nootka Sound (WVI) in fall-winter. Sockeye had high clusters for total pathogen taxa in Johnstone Strait in spring-summer and a single high cluster along the Alaska coastline in fall-winter.

Figure 3
figure 3

The total number of statistically significant positive prevalence cluster centers per 30 km hexagon. Empty cells indicate that samples were collected in these locations but no clusters were centered there. An interactive map that identifies pathogen and host species identities for clusters in each hexagon is available in Supplementary Material 2.

Discussion

In an era when many populations of wild Pacific salmon are experiencing consistently poor survival19, and both human activities and rapid climate change are undermining the host–pathogen homeostasis of marine organisms40, greater understanding of the role of infectious agents in the survival of wild salmon is essential. An important step towards this understanding is to determine the distributions of infectious agents within wild salmon hosts in the marine environment. Through the combination of extensive marine sampling, HT-qPCR, and spatial analysis we substantially expanded the current knowledge regarding pathogen distributions in Chinook, Coho, and Sockeye salmon along the coast of British Columbia. Using purpose-built software37 to reveal higher- and lower-than random spatial clustering in pathogen prevalence, we found a high density of clusters for multiple infectious agent taxa within the Salish Sea along the east coast of Vancouver Island, a major migration route and rearing area for multiple species of Pacific salmon. Many notable taxon-specific patterns were also observed and are discussed in Supplementary Material 1.

Locations where hotspots for multiple infectious agents overlap may contribute to detrimental co-infections5,41, can be the result of anthropogenic activities42,43, and could be ideal targets for remediation (e.g., locations where habitat restoration or reductions to municipal or industrial effluent could reduce infection pressure42,44). We found a high density of infection clusters occurring in both seasons in the southern Strait of Georgia. This region is home to the highest urban density in British Columbia, has experienced the most rapid increase in sea surface temperature in the study area45, and is a rearing area and migratory route for multiple declining populations of Pacific salmon18,19.

High densities of pathogen clusters occurred along the east side of Vancouver Island throughout the year. The multiple clusters located around Salt Spring Island at south EVI were composed of freshwater (four) and seawater (five) transmitted pathogens, found in primarily EVI and Fraser River populations of Chinook and Sockeye salmon. Abundant wild and hatchery salmon populations in the Cowichan River, close proximity to human population density42,46, industrial activity, elevated SST45, and poor marine rearing conditions46 are all factors that could have contributed to abundant infection clusters here. Moving north through the SOG, another concentration of infection clusters was detected at the south end of Texada Island, primarily composed of seawater-transmitted infections (e.g., ENV, P. kabatai, I. hoferi), which could be expected following departure from the lower salinity of the Fraser River plume and with increased encounters with Pacific herring, Clupea pallasii (true for ENV and I. hoferi, see below). High densities of infection clusters persisted through the Discovery Islands and Johnstone Strait, regions featuring few feeding opportunities47 and home to a large component of British Columbia’s salmon aquaculture (at the time our samples were collected). Multiple clusters of T. maritimum, a bacterium with evidence of elevated transmission from aquaculture to wild fish in the Discovery Islands48 and potential impacts on wild populations29, were detected in wild Chinook and Sockeye (two clusters for each host) in this region, similar to previous observations49.

Spring-summer infection clusters occurring at a high density along WVI were primarily composed of Columbia River origin salmon. Conditions in the Columbia River including high water temperature, low-current reservoirs, and abundant sediment are ideal for transmission of P. minibicornis and C. shasta50,51, two of the pathogens with multiple clusters along WVI. The migration of juvenile salmon from the Columbia River past Vancouver Island could act as a route of transmission of infectious agents from one region to another. However, C. shasta and P. minibicornis are parasites incapable of horizontal transmission. In contrast, PRV, which also composed spring-summer WVI clusters with predominately Columbia River fish, could be transmitted from one population to another during long-distance migrations52. A recent study found that Columbia River Chinook salmon sampled throughout our study area all shared a common lineage of PRV, suggesting that they were infected in freshwater and then dispersed53. While PRV is already widely dispersed in BC, this potential transmission route could be of future importance in the context of the emergence of virulent pathogens, a phenomenon accelerated by high density fish culture facilities54 such as the many hatcheries on the Columbia River and BC salmon farms.

In the fall-winter period, Quatsino Sound, on the northwest coast of Vancouver Island, was the location with the highest density of infection clusters. Inlets and embayments sometimes host higher infectious agent densities, potentially due to constrained water circulation, elevated temperatures, or seasonal interactions with other host species55,56,57. The Marble River enters Quatsino Sound in an eastern arm and contributes a Chinook salmon population that is resident inside the sound for its first marine year58, and this population composed 78% of our samples here. It is possible, therefore, that unknown factors specific to this population could play a role in the high density of infection clusters here. This region also contains a relatively high density of aquaculture facilities in a narrow fjord, and recent work has shown that nucleic acids of multiple pathogens tend to be elevated around active aquaculture59 and that the probability of infection with some agents increases with proximity to aquaculture48,53.

Fall-winter clusters for PRV in Chinook salmon were detected in Quatsino and Nootka Sounds along WVI, in stocks originating in this area. In addition to the evidence indicating likely transmission of PRV from aquaculture to wild fish53, Atlantic salmon freshwater hatcheries were previously shown to harbor PRV infections60 and thus we might consider whether Pacific salmon enhancement facilities in BC could also play a role in transmission. Further sampling in the WVI region could help determine whether or not a PRV reservoir persists in WVI inlets (either of anthropogenic or natural origin) and could potentially be remeditated.

Cluster locations for total pathogen taxa within individuals did not correspond to locations with overlapping cluster centers from multiple pathogens. This indicates that while clusters from multiple pathogens could be spatially concentrated, individual fish in such a location were not necessarily burdened with a high number of infections. Tucker et al.61 found that mean total pathogen taxa in early marine Chinook salmon increased from 1.5 to 4.2 following marine entrance but then declined to 3.0 in winter. Accordingly, we saw evidence of lower total pathogen taxa within individuals in mainland fjords where collected fish were more likely to have recently left freshwater. While we assayed approximately 10 more pathogens than Tucker et al.61 and we did not see a drop in total pathogen taxa in the fall-winter, the fact that total pathogen taxa tends to average around four in both studies may indicate that there is some upper threshold to the number of different pathogens a Pacific salmon host can sustain, perhaps due to elevated mortality and greater predation risks at higher numbers of co-infections5,6,29. If this is the case, the regions where we observed higher clusters for total pathogen taxa may indicate locations of elevated mortality for each host species (Supplementary Material 1).

Although the majority of infectious agents detected in this study presented similar prevalences and distributions across host salmon species, there were several exceptions. Inter-host-species differences in pathogen prevalence could arise from species-based variation in exposure due to host life histories and food or habitat preferences, species-based variation in susceptibility, statistical artifacts introduced by spatially and seasonally non-random sampling, and potential biases from the use of qPCR (see below). In some cases, the difference in overall prevalence between host species is consistent with what is known regarding host susceptibility, including for P. salmonis62, C. shasta63, M. arcticus64, and Pacific salmon nidovirus (PsNV), Salmon Pescarenavirus 1 (SPAV1), and SPAV265. For L. salmonae, susceptibility is expected to be similar between Chinook and Coho66,67 but we found 1.5–2 times higher prevalence in Coho. Some less-studied taxa, including Aliivibrio salmonicida, Dermocystidium salmonis, Myxobolus insidiosus, and Pacific Salmon Parvovirus (PSPV), presented inter-host-species differences in prevalence that have not previously been described. While unbalanced sampling may have given rise to inter-species variation in these pathogen taxa, our results could also represent epidemiological clues towards aspects of transmission, host susceptibility, and pathogenicity for hosts. For example, the near absence of the freshwater-transmitted PSPV in Chinook and Coho salmon contrasted with high prevalence in Sockeye salmon suggests that Sockeye are considerably more susceptible to infection with this virus. Note that inter-species differences are very apparent for some pathogen taxa when infective load is visualized (Supplementary Material 1).

As is inevitable with an analysis such as ours, a number of simplifying assumptions had to be made. The primary assumption for a spatial analysis of infectious agents in a migratory species is that pathogen profiles in sampled fish represent environmental pathogen presence at the capture location. Clearly this is not an accurate assumption for Sockeye salmon, which are migrating around 10–25 km/day in the marine environment68. Therefore, the reader must consider that in some cases the position of a cluster reflects experiences earlier in the migration. Of all the Pacific salmon species, Chinook might be the most suitable for such an analysis since many stocks tend to remain within 100–200 km of their natal river until their second ocean year, with Columbia River fish being the highly mobile exception34. In our dataset, the sampling locations of 65% of spring-summer and 39% of fall-winter Chinook salmon were less than 50 km from their natal stream (as determined by GSI).

Although our analysis provided a novel, descriptive spatial analysis of dozens of infectious agents in salmon along the British Columbia coast, it could only be conducted as a purely spatial analysis (data from multiple years combined) due to sample size (with the exception of our division of the data into two seasons). This approach assumes that infection clusters are relatively stable from year to year. While this assumption may be valid for a portion of the pathogens we assayed, it is unlikely to be true for all. Pathogens that appeared at lower prevalence, did not have a life cycle requiring an intermediate host, and were more likely to have acute impacts on infected fish (e.g., some bacteria and viruses such as P. rickettsia and Viral Hemorragic Septicemia Virus) might show more inter-annual variation in distribution. Therefore, when interpreting the presence of a given cluster (particularly one for a pathogen with the life history characteristics just described), it is necessary to consider that a high density of infections may have occurred at this location in a subset of or even just one of the study years. Ideally, samples collected evenly over space and time would have allowed us to conduct a spatiotemporal cluster analysis, but this was not logistically possible given limitations imposed by the expense of marine sampling.

Quantitative PCR using TaqMan probes allows for high assay specificity9. While this may be a positive attribute which is responsible for the relatively low false positive rate of PCR, it may also enable us to overlook closely-related but divergent lineages of certain pathogens, particularly for agents with high mutation rates, most notably RNA viruses69. Consider the example of SPAV1 and SPAV2. Had we only designed primers for one of these closely-related viruses, we would have hypothesized that a salmon arenavirus is found in solely Chinook or Sockeye salmon. Therefore, we must consider that in cases where we saw high host tropism (e.g., PSPV) there may have been a lineage capable of infecting other hosts that went undetected due to the specificity of our assays.

Our study represents the most comprehensive assessment of the marine distributions of infectious agents in three ecologically, culturally, and commercially important salmon species. We have provided observations of the marine distribution of dozens of pathogen taxa for which little is currently known in wild Pacific salmon. We anticipate that these distributions will prove useful to those conducting further studies of pathogens in Pacific salmon as well as those seeking to identify potential locations for remediation. Our high-throughput qPCR methodology, especially paired with new innovations in sampling environmental DNA70, is an efficient tool for monitoring a broad range of salmon pathogens and would prove quite useful for the proactive identification of pathogens in aquaculture and hatchery environments. For research, an important next step is to identify the intrinsic (fish related) and extrinsic (environmental) factors associated with the distributions of pathogens that we have described in this study.

Methods

Field sampling

Data used in this study were from a database created over the course of the Strategic Salmon Health Initiative https://www.canada.ca/en/fisheries-oceans/news/2016/05/strategic-salmon-health-initiative.html and subsets of these data have been used previously in unrelated analyses29,49,71,72. Pacific salmon (n: Chinook = 5719, Coho = 2059, Sockeye = 3387) in their first year of ocean residence (April through March of the following year) were opportunistically collected during several Fisheries and Oceans Canada research programs from 2008 to 2018 (scientific fishing permit MECTS # 2014-502-00249). Samples were collected as far north as Fredrick Sound, Alaska (56\(^{\circ }\)N) and as far south as the mid-Puget Sound, Washington (47.5\(^{\circ }\)N, Fig. 2). Methods used to capture salmon included mid-water trawl (n: Chinook = 4308, Coho = 1869, Sockeye = 2431), purse seine (n: Chinook = 458, Coho = 87, Sockeye = 956), and beach seine (n: Chinook = 953, Coho = 103, Sockeye = 0). Fish were either dead upon landing or euthanized in buffered tricaine methanesulfonate. A tissue sample was taken from the adipose fin or operculum and preserved in 95% ethanol to determine the population of origin for each fish, a process known as genetic stock identification (GSI)73. Fish were either dissected in the field or frozen at − 80 °C after capture and subsequently dissected in the lab. In the field, tissues samples including gill, brain, liver, head kidney, and heart were placed in RNAlater (Qiagen, MD, USA) for 24 h at 4 °C and then frozen at − 80 °C.

All methods were performed in accordance with the relevant guidelines and regulations according to the Canadian Council on Animal Care’s (CCAC) Guide to the Care and Use of Experimental Animals, and project protocols were approved by the federal department of Fisheries and Oceans Canada (DFO) through its Pacific Region Animal Care Committee (Animal Use Protocol Number: 13-008). Where applicable, this study is reported in accordance with ARRIVE guidelines. Because collection of samples was reliant on the operations of multi-use research vessels and the study was observational in nature so that no experimental manipulations were imposed, not all ARRIVE guidelines are applicable.

Because sample distributions, population composition, diets, and infectious agent profiles for salmon have been shown to vary in a biologically significant manner within the first marine year11,61,74, we split samples into the “spring-summer” (April through August, n: Chinook = 2883, Coho = 1062, Sockeye = 2826) and “fall-winter” (September through March, n: Chinook = 2836, Coho = 997, Sockeye = 561) sample periods (sample size precluded further seasonal division). We utilized this same seasonal division in a recent study of the associations between infectious agent prevalence and population-level survival29. No samples were excluded from analysis.

Molecular methods

Tissue samples were screened for the presence of 56 infectious agent taxa (Tables 1, S4, Supplementary Material 1), using HT-qPCR on the Fluidigm Biomark Dynamic ArrayTM microfluidics platform (Fluidigm, San Francisco, CA, USA) at the Pacific Biological Station, Nanaimo, British Columbia, Canada. This platform has been analytically validated for quantitative infectious agent profiling in salmon tissue9 and applied to dozens of studies in Pacific and Atlantic salmon71,75,76. Infectious agent taxa were chosen based on knowledge of their presence in Canada, evidence of their association with disease worldwide, or recent discovery by our group9,65,77. Assays utilizing Taqman probes (Table S4, Supplementary Material 1) were designed to target both RNA and DNA. Not all of the same assays were used over the course of the qPCR runs (over 200 dynamic arrays run over the course of 6 years), as some new assays were developed and others were removed after no detections across previous studies (% of population assayed per pathogen reported in Table 1). Detailed laboratory methods are provided in Supplementary Material 1. We also determined “total pathogen taxa” per individual fish by tallying the number of detected pathogens found in each individual.

Statistical analyses

We used resampling simulations (SatScan [https://www.satscan.org]) to identify clusters of high (and low) prevalence for each infectious agent that was detected at any prevalence greater than zero (Table 1). The resampling simulations identify areas of higher or lower prevalence than would be expected if the pathogen was distributed uniformly across space78. Satscan places circular “scanning windows” of varying sizes around pre-defined geographic coordinates to determine, for a given infectious agent, whether the group of samples within a given window are significantly different in prevalence (percent positive) from all samples outside the scanning window. Statistical significance was determined using a likelihood ratio test to compare groups within and outside the scanning window, and an accompanying p-value generated through Monte Carlo simulation wherein the dataset outside the scanning window is randomly resampled 1000 times (clusters with \(p < 0.05\) are presented in maps). All analyses were conducted in the R statistical language79 using the rsatscan package80. Example R code is provided in Supplementary Material 1.

Samples from all years were aggregated across an evenly spaced grid of hexagons (10 km from center to center) and scanning windows of various sizes were applied to hexagon centers. Thus, this was a purely spatial analysis with the only temporal aspect being that a separate analysis was performed for the spring-summer and fall-winter seasons for each infectious agent. By conducting a set of year-specific (but otherwise identical) analyses on a subset of pathogens, we determined that a purely spatial analysis adequately (if not conservatively) represented cluster locations identified in the year-specific analysis, without creating additional, spurious clusters. We elected to use the purely spatial approach due to its demonstrated ability to identify year-specific clusters, the sparsity of data in some years, and the extreme number of analyses and figures that a year-specific analysis would entail. Because we wanted to prevent scanning windows from crossing land masses (particularly Vancouver Island) we set the maximum scanning window radius to 30 km, with the assumption that the inference from a single large cluster (as might occur if we used the default window size limit, a window that encompasses 50% of the total population sample size) would be similar to that from several adjacent clusters. For a given infectious agent, significant clusters were not allowed to overlap. For all infectious agents we set the model type to the Bernoulli distribution78, which simply requires the input of geographic locations and the number of positive and negative detections per location. We also performed a similar cluster analysis on total pathogen taxa (a tally of the pathogens detected per individual) but using the ordinal model type81. To visualize the spatial distribution of clusters we calculated and color-coded infectious agent prevalence for a grid of 30 km hexagons superimposed across the sample locations. Significant (\(p < 0.05\)) scanning windows of both high (red) and low (blue) clusters were superimposed onto this base map. We also tallied cluster centers across all infectious agents and host species using the 30 km hexagonal grid to visually determine whether there were any areas with high densities of clusters across multiple pathogen taxa.