Improved estimation of hunting harvest using covariates at the hunting management precinct level

Page created by Ross Wade
 
CONTINUE READING
Improved estimation of hunting harvest using covariates at the hunting management precinct level
Linköping University | Department of Physics, Chemistry and Biology
 Type of thesis, 60 hp | Educational Program: Biology
 Autumn term 2020 and Spring term 2021 | LITH-IFM-A-EX—21/3955--SE

Improved estimation of hunting
harvest using covariates at the
hunting management precinct level

Paula Jonsson

Examinator: Karl-Olof Bergman
Supervisor: Tom Lindström
Table of content
1. Introduction ........................................................................................................................ 4
2. Material & Methods ........................................................................................................... 5
 2.1 Game species ............................................................................................................... 5
 2.2 Data collection ............................................................................................................. 8
 Reporting game management ............................................................................... 8
 Actual hunting land .............................................................................................. 9
 Climate data........................................................................................................ 10
 Wildlife accidents ............................................................................................... 10
 Geographical distribution ................................................................................... 10
 2.3 Model specification ................................................................................................... 11
 Previous model (Null model) ............................................................................. 11
 Proposed model (Covariate model) .................................................................... 12
 Prior elicitation ................................................................................................... 13
 2.4 Model selection.......................................................................................................... 13
 2.5 Prediction ................................................................................................................... 14
 2.6 Computation .............................................................................................................. 14
 Leave one out cross-validation ........................................................................... 15
3. Results .............................................................................................................................. 15
 3.1 Model selection.......................................................................................................... 15
 Evaluation of predictive performance ................................................................ 16
 Evaluation of the 95 % credible interval ............................................................ 16
 3.2 Posterior prediction sampling .................................................................................... 18
 Predicted median among HMPs ......................................................................... 20
 Predicted variation among HMPs ...................................................................... 23
 Predictive variation at high hunting levels ......................................................... 25
4. Discussion ........................................................................................................................ 27
 4.1 Conclusion ................................................................................................................. 30
5. Societal & ethical considerations ..................................................................................... 31
6. Acknowledgements .......................................................................................................... 31
7. References ........................................................................................................................ 32
8. Appendix A ...................................................................................................................... 36
Abstract
In Sweden, reporting is voluntary for most common felled game, and the number of voluntary
reports can vary between hunting teams, HMP, and counties. In 2020, an improved harvest
estimation model was developed, which reduced the sensitivity to low reporting. However,
there were still some limits to the model, where large, credible intervals were estimated.
Therefore, additional variables were considered as the model does not take into account
landcover among HMPs, [2] the impact of climate, [4] wildlife accidents, and [4] geographical
distribution, creating the covariate model. This study aimed to compare the new model with the
covariate model to see if covariates would reduce the large, credible intervals. Two hypothesis
tests were performed: evaluation of predictive performance using leave one out cross-validation
and evaluation of the 95 % credible interval. Evaluation of predictive performance was
performed by examining the difference in expected log-pointwise predictive density (ELPD)
and standard error (SE) for each species and model. The results show that the covariates model
ranked highest for all ten species, and out of the ten species, six had an (ELPD) difference of
two to four, which implies that there is support that the covariate model will be a better predictor
for other datasets than this one. At least one covariate had an apparent effect on harvest
estimates for nine out of ten species. Finally, the covariate model reduced the large
uncertainties, which was an improvement of the null model, indicating that harvest estimates
can be improved by taking covariates into account.

Keywords: Bayesian statistics, covariates, land cover, credible interval, model selection

 3
1. Introduction
Reliable harvest estimates and hunting regulation are cornerstones of fact-based wildlife
management (Bergqvist et al., 2016). Wildlife management can contribute to the rehabilitation
of regional biodiversity and, due to its direct (interspecific interactions) and indirect functions
(food web and community structure), increase the health of ecosystems (Tsunoda, 2020).
However, managing wildlife is a challenge worldwide since it needs to balance societal and
ecological aspects while considering species' long-term survival (Pelikka et al., 2007; Dressel
et al., 2018). Reliable harvest estimates could reduce the challenges of managing wildlife since
they can indicate changes in population size (monitoring long-term survival of wildlife species)
(Bergqvist et al., 2016). Furthermore, harvest should be continuously adapted to changes in
ecosystems and society, making harvest estimates a fundamental and vital component in most
wildlife management programs (Danell & Bergström, 2010).

Game management and harvest reporting vary among countries. For example, in Cyprus,
Greece, and Great Britain, reporting is entirely voluntary, whereas, in Norway, Iceland, and
Germany, reporting is completely mandatory (Aubry et al., 2020). In Sweden, voluntary
reporting is applied for game species with an open season. The landowner holds the hunting
rights, which means that felled game is always owned by the landowner or a hunter(s) to whom
the landowner has authorized hunting. However, reporting is mandatory for game species with
pre-determined harvest quotas such as moose (Alces alces), red deer (Cercus elaphus), wolf
(Canis lupus), bear (Ursus arctos), lynx (Lynx lynx), and wolverine (Gulo gulo). Harvest of
these species are regulated by licenses or management area plans, and the harvest must be
reported to the County Administrative Boards.

In Sweden, all voluntary reporting is performed through a standardized form or directly in the
database “Viltdata” owned by The Swedish Association for Hunting and Wildlife Management
(SAHWM) (Viltdata, w.y). All harvest reporting and harvest estimations are carried out within
Hunting Management Precincts (HMP, Swedish: jaktvårdskrets), with 313 HMPs throughout
Sweden during the hunting year 2018/2019. Harvest estimates are based on voluntary reporting
from hunting teams within each HMP. The previous harvest estimation method, used by
SAHWM, extrapolates linearly from harvest reports, based on the assumption that harvest rate
occurs uniformly within HMPs (Lindström & Bergqvist, 2020). Furthermore, as the method
does not provide any measures of uncertainty, there was a need to develop a new method in
order to overcome these potential drawbacks.

 4
Since reporting is voluntary, it is not possible to verify if hunting teams report their harvest of
all species or just some species. SAHWM continuously informs and encourages hunting teams
to report their total harvest, regardless of size, species abundance, or hunters' game preferences.
According to SAHWM, it is easier to reach large hunting teams, and because of that, hunting
teams with large areas may be over-represented in the reporting compared with smaller hunting
teams (G. Bergqvist, personal communication, May 4, 2021). However, this is not entirely
certain. Since there are major uncertainties about hunting and reporting preferences, these
uncertainties should be considered in the estimation.

During 2019-2020, Lindström and Bergqvist (2020) introduced a new model for estimating
hunting harvest, using a Hierarchical Bayesian framework. Their framework suggested that
there could be random variability among hunting teams and that there could be an effect of
hunting area per hunting team on harvest rate. Further, their results showed that hunting teams
that hunt in larger areas harvest less per unit area, suggesting that the harvest rate does not scale
linearly with hunting areas (Lindström & Bergqvist, 2020). Their framework improved the
former estimation method since it presented inherent uncertainties while reducing sensitivity to
low reporting rates.

Even if Lindström and Bergqvist’s model improved the former harvest estimation method,
there were still large uncertainties (large credible interval) for some species. The finest spatial
resolution considered from harvest reports used in their model was HMPs. Therefore, I
wanted to investigate if the uncertainty could be reduced by adding covariates to the model. I
will extend Lindström and Bergqvist’s model by adding fixed effects at the HMP level, such
as [1] landcover among HMPs, [2] the impact of climate, and [3] geographical distribution. I
will, like them, use a Hierarchical Bayesian model, with the aim of developing a more robust
model for estimating hunting harvest.

2. Material & Methods
2.1 Game species
This study included ten different game species. Like Lindström and Bergqvist’s (2020) model,
red fox (Vulpes vulpes), Eurasian beaver (Castor fiber), wild boar (Sus scrofa), and European
pine marten (Martes martes) was included as model species. These four species were included
since they exemplify game harvest at high or low numbers and uniformly or variably across
Sweden. In addition, six additional species for which landcover, climate variables, and
geographical distribution were expected to have a substantial impact were included. These six

 5
species are mountain hare (Lepus timidus), western capercaillie (Tetrao urogallus), rock
ptarmigan (Lagopus muta), common eider (Somateria mollissima), American mink (Neovison
vison), and greylag goose (Anser anser).

The red fox is a highly adapted generalist mesopredator and inhabits most parts of Sweden.
Populations of red fox are favoured by increasing agricultural land and infrastructure since such
areas can provide good food resources such as garbage and roadkill (Elmhagen et al., 2015;
Walton & Walton, 2018). Since the early 2000s, red foxes’ populations have increased in the
Arctic tundra, favoring the increase of reindeer carcasses (Elmhagen et al., 2015; Walton &
Walton, 2018).

Populations of Eurasian beaver have been close to extinction, mainly caused by hunting, but
during the latest decades, the populations have been reintroduced in their native range in
Sweden. Beavers are herbivores and live in freshwater throughout Sweden and use mainly
deciduous trees for foraging (Hartman et al., 2008; Willby et al., 2018). They are favoured by
warmer seasons since they can fill their cache during a more extended warmer period before
the winter (Willby et al., 2018).

Wild boars are a widespread mammal that during the latest decades has increased exceedingly,
both in numbers and distribution. They are mainly located in Sweden's south and central parts
(In Swedish: Götaland and Svealand) (Bergqvist & Elmhagen, 2019). The main reason for this
high increase is their high reproductive rate, where one sow can give birth to up to 5-6 offspring
per litter (Malmsten & Dalin, 2016; Vetter et al., 2020). Furthermore, juveniles benefit from
climate change since they survive the winter season due to a higher winter temperature and
greater food availability (Vetter et al., 2020). Wild boars are one of the most common felled
game species (Bergqvist & Elmhagen, 2016).

The European pine marten is a small omnivore that occupies conifer and deciduous forests in
most parts of Sweden (Birks et al., 2005). Their food resource during the winter season consists
of mainly small mammals, especially microtine rodents. During the spring season, pine martens
mainly forage on berries and invertebrates (Helldin, 2000).

Since 1980, mountain hare has decreased in Sweden, possibly due to interactions with the
European hare (Elmhagen et al., 2015). Today, the mountain hare is classified as near threatened
by the Swedish red list (Thurfjell et al., 2020). The mountain hare inhabits a wide range of
biotas, from taiga to tundra and can survive under poor environmental conditions. However,
they are rarely found in agricultural land (Angerbjörn & Flux, 1995; Bergqvist & Elmhagen,

 6
2020). Mountain hares change their food preferences during each semester. During spring, they
forage on herbs and grass, and during winter, they forage on shrubs and branches from
deciduous trees (Bergqvist & Elmhagen, 2020).

Western capercaillie is a grouse that inhabits mature forests (Storch, 2000). Populations of
western capercaillie are highly susceptible to habitat and structure changes, and populations are
declining due to fragmented areas and loss of suitable habitats (Graf et al., 2007). During the
hunting year of 2018/2019, there was a decreasing trend of capercaillie harvest in the south of
Sweden, while harvest still is relatively stable throughout the rest of Sweden (Bergqvist et al.,
2020).

Rock ptarmigan is an herbivorous bird that is inhabiting arctic regions above the treeline. Due
to global warming, there are concerns about rock ptarmigan since there is a loss of valuable
habitats (an increase of shrubs and trees is replacing open areas) in the arctic regions (Watson
& Moss, 2008; Elmhagen et al., 2015).

The common eider is a marine-benthic diver that occupies the marine environment all year-
round throughout Sweden (Robertson, 2016; Houle et al., 2017), where they prey mainly on
molluscs (Fenstad et al., 2016). The common eider has declined dramatically in the last decades,
as has hunting of the species (Ekroos et al., 2012; Bergqvist et al., 2020). The species is reported
as endangered on the Swedish red list (SLU, 2010).

The American mink was introduced to Europe in the early 1900s, mainly for fur farming, and
is classified as an invasive species (Bonesi & Palazon, 2007). However, during the latest years,
there is a decrease in both felled game and mink populations in Sweden, which is seen as
desirable due to an invasive species' potentially ecological risks (Bergqvist et al., 2020). The
American mink is a semi-aquatic carnivore and is often found near lakes, rivers, and/or near the
coast. They eat primarily small vertebrates and crustaceans (Bonesi et al., 2004).

Greylag goose has increased tremendously in the last decades, which is a result of increased
agricultural land and stabilized harvest (Ebbinge, 1991; Kampe-Persson, 2002; Fox &
Abraham, 2017; Bergqvist et al., 2020). Greylag geese are often found in and around lakes
covered by small woods and agricultural land where they cause significant damage (Buij et al.,
2017). Hunting of greylag goose occurs throughout the country, mostly in southern Sweden
(Liljebäck et al., 2019).

 7
2.2 Data collection
 Reporting game management
Each hunting year (July 1 to June 30), hunting teams are free to voluntarily report felled game
through the database Viltdata, owned by SAHWM. In Sweden, there were 313 HMPs during
the hunting year 2018/2019, and in total 7,482 hunting teams reported their harvest, covering
an area of 9,561,156 hectares, corresponding to 29,2 % of the total huntable area. For wildlife
species with voluntary reporting, each report contains the hunting area, the number of felled
individuals of each species, and the HMP in which game were felled.

Each HMP were linked to different covariates, which include landcover (deciduous forest and
not temporary forest, arable land, wetland, water area, and coastline), climate variables, wildlife
accidents (only applies for wild boar), and geographical distribution (Table 1). The
incorporations were performed using R (R Core Team, 2019) with the GIStools-package
(rgeos, rgdal, raster). HMPs that are not included in SAHWM harvest estimation were
excluded from the analysis. These include urban HMPs (Stockholm and Uppsala) and HMPs
above the north's cultivation limit (Vilhelminafjällen and Tärna).

Table 1. Covariates.

Thematic class (combined)
Wetland
Farmland
Open area with vegetation
Open area without vegetation
Infrastructure
Water (actual huntable water)
Not temporary forest
Coniferous forest (will be used as” zero habitat” and not included as a
covariate)
Deciduous forest
Temperature (K)
Precipitation (kg/m2)
Humidity (%)
Coastline lake and river
Coastline sea
Wildlife accidents (only for wild boar)
West and East gradients
South and North gradients

 8
Actual hunting land
The Swedish landcover map (SLC) was downloaded from Naturvårdsverket (Naturvårdsverket,
2020), and it represents landcover as pixels, where each pixel (10*10-meter, 0.01 ha) includes
a thematic class (Naturvårdsverket, 2018). All 313 HMPs specified by SAHWM during the
game period 2018/2019 were merged with SLC, and landcover per HMP was extracted. In order
to extract landcover classes per HMP, all national parks, water areas not suitable for hunting,
and governmentally owned land in each HMP were removed. For all covariate data collection,
each thematic class was calculated as the proportion of total landcover (Jonsson et al., 2020).

The water area suitable for hunting was determined by adding an inner buffer area of 25 m
(representing the range of a hunting shotgun) for each sea, lake, and river per HMP. The rest of
the water areas were excluded. Furthermore, coastline for each sea, lake and river per HMP
were calculated as the circumference of each lake/river and sea.

There are 29 National parks in Sweden, and out of the 313 HMPs, 34 HMPs were located in a
Swedish National Park. A shapefile over national parks in Sweden was collected from the
Swedish data portal by Naturvårdsverket (Naturvårdsverket, 2021), and these were removed (as
no hunting is allowed there). Furthermore, 14 HMPs were located in the Swedish mountain
range and above the cultivation limit and were therefore treated differently. The HMPs that
were included in the mountain chain were, Kiruna, Jokkmokk, Gällivare, Arjeplog, Dorotea,
Tärna, Vilhelminafjällen, Sorsele, Strömsund, Hammerdal, Krokom, Västjämtland, Berg and
Härjedalen HMP.

The cultivation limit in the mountain chain divide Kiruna, Gällivare, Jokkmokk, Arjeplog,
Dorotea, Tärna, Vilhelminafjällen, and Sorsele HMP into an upper and lower limit. Subparts of
HMPs in the lower limit were accepted as huntable land, and parts of HMPs in the upper limit
as not huntable land. The whole area of Tärna and Vilhelminfjällen HMP is located in the upper
part of the cultivation limit and, therefore, not handled as huntable land. However, for Kiruna,
Gällivare, Jokkmokk, Arjeplog, Dorotea, and Sorsele HMP, the upper limit was further divided
into privately and governmentally owned parts, where the privately owned land was accepted
as huntable land.

The first step for Kiruna, Gällivare, Jokkmokk, Arjeplog, Dorotea, and Sorsele HMP was to
subtract national parks from both the upper limits (privately owned land) and lower limit.
Second, excess water was removed, and coastal lines and huntable water areas were calculated
for each sea, lake, or river. These conditions also apply to Strömsund, Hammerdal, Krokom,

 9
Västjämtland, Berg, but instead of the cultivation limit, all HMPs were divided by the reindeer
grazing limit. All landcover above reindeer grazing limit is governmentally owned and was
therefore not accepted as huntable land.

 Climate data
Climate data were collected from the Swedish Meteorological and Hydrological Institute
Application Programming Interface (SMHI API) database. Data contains entries for various
weather parameters, including temperature, relative humidity, and precipitation, recorded every
hour and is presented in a grib square of 2.5*2.5 km2. Each grib square consists of a climate
measuring station with associated longitude and latitude coordinates. Data were read in Python
3.6.3, using the package eccodes. Parameters were extracted from each grib-file with
associated longitude and latitude coordinates and saved as the sum of precipitation (kg/m2),
mean of relative humidity (%), and mean of temperature (K) per year.

Out of the 313 HMPs, seven HMPs (Bokenäset, Lidingö, Danmark-Funbo-Vaksala, Solna,
Sundbyberg, Ljungskile and Södra Skärgården Göteborg) were handled differently, as they did
not overlap between HMP and any climate measuring station, which means that climatic
measurement could not be extracted from these HMPs. As these HMPs did not overlap with
any grib squares, parameter values from adjacent HMP were used to estimate a relative
measurement for each special-case HMP. A convex hull was created to receive parameter
values around all the adjacent HMPs, using R-package sf. Parameter values from every grib
square in adjacent HMPs were extracted, and extracted parameter values from adjacent HMP
were saved as a relative measurement.

 Wildlife accidents

All data used for analysing wildlife accidents were collected from the National wildlife accident
council (viltolycka.se). Each wildlife accident with corresponding coordinates, documented as
SWEREF99, was matched with the associated HMP, receiving an actual number of wildlife
accidents per species, year, county, and HMP. The received data also included species, year,
county, and HMPs when there were zero accidents. For this study, only wildlife accidents for
wild boar were included.

 Geographical distribution

To better estimate where the model species are located in the country and if this could affect
the posterior distribution, a north/south/west/east- gradient was created. The gradient, with

 10
north (N) and east (E) coordinates (SWEREF99), was based at the central point in Sweden. The
central point was denoted to function as a zero-point and was performed using the function
gUnaryUnion from package rgeos. A centroid was then placed in each HMP with the function
gCentroid (package rgeos) and was saved with associated coordinates (N and E coordinates).
Coordinates for each HMP-centroid were then saved in a data frame by subtracting the zero-
point coordinates with the centroid coordinates of each HMP. The last step was to standardize
each coordinate for the respective HMP. Standardizing coordinates was performed by
subtracting the easternmost point from the westernmost point and then divide the E-coordinates
for each HMP with the subtracted value. The same was performed for the north coordinates by
subtracting the northernmost point with the southernmost point and divide the N-coordinates
for each HMP with the subtracted value.

2.3 Model specification
A Hierarchical Bayesian framework estimates the posterior distribution parameters while
allowing parameters to vary probabilistically, and it has become an important tool for many
ecologists (Hooten & Hobbs, 2015). The modeling structure exploits different information
levels to draw conclusions about complex relationships (Clark, 2005). For this study, the
hierarchical model consists of multiple levels, where the available information (hunting teams
that have reported felled game and area) is used for unobserved levels (hunting teams that have
not reported felled game). Therefore, it is an appropriate tool to estimate harvest for unreported
hunting areas.

 Previous model (Null model)

Lindström and Bergqvist improved the formerly used estimation method by taking into account
possible inherent uncertainties. The model consisted of three steps: area analysis, analysing
reported harvest, and posterior prediction.

The first step was to model a set of reported hunting areas (i.e., area of reported hunting teams),
which estimates the distribution of the hunting area for different HMPs and counties. This was
performed by using the mean and standard deviation of reported hunting areas in a Log-Normal
distribution. Exploratory of data, Lindström and Bergqvist revealed that there could be variation
within HMPs in hunting area per hunting team, and within-HMP variation could vary between
counties. Area analysis was performed on a log scale, where the standard deviation is a scale-
free measurement of the variability in hunting teams within each HMP. The results of the area
analysis gave a compilation of the estimated average area per hunting team in each HMP.

 11
The analysis of reported harvest uses the data collected from the area analysis to obtain the
posterior distribution (harvest rate for each hunting team and HMP). Hunting follows a Poisson
process with a specific hunting team rate, where the specific team rate is assumed to be gamma
distributed, which results in a Gamma-Poisson mixture distribution.

The model, which will henceforth be denoted as the null model ( 0 ) accounts for (1) random
variability among hunting teams, (2) rate-variability association, and (3) the effect of hunting
area on harvest rate (Figure 1). However, Lindström and Bergqvist’s results showed that there
were computational problems with the parameter (2), rate-variability association, and that the
importance of the parameter was dubious. Posterior estimates showed that when harvest rate
was low, there was a greater variability between hunting teams in an HMP. Furthermore, it is
likely that game is not present in all areas for species at low densities. Therefore, a reduced null
model, 0 , was included in this study, where (2) rate-variability association was excluded.
Details are found in Lindström and Bergqvist (2020).

 0 0

 (2) Rate- (3) Effect of (3) Effect of
 (1) Variation hunting area (1) Variation hunting area
 variability
 within HMPs on harvest within HMPs on harvest
 association
 rate rate

Figure 1: Parameters included in the null model ( 0 ) and the reduced null model ( 0 ).

 Proposed model (Covariate model)

Like Lindström and Bergqvist, this study also consisted of the following three steps: area
analysis, analysis of reported harvest, and posterior prediction for unreported hunting teams.
Lindström and Bergqvist's model considers random effects of nationwide, county, and HMP
effect, calculating harvest per HMP k and county l.

 µ , = exp⁡( + + , ⁡) (1)

To be able to add the effect of covariates to the null model ( 0 ), an additional parameter was
introduced, , , for HMP k and county l.

 µ , = exp⁡( + + , + , ) (2)

 12
 , are coefficients per covariate (bn ) multiplicated with the covariate value of each covariate
in HMP k and county l (X n,k,l ).

 , = ⁡ b1⁡ ∗ ⁡ X1, , + ⁡ b2⁡ ∗ ⁡ X2, , + ⁡ …⁡b ⁡ ∗ ⁡ , , ⁡⁡ (3)

Creating the covariate model, which adds the effect of covariates to the null model (equation
3). Since Lindström and Bergqvist's results showed that the inclusion of the rate-variability
association parameter was dubious, this parameter was included in this study to see if there
would be an effect when including covariates at the precinct level. This study, therefore,
included one full and one reduced covariate model ( , ⁡ ), which either includes all
three parameters (Figure 1) or excludes (2), the rate-variability association. Then, the full null
model was compared with the full covariate model, and the reduced null model was compared
with the reduced covariates model. This was done to see which of the models ranked highest in
terms of predictive density.

 Prior elicitation

All priors used in this study were equivalent to the ones in Lindström and Bergqvist’s model.
However, an additional prior was added (β), which was determined to be proportional to 1.

2.4 Model selection
This study performed two hypothesis tests: first, a covariate model was selected for each species
based on the 95 % credible interval evaluation. Second, leave one out cross-validation was
performed to evaluate predictive ability between models (covariate model compared to the null
model). Two covariate models were tested, one full covariate model (including rate-variability
association), M and the reduced covariate model (excluding rate-variability association),
M . Each model was always compared to the null model, that is M was compared with
M0 and M ⁡was compared with M 0 .

Model selection was performed for each species. However, for wild boar, two different models
were performed: wild boar including wildlife accidents, where one extra covariate was added,
that is, wildlife accidents for wild boar, and wild boar excluding wildlife accidents. This was
done to see if wildlife accidents would give a better or worse posterior density than when
excluding wildlife accidents. Moreover, all species started with 15 (16 for wild boar including
wildlife accidents) covariates for each model, aiming that in the end, only some covariates
would remain, that is, covariates that actually have an effect on hunting estimate.

 13
Removal of covariates started after the first MCMC simulation. The first step was to determine
which of the 95 % credible interval for each covariate overlapped with zero (i.e., no strong
effect of that covariate on hunting estimate). The second step was to investigate if any non-
influential covariates correlated with another (i.e., determine if there was any covariance
between covariates). Removal of correlated covariates was performed since there may be
problems with identifiability when several covariates show covariation and that it may be
unclear which of the parameters has an effect. If so, removing of one correlated covariate out
of two was determined by removing the covariate with its standard deviation (s.d) closest to
zero from its mean. To summarise, if the 95 % credible interval overlapped with zero, and if
there was a correlation between covariates, only one covariate was removed. This process was
repeated, for each species, until the remaining covariates did not overlap with zero, and the
remaining covariates for each species only positively or negatively affected the amount of
hunting harvest per HMP. The third step was to determine (for each species and model M ,
M , M0 , M 0 ) which of the four models ranked highest in the form of predictive ability.
This was performed by examining the difference in expected log-pointwise predictive density
(ELPD) and standard error (SE) for each species and model, see (exact LOO-CV) - see chapter
Leave one out cross-validation.

2.5 Prediction

Posterior predictive sampling for unreported areas was performed by sampling hunting
team areas with harvest rates for associated hunting teams until all hunting teams
covered all areas in each HMP, and unreported and reported areas were merged.

2.6 Computation
All analyses were performed with the programming language R (R Core Team, 2019), where
all Bayesian modeling were performed with Stan, using the R-package Rstan (Stan Develop
Team, 2020)

The posterior distributions of the considered models do not have a standard form, and, therefore,
numerical methods were required. One of the most frequently used numerical methods for
Hierarchical Bayesian interference is Markov chain Monte Carlo (MCMC) (Monnahan et al.,
2017). An MCMC simulation makes use of random sampling from a target distribution (in this,
case the posterior) (Kurt, 2019). Within the MCMC family, Hamiltonian Monte Carlo (HMC)
is an algorithm that facilitates efficient sampling and avoids inefficient random walk sampling
(Monnahan et al., 2017). However, the HMC algorithm requires expert tuning, and Stan’s

 14
Hamiltonian Monte Carlo algorithm is a flexible modeling software that often outperforms
other samplers (Monnahan et al., 2017). Therefore, for HMC algorithms, I used Stan for faster
computation.

 Leave one out cross-validation

Leave one out cross-validation (LOO-CV) is a statistical method for evaluating predictive
ability, where one observation at a time (in this study, harvest per report) is excluded to later
predict the remaining data of a model (Bürkner et al., 2020). Predictive performance was
compared between models (in this study, the null model was compared to the covariate model).
The comparison was performed through expected log-pointwise density (ELPD), which shows
the relative predictive ability.

Since LOO-CV can be computationally expensive, Pareto Smoothed Importance Sampling
Leave One Out Cross-validation (PSIS-LOO-CV) was used to approximate out of sample
predictive density. This method makes it possible to use LOO-CV, where all the observations,
in this case, reports, were included. Furthermore, PSIS-LOO-CV also flags observations where
the approximation of predictive density is deemed unreliable (i.e., flagged observations- in this
study, hunting reports). Based on Vehtari et al. (2017) recommendations, exact LOO-CV was
performed on observations with a Pareto-k value higher than 0.7. Furthermore, to reduce high
computational demands, exact LOO-CV was only performed if the number of flagged
observations did not exceed ten observations per model.

Evaluation of predictive performance was performed by examining the difference in expected
log-pointwise predictive density (ELPD) and standard error (SE) for each species and model.
A higher ELPD difference is a safer range to ensure that ELPD differences are valid with other
datasets than this one. Two SE differences were considered weak support, and a difference of
four or more SE was considered strong support.

 3. Results
3.1 Model selection
During model selection, Pareto Smoothed Importance Sampling Leave One Out Cross-
Validation (PSIS-LOO-CV), approximated out of sample predictive density where LOO-CV
were not reliable (flagged observations, in this case, reports). Out of ten species (eleven species
including models for wild boar including wildlife accidents), seven species had less than ten
flagged observations with a Pareto k-value higher than 0.7. Rock ptarmigan, American mink,

 15
and greylag goose had more than ten flagged observations with a Pareto k-value higher than 0.7
and were further excluded from exact leave one out cross-validation.

 Evaluation of predictive performance

Evaluation of predictive performance was performed by examining the difference in expected
log-pointwise predictive density (ELPD) and standard error (SE) (ELPD/SE). The higher
difference between ELPD and SE, the more valid the model is for other datasets. For nine out
of ten species, the full covariate model, M , ranked highest of the four models (Table 2). The
reduced covariate model (M ) ranked highest for greylag goose.

The full covariate model only had a pronounced effect for beaver and wild boar (excluding
wildlife accidents) with a SE four times smaller than ELPD (Table 2). For wild boar (including
wildlife accidents), red fox, mountain hare, and common eider, with a SE two to three times
smaller than ELPD, there was weak support that the covariate model would perform better with
other datasets. Finally, for nine out of ten species, at least one covariate had an effect on harvest
estimates. For rock ptarmigan, covariates did not have an apparent effect on harvest estimates
(Table 2).

Table 2. Table shows covariate model predictive performance (column two) and the number of
covariates with an effect on hunting rate (column three).
 Species ELPD (SE) Nr of covariates
 Wild boar -72.5 (29) 10
 Wild boar (incl. wildlife accidents) -99.5 (30.2) 12
 Red fox -41.1 (17) 5
 Greylag goose -5.3 (4.6) 4
 Western capercaillie -16.6 (10.4) 7
 Mountain hare -22.7 (11.4) 5
 European pine marten -2.9 (2.7) 1
 American mink -11.5 (14.2) 3
 Eurasian beaver -40.9 (9) 2
 Common eider -15.2 (5.6) 1
 Rock ptarmigan - 0

 Evaluation of the 95 % credible interval
All game species were initially tested with all 15 covariates (16 for wild boar, including wildlife
accidents). However, if any credible interval overlapped with zero or there was a correlation
between covariates, these covariates were removed.

 16
For nine out of ten species, at least one covariate either had a positive or negative effect on
harvest estimate (Figure 2, Table 2). For example, a positive effect of temperature on harvest
estimate indicates that the higher temperature within an HMP, the more harvest there is for
respective species. A positive effect of West/East (gradients) indicates that the further east in
Sweden, the more harvest there is for separate species. If the credible interval is close to zero,
there is an effect of that covariate on harvest estimate, however vague. If the credible interval
is further away from zero, there is a strong effect on the covariate on harvest estimate.

 17
Figure 2: Boxplot over covariate effect on hunting of A. wild boar, B. wild boar including
wildlife accidents, C. red fox, D. greylag goose, E. western capercaillie, F. mountain hare, G.
European pine marten, H. American mink, I. Eurasian beaver, and J. common eider. The plot
shows the 95 % credible intervals, where the box represents 50 % of all values and the
distance between the percentiles. The black line inside the box indicates the median, and the
dashed line outside the box represents outliers. A negative/positive effect indicates less/more
hunting for the respective covariate. Abbreviations: veg = vegetation, temp. = temporary, CL
= Coastline.

3.2 Posterior prediction sampling
During posterior prediction sampling on unreported areas, all parameters from earlier steps
were used to predict how much has been hunted in the unreported areas. Results are shown with
a 95 % credible interval.

The posterior prediction sampling was performed for the covariate and null model for nine out
of ten game species (excluding rock ptarmigan were no covariates had a clear effect). The 95
% credible interval for the null model and covariate model is shown in figure 3, where a smaller
bar indicates smaller credible interval. The red bar indicates the covariate model, and the grey
bar indicates the null model. By looking at the figures in Figure 3, the credible interval for the
two models differed. In general, the credible interval between the covariate model was smaller

 18
than the null model, and the covariate model estimated a lower median than the null model.
However, this does not apply for all species. For example, European pine marten, where there
was no pronounced difference in the credible interval between models. For American mink, the
credible interval seems to differ for almost every county. By looking at the figures for red fox
and greylag geese, the credibility intervals differ more in counties in northern Sweden than in
counties in southern Sweden. As for common eider, which was only found in three counties,
the credible interval for the covariate model was larger than the credible interval for the null
model (county 14, Figure 3J). However, reports for common eider were only found in three
counties. A list of counties is found in Appendix A.

 19
Figure 3: Variation between models shown with the 95 % credible interval of A. wild boar, B.
wild boar including wildlife accidents, C. red fox, D. greylag goose, E. western capercaillie, F.
mountain hare, G. European pine marten, H. American mink, I. Eurasian beaver, and J.
common eider. Numbers on the x-axis represent the respective county. The upper and lower
limit indicates the 2.5 % and 97.5 % percentiles, and the middle line indicates the median of a
95% credible interval.

 Predicted median among HMPs

During posterior predictive sampling, each model and species received an estimated median for
each HMP. Then, a comparison between models was performed by dividing the estimated
median for the null and covariate model (Median of null model/Median of covariate model)
(Figure 4). A positive value (blue colour) implies that the estimated median for the null model
was higher than the covariate model, and a negative value (red colour) implies that the covariate
model estimated a higher median than the null model. The white colour implies that the ratio is
one.

By looking at the figures, the estimated median for the null model was higher than the covariate
model in northern parts of Sweden for greylag goose and Eurasian beaver. As for common
eider, which is only felled in three counties, one cannot determine which model estimated a
higher median by looking at the figure. There was no apparent difference in the estimated
median throughout Sweden for the rest of the game species, and no clear spatial patterns were
discovered.

 20
21
Figure 4. Map over Swedish HMP estimated median of A. wild boar, B. wild boar including
wildlife accidents, C. red fox, D. greylag goose, E. western capercaillie, F. mountain hare, G.
European pine marten, H. American mink, I. Eurasian beaver, and J. common eider. Blue areas
represent, where the estimated median was higher for the null model, and orange represents
where the estimated median was higher for the covariate model. Values are shown in a ratio
on log scale, and the white area represents where the ratio is 1, indicating no difference in the
estimated median between the two models.

 22
Predicted variation among HMPs

The coefficient of variation among HMPs was used to visualize the difference in variation in a
scale-free measurement. The coefficient of variation (CV), calculated from each HMPs
predictive posterior mean and standard deviation for both the covariate and null model, was
compared for each HMP and county (Figure 5). The comparison was made by evaluating the
ratio between CV between the two models (CV of null model/ CV of covariate model), where
a positive value implies that the CV was higher for the null model (red colour), and a negative
value implies that the CV was higher for the covariate model (green colour).

The total number of HMPs with a smaller ratio of the CV of the covariate model was for wild
boar (169/243), wild boar (including wildlife accidents) (186/243), red fox (265/306), greylag
goose (278/298), Western capercaillie (202/271), American mink (298/306), Eurasian beaver
(151/249), common eider (55/76), mountain hare (154/306), and European pine marten
(179/301).

 23
Figure 5. Map over Swedish HMP showing coefficient of variation of A. wild boar, B. wild boar
including wildlife accidents, C. red fox, D. greylag goose, E. western capercaillie, F. mountain
hare, G. European pine marten, H. American mink, I. Eurasian beaver, and J. common eider.
Red areas represent where the ratio of CV was smaller for the covariate model green represents
where the ratio of CV was smaller for the null model. Values are on semi-log scale, and the
white area indicates that the ratio is 1, indicating no difference in CV between the two models.

 24
Predictive variation at high hunting levels

Prediction variation at high hunting levels was performed by calculating the coefficient of
variation (CV) difference between the covariate and null models ( - 0 ) and plotting
these together with each county's predictive median. A positive slope indicates an increased
harvest (higher median) coupled to smaller variation (i.e., smaller predictive variation in CV)
for the covariate model. For western capercaillie, mountain hare, European pine marten,
Eurasian beaver, and common eider, there is a positive slope, which implies that the CV for the
covariate model decreases when harvest per county (positive slope) increases (Figure 5).
However, harvest of common eider is only reported for three counties, and therefore it could be
challenging to say with certainty that the covariate model's predictive variation in harvest at
high levels is smaller or greater than the null model. For wild boar (both models), red fox,
greylag goose, and American mink the CV for the covariate model increases when harvest per
county (negative slope) increased (Figure 5).

 25
Figure 5: Result of correlation between models in coefficient of variation (CV) and the amount
of hunting (median) of A. wild boar, B. wild boar including wildlife accidents, E. red fox, D.
greylag goose, E. western capercaillie, F. mountain hare, G. European pine marten, H.
American mink, I. Eurasian beaver, and J. common eider.

 26
4. Discussion
Harvest estimates are among the most important activities since it can function as an indicator
of population size changes (Skalsi et al., 2006: Bergqvist et al., 2016), and it should be
continuously adapted to species distribution, land use, and biotic/abiotic factors. However,
harvest reporting systems vary among countries, depending on local conditions and preferences
(Åhl et al., w.y). In Sweden hunting reporting is voluntary for most common felled game, which
results in a variation in the number of reports between hunting teams, HMP, and counties.
Therefore, there is a demand for new statistical methods for quantifying harvest in Sweden. To
our knowledge, the implemented method of using Bayesian modelling and covariates at the
precinct level, is a novel model for felled game.

The results from this study show that harvest estimates can be improved using covariates at the
precinct level. The covariates model ranked highest for all ten species, and out of the ten species,
six had an ELPD difference of two to four, which implies that, depending on game species,
there is both strong and week support that the covariate model will be a better predictor for
other datasets than this one. At least one covariate had an apparent effect on harvest estimates
for nine out of ten species. In addition, the covariate model reduced the large uncertainties,
which is an improvement of the null model.

For all nine species, excluding rock ptarmigan, the covariate model estimated a smaller CV
(coefficient of variation, a scale-free measurement of the variation) and a 95 % credible interval
than the null model, indicating that the large uncertainties were removed by including covariates
in the model, and hunting estimates became even more precise.

During the evaluation of the 95 % credible interval, this study found both expected and
unexpected effects. Some of the expected effects were that infrastructure and farmland had a
positive effect on harvest estimations for red fox, which implies that the more areas of
infrastructure and farmland there are in HMPs, the more red fox is felled. The effects were
expected since a landscape with increased levels of farmland and infrastructure provides good
food resources for the red fox. Second, open areas without vegetation and farmland had a
negative effect on western capercaillie harvest estimations, i.e., less western capercaillie is
felled in HMPs where there are areas of farmland and open areas without vegetation. The
western capercaillie is mainly located in northern parts of Sweden, where agricultural land is
scarce. In addition, western capercaillie is disadvantaged by exploited areas and open areas.

 27
Lastly, coastline for the sea had a strong positive effect on the American mink, which is
probable as the coastline is a part of the American mink’s natural habitat.

The unexpected effects from covariates include, for example, that agricultural land had a
positive effect on harvest for beaver, i.e., the more farmland land, the more beavers are felled
in HMPs. One explanation could be that there is a higher abundance of beaver populations in
southern Sweden, where there are larger agricultural land areas than in the northern parts of
Sweden. For mountain hare, there were quite contradictory results. There were two distinct
north and south gradients, precipitation and humidity. These two imply that the more
precipitation and humid it is, the more mountain hare is felled. Precipitation was an expected
effect since precipitation is higher around the mountain range and abundance for mountain hare
is higher in the northern parts of Sweden. However, it is most humid in the southern parts of
Sweden (SMHI, w.y). In addition, the results also show that more mountain hare is felled in
southern Sweden, which can be misleading as most mountain hare hunting is located in the
northern part of Sweden.

Two models were tested for wild boar, with wildlife accidents, and without wildlife accidents.
The number of covariates varied from 12 (including wildlife accidents) to 10 (excluding
wildlife accidents). One hypothesis was that including wildlife accidents in the model would
substantially affect other covariates since it is a strong predictor. If there is a high abundance
of wild boar in an HMP, it is more likely that there is an increase of wildlife accidents. However,
the wildlife accidents covariate results showed a negative effect on the harvest estimates, which
might seem like a contradiction as a higher abundance of wild boar should increase the risk for
accidents. Another contradictory result was that water had a positive effect, while coastline for
sea/lake and river had a negative effect. One possible explanation for these results (both for
mountain hare and wild boar) is that there could be a correlation between covariates, that they
are very alike, contributing to a complicated variation composition of several factors.

The null model by Lindström and Bergqvist found that the inclusion of rate variability
association was dubious. However, when covariates were added to the model, it turned out that
the inclusion of rate-variability association was valuable information to include in the model.
The full covariate model received a higher difference in ELPD and SE for nine out of ten game
species during the evaluation of predictive performance. For greylag goose, however, the
reduced covariate model ranked highest. For greylag goose, when including rate-variability
association to the model, not all MCMC chains converged. However, when excluding rate-
variability association from the model, all chains converged. Greylag goose is felled in 20
 28
(reported) counties and in 298 (reported) HMPs, with, during the hunting year 2018/2019, over
7000 reports for greylag goose. However, even if greylag goose is felled in large parts of the
country, a possible explanation for why parameter (2) did not converge may be due to greylag
goose being felled by only a few hunters, which means that the variation within HMPs could
be high.

Although the covariate model ranked highest for rock ptarmigan, there was no apparent effect
of covariates for that species. A possible explanation may be that rock ptarmigan is felled in
only three counties, all located close to mountain chains. For this study, counties with no
hunting for each species were not included, and if it had, the result for rock ptarmigan would
probably have turned out differently, and the results would have shown an effect of several
factors. However, when only including the actual area where the species was felled, it seems to
have a more negligible effect.

Other countries in Europe (e.g., Cyprus and Finland) are experiencing similar problems with
voluntary hunting report systems, as in Sweden, and experiences that harvest estimates are
sensitive to deviating values at low reporting levels (Åhl et al., 2020). In addition, in Europe,
there is an ongoing problem: where hunting estimates do not take into account any
measurements of uncertainty and that the hunters who report may not represent all hunters. This
suggests that there is a need to develop better methods for harvest reporting and harvest
estimations. By including valuable information about land cover, geographical distribution,
wildlife accidents, and the impact of climate, other countries other than Sweden can benefit
from this kind of harvest estimation model, where covariates are taking into account.

There is no method for monitoring population size and distribution for many of the game
species in Sweden, even if harvest is high or low. Therefore, harvest estimates are vital for
wildlife management since it could indicate changes in population size (monitoring long-term
survival of wildlife species) (Bergqvist et al., 2016). However, if harvest estimates should
work as an indicator of population size, the estimates should be as accurate as possible. Even
if the null model, created by Lindström and Bergqvist, improved the former harvest
estimation model, the uncertainties were still too large for most game species. The covariate
model reduced the large uncertainties and estimated a more precise harvest estimate.
Furthermore, a model with a lower credible interval can detect trends in hunting patterns over
the years, which also is important for species distribution and hunters game preference. This
implies that the covariate model can be used as a tool for population size and distribution

 29
changes, which is vital for wildlife management, especially since there is no reliable
methodology for inventorying many of the game species.

There are still some limits to the covariate model. During posterior prediction for unobserved
hunting, cross-validation with training and validation data was not applied due to the time limit,
limiting the model. Without cross-validation with training and validation data, the performed
prediction can still be uncertain, even if the performed prediction were reliable. Another aspect
necessary to consider was that the covariate model may be slightly over-fitted and may not
work so well on other datasets. Further analysis should therefore include cross-validation with
training and validation data to reduce any bias in the result. However, the results still show that
the covariate model results in a smaller credible interval in posterior prediction than the null
model, and LOO-CV was lower for all species, although there was not always a significant
enough difference. Lastly, there is still limited knowledge of which hunting teams that report
their harvest. Even if SAHWM continuously informs and encourages hunting teams to report
their total harvest, low reporting remains a problem since there are uncertainties about hunting
and reporting preferences. Therefore, if harvest reporting should increase for all hunting teams
throughout Sweden, and if the covariate model would be implemented as a harvest estimation
model for all game species in Sweden, harvest estimates would work as a strong indicator of
population size and distribution changes.

4.1 Conclusion
Since reporting is voluntary for most common felled game, the number of reports can vary
between hunting teams, HMP, and counties. There is therefore a demand for statistical methods
to estimate harvest in Sweden. In this study, I assessed and expanded a model for estimating
the relevance of covariates for harvest estimation of felled game.

This study aimed to improve the current estimation method, and by including covariates at the
HMP level, this study has developed a framework for studying the effect of climate, land cover,
and geographical distribution on harvest estimation. Second, this study suggests that harvest
estimates can be improved by taking these factors (covariates) into account. Based on LOO-
CV and the 95% credibility interval, which did not include zero for the remaining covariates, it
can be said that the covariate model is a better model than the null model for nine out of ten
focal species. To conclude, by taking covariates into account as the finest spatial resolution that
can be considered from the available data, instead of only using HMP as the finest scale, the
covariate model reduced the predictive variation. Using covariates at the precinct level, models

 30
You can also read