Effect of permanent plots on the relative efficiency of spatially balanced sampling in a national forest inventory

Räty, Minna; Kangas, Annika Susanna

doi:10.1007/s13595-019-0802-6

Research Paper
Open access
Published: 21 February 2019

Annals of Forest Science volume 76, Article number: 20 (2019)

Abstract

Key message

Using spatially balanced sampling that utilizes auxiliary information in the design phase can enhance the design efficiency of a national forest inventory. These gains decreased with an increasing proportion of permanent plots in the sample. Using semi-permanent plots, which change every nth inventory round, instead of permanent plots reduced this effect. Further studies on accounting for the permanent sample when selecting the temporary sample are needed.

Context

National forest inventories (NFIs) produce national- and regional-level statistics for sustainability assessment and decision-making. Using an interpreted satellite image as auxiliary information in the design phase improved the relative efficiency (RE). Spatially balanced sampling with the local pivotal method (LPM), used for selecting clusters of sample plots, is designed for a temporary sample; thus, the method was tested in an NFI design with both permanent and temporary clusters.

Aims

We evaluated the LPM and stratified sampling for an NFI designed for successive occasions, where the clusters are permanent, semi-permanent, or temporary, being replaced never, every nth inventory round, and every inventory round, respectively.

Methods

REs of sampling designs against systematic sampling were studied with simulations of inventory sampling.

Results

The larger the proportion of permanent clusters, the smaller the benefits gained with LPM. The REs of stratified sampling did not depend on the proportion of permanent clusters. Semi-permanent sampling with LPM removed the decrease described above and resulted in the largest REs.

Conclusion

Sampling strategies with semi-permanent clusters were the most efficient, yet not necessarily optimal for all inventory variables. Further development of a method that takes the distribution of the permanent sample into account when selecting the temporary or semi-permanent sample is desirable, since it could increase the design efficiency.

1 Introduction

National forest inventories (NFIs) are the main source of information for characterizing the state of forest resources (Vidal et al. 2016, p. 8). The most common inventory variables are forest area, mean growing stock volume, and the distribution of growing stock volume into tree species and timber assortments (Tomppo et al. 2010; Vidal et al. 2016). In addition to the current growing stock, estimating the changes in the forests over time is important. The plots can be permanent, meaning they are remeasured in all consecutive inventory rounds, or temporary, meaning they are discarded after the first measurements. Temporary plots are mainly intended to capture the current state of the forest, whereas permanent plots aim at capturing the changes in addition to the current state (Scott 1998; Tomppo et al. 2010). Even though the increment of the growing stock can be accurately measured via increment cores from temporary plots, estimating changes such as natural mortality and harvests is much more precise from permanent than from temporary plots (e.g., Päivinen and Yli-Kojola 1989). NFIs can be based solely on temporary plots (e.g., Poland, Portugal, France, and Spain), solely on permanent plots (e.g., Austria, Iceland, China, and Canada), or on a combination of these two plot types (e.g., Finland, Sweden, Netherlands, Estonia, New Zealand; see Tomppo et al. 2010). The designs also change over time; for instance, in France, plans to introduce permanent plots have been reported (Vidal et al. 2016).

An inventory with purely permanent plots is called a continuous forest inventory (CFI). Sometimes, established permanent plots may lose their importance as indicators of change. For example, treatment bias can arise when permanent plots are managed differently from the surrounding forests, which affects CFI estimates (Köhl et al. 2015). In such cases, the possibility of also redistributing the permanent plots would be beneficial.

Another option is a sampling design where the permanent plots are only used for a limited time, i.e., they are semi-permanent. Such designs are flexible since the priorities of the survey may be changed from one round to another by allocating different numbers of temporary plots (Scott and Köhl 1994). A semi-permanent plot is surveyed in at least two consecutive inventory rounds but then relocated like a temporary plot. Therefore, it is capable of capturing change and, with an efficient reallocation, is not susceptible to treatment bias in the same way as permanent plots. An example of this is sampling with partial replacement (e.g., Patterson 1950; Matis et al. 1984; Köhl et al. 1995).

The measurement costs of permanent plots have been higher than those of temporary plots because of the need to ensure that the plot is found for remeasurement. However, with modern GPS, the trees in temporary plots may be located as accurately as the trees in permanent plots, and therefore, the measurement costs no longer differ markedly (e.g., Tomppo et al. 2014). This makes it possible to introduce new permanent plots without additional costs.

In Sweden, the temporary clusters in the current NFI round, which began in summer 2018, were chosen with spatially balanced sampling using the local pivotal method (LPM) in the sample selection (Grafström et al. 2017b). This motivated us to test the same method in the Finnish NFI setting. In spatially balanced sampling, the distribution of the auxiliary variables in the sample is matched as closely as possible to their distribution in the entire population (Grafström et al. 2012). Auxiliary data may be any data available for all units of the population, with no upper limit on the number of auxiliary variables used. Typically, auxiliary variables are spatial location, other geographic data such as altitude, and remotely sensed data (e.g., Grafström and Ringvall 2013; Grafström et al. 2014). The underlying assumption is that the auxiliary information and the inventory variables are correlated (Grafström et al. 2012). LPM is a sample selection method that results in an approximately spatially balanced sample (Grafström and Lundström 2013). The LPM was assessed in a simulation study with independent auxiliary information and real NFI field data, where all sampling units belonged to one and the same population available for sampling (Räty et al. 2018). In other words, the setting in that study corresponded to an inventory with temporary plots only. The LPM can also be combined with other sampling methods such as stratification: in such a case, the LPM would be carried out separately within each stratum.

So far, there is no approach that accounts for the distribution of an existing permanent sample when selecting a temporary sample with the LPM. Such an approach should not compromise the requirement that each unit in the population has a probability greater than zero of being included in the sample. To date, with LPM it has only been possible to match the distribution of the temporary sample irrespective of the existing permanent sample. Therefore, when a permanent sample exists, stratification with systematic or random sample selection may be more efficient than stratification with LPM or pure LPM. In stratified sampling, the sample within a stratum is populated first with the permanent sample belonging to that stratum. Then, the remaining sample within the stratum is filled using systematic or random selection. Thus, while stratified sampling (with or without LPM) was shown to be less robust than pure LPM in our previous study (Räty et al. 2018), it may be more robust than LPM in a design involving permanent plots.

The main survey principles of the NFIs in the two countries, Finland and Sweden, are alike (Tomppo et al. 2010). The sample plots are arranged in clusters, the location of which refers to a corner point (Fig. 1). The temporary clusters comprise one third (≈ 33%) of the clusters in Sweden and 40% in Finland (Kangas et al. 2018). One inventory round lasts for 5 years, and each year, the sample of systematically positioned clusters covers the entire country. The exceptions in Finland are the northernmost part and the southwestern archipelago, each of which is surveyed in one summer. The number of sample plots measured annually is approximately 10,000 and 15,000 in Sweden and Finland, respectively (Kangas et al. 2018). Thus, improving the cost efficiency is important.

In this study, we assess the efficiency of sampling designs by simulating the second phase of inventory sampling with different proportions of permanent clusters in the sample. Our first hypothesis is that as the proportion of permanent clusters in the sample increases, the relative efficiency (RE) of a sampling design using LPM for temporary plot selection decreases, because a larger proportion of the sample is chosen without utilizing the auxiliary information. In other words, with a larger proportion of permanent clusters, it is more difficult to match the distribution of the total sample, including both temporary and permanent clusters, to the distribution of the auxiliary variables over the study region. As our second hypothesis, we assume that as the proportion of permanent clusters in the sample increases, the performance of stratified sampling designs with systematic plot selection improves relative to that of LPM sampling. This is because the sampling units in stratified sampling are selected independently of each other, without any need to account for the distribution of the permanent clusters in the same way as in LPM. In the last assessment, the permanent clusters are treated as semi-permanent, meaning that all the semi-permanent clusters are resampled and allowed to change their position at the same time. In that case, both the entire temporary cluster population and the semi-permanent cluster population are sampled with LPM using auxiliary variables. This setup can be thought of as the maximal potential achievable with semi-permanent clusters. We assume that this semi-permanent/temporary sampling design would be more efficient than the design with permanent and temporary plots but not as efficient as a design where all clusters are temporary.

2 Material and methods

2.1 Study region and primary data

The study region is the southern part of Finland, excluding the southwestern archipelago; it covers about 153,000 km² of land area and two sampling regions (Fig. 1a). The primary data in this study are the field data from the 11th Finnish NFI (NFI11), which was carried out in 2009–2013. The sample plots are arranged in clusters, with slightly different cluster designs for the two sampling regions (Fig. 1b, c). The data comprise altogether 46,914 field sample plots in N = 5408 clusters, of which 1082 clusters were permanent and the remaining 4326 were temporary.

Primary data in our study represents the population Ρ from which the samples are chosen and population parameters are estimated. Our study is based on the main results of the Finnish NFI: total growing stock volume on the forested land (m³), forested land area proportion, and mean growing stock volumes by tree species groups (m³/ha) (Table 1). The forested land in this study is defined to include the two national forestry land classes: “forest land” and “poorly productive forest land” (Tomppo et al. 2011), resulting in an estimate close to the forest land as defined by the United Nations Food and Agriculture Organization (FAO 2012). The tree species–specific groups comprising all the growing stock volume are as follows: pine (Pinus sylvestris L.) including all conifers except spruce, spruce (Picea abies L.), and broadleaves, which mostly are birches (Betula pendula L. and Betula pubescens L.) (Korhonen et al. 2017).

Table 1 Reference levels in the study: the population-level values for chosen population parameters (first row) and the mean squared errors (MSEs) for local pivotal method with spatial coordinates (=geospatial spread) by increasing proportion of permanent clusters (p) with the sample size of n = 400. Unit of MSE is the squared unit of that variable

2.2 Auxiliary information

Auxiliary information in this study was from the tenth multi-source NFI (MS-NFI10) (Tomppo et al. 2008), which was available as georeferenced raster layers of 20 × 20 m pixel size. These forest resource maps were based on the field measurements (NFI10 in years 2003–2008) and Landsat 5 TM images from year 2007 (Tomppo et al. 2012).

To calculate the auxiliary variables for the sampling units, i.e., clusters, a five-pixel window for each sample plot in a cluster was extracted from the forest resource map rasters: the center pixel in which the plot center was located and one adjacent pixel in each of the four main cardinal directions, i.e., the so-called rook's case contiguity (e.g., Lloyd 2009). The cluster-level auxiliary variables were estimated as sums, means, or variances using equal weight for all extracted pixels belonging to any sample plot in the cluster. For the forested land proportion, all pixels classified as land were utilized, and for the growing stock volume–based mean and variance estimates, the pixels classified as forested land were utilized. Six forest resource thematic maps were utilized to produce the following six auxiliary variables for the clusters: (1) mean growing stock volume of all tree species, (2) mean growing stock volume of pine including other conifers than spruce, (3) mean growing stock volume of spruce, (4) mean growing stock volume of broadleaves, (5) variance of growing stock volume of all tree species within the cluster, and (6) forested land proportion (Table 2).
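The window extraction and aggregation described above can be sketched as follows. This is a minimal Python illustration, not the authors' implementation (which was done in R); the function name `cluster_auxiliaries`, the array-based raster, the row and column indexing of plot centers, and the use of the population variance are all assumptions made for the example.

```python
import numpy as np

# Rook's case neighbourhood: the centre pixel plus one pixel in each cardinal direction.
ROOK_OFFSETS = [(0, 0), (-1, 0), (1, 0), (0, -1), (0, 1)]

def cluster_auxiliaries(raster, plot_pixels, mask=None):
    """Mean and variance of a raster over the five-pixel windows of one cluster.

    raster      : 2-D array, e.g. growing stock volume per pixel (hypothetical input)
    plot_pixels : list of (row, col) indices of the pixels containing the plot centres
    mask        : optional boolean 2-D array; only pixels where it is True are used
                  (e.g. pixels classified as forested land)
    """
    n_rows, n_cols = raster.shape
    values = []
    for r, c in plot_pixels:
        for dr, dc in ROOK_OFFSETS:
            rr, cc = r + dr, c + dc
            if not (0 <= rr < n_rows and 0 <= cc < n_cols):
                continue  # window extends beyond the raster edge
            if mask is not None and not mask[rr, cc]:
                continue  # pixel excluded by the mask
            values.append(raster[rr, cc])
    values = np.asarray(values, dtype=float)
    if values.size == 0:
        return float("nan"), float("nan")
    # Equal weight for every extracted pixel; population variance (ddof=0) is an assumption.
    return float(values.mean()), float(values.var())

# Toy example: a 5 x 5 raster and a cluster with a single plot centred at (2, 2).
toy_raster = np.arange(25, dtype=float).reshape(5, 5)
toy_mean, toy_var = cluster_auxiliaries(toy_raster, [(2, 2)])
```

Because every extracted pixel receives equal weight, a pixel shared by the windows of two neighboring plots is counted twice in this sketch.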

Table 2 Thematic maps utilized in the study, cluster-level auxiliary variable description, and correlation between the auxiliary and primary data

2.3 LPM

LPM utilizes auxiliary information in sample selection. In this study, the combinations of cluster-level auxiliary variables used are the ones that proved to be efficient in the previous study (Räty et al. 2018). LPM aims at selecting a sample whose distribution in the auxiliary space is as close as possible to the distribution of the population (Grafström et al. 2012). Consequently, the sample is spatially irregular if the auxiliary variables include other variables besides the spatial coordinates. Each sampling unit i in the population Ρ of size N receives an (equal or unequal) initial inclusion probability, π_i, and these probabilities sum to the sample size, n:

$$ n=\sum \limits_{i=1}^N{\pi}_i\ \mathrm{and}\ 0<{\pi}_i<1 $$

(1)

In the selection process, the initial inclusion probabilities are turned into inclusion indicators, which are updated with an algorithm. However, while these indicators change during the process, the actual inclusion probabilities remain at the initial level. The updating is carried out using pairwise comparisons. Further, while the LPM algorithm is selecting a sample, the population divides into two: available and decided population. In the beginning, the entire population is available, i.e., all inclusion indicators differ from values 1 and 0. If the updated indicator value is 0, that unit will not be included in the sample and it is moved from the available population to the decided population. Similarly, a unit chosen to the sample and having an inclusion indicator value of 1 will also be moved to the decided population. As the selection proceeds, in every algorithm round, at least one unit is either chosen to the sample or loses its possibility to be included in the sample and thus is moved to the decided population. So, when LPM algorithm is running, the available population is diminishing and decided population consisting of included and excluded units is increasing. For more details of LPM, see, e.g., Grafström et al. (2012) and Fig. 2 in Räty et al. (2018).
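The selection process described above can be illustrated with a short Python sketch. It is not the `lpm1` routine used in this study: for brevity it follows the simpler LPM2 variant described by Grafström et al. (2012), in which a randomly chosen available unit competes with its nearest available neighbor. The function name `lpm2_sample`, the tolerance `eps`, and the toy data are assumptions for illustration, and the auxiliary values are assumed to be standardized beforehand.

```python
import numpy as np

def lpm2_sample(x, pi, rng, eps=1e-9):
    """Select a sample with a simplified local pivotal method (LPM2-style sketch).

    x   : (N, q) array of auxiliary values (assumed already standardized)
    pi  : length-N array of initial inclusion probabilities
    rng : numpy random Generator
    """
    x = np.asarray(x, dtype=float)
    p = np.asarray(pi, dtype=float).copy()
    # Available population: units whose indicator is still strictly between 0 and 1.
    available = np.flatnonzero((p > eps) & (p < 1 - eps))
    while available.size > 1:
        i = rng.choice(available)
        others = available[available != i]
        # Nearest available neighbour of i in the auxiliary space.
        j = others[np.argmin(np.linalg.norm(x[others] - x[i], axis=1))]
        s = p[i] + p[j]
        if s < 1:
            # One unit is excluded; the other receives the combined probability.
            if rng.random() < p[j] / s:
                p[i], p[j] = 0.0, s
            else:
                p[i], p[j] = s, 0.0
        else:
            # One unit is included; the other keeps the remainder.
            if rng.random() < (1 - p[j]) / (2 - s):
                p[i], p[j] = 1.0, s - 1
            else:
                p[i], p[j] = s - 1, 1.0
        for k in (i, j):  # absorb floating-point error near 0 and 1
            if p[k] < eps:
                p[k] = 0.0
            elif p[k] > 1 - eps:
                p[k] = 1.0
        # Units that reached 0 or 1 move to the decided population.
        available = available[(p[available] > 0) & (p[available] < 1)]
    if available.size == 1:  # only happens when sum(pi) is not an integer
        k = available[0]
        p[k] = 1.0 if rng.random() < p[k] else 0.0
    return np.flatnonzero(p == 1.0)

# Toy example: 60 units with random coordinates and equal probabilities 12/60.
toy_rng = np.random.default_rng(1)
toy_x = toy_rng.random((60, 2))
toy_sample = lpm2_sample(toy_x, np.full(60, 12 / 60), toy_rng)
```

Each pairwise update leaves the sum of the two indicators unchanged and preserves their expected values, which is why the inclusion probabilities remain at the initial level and the final sample size equals n whenever the initial probabilities sum to an integer.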

The distance between the clusters is the Euclidean distance in the space of the auxiliary variables:

$$ d\left(i,j\right)=\sqrt{\sum \limits_{k=1}^q{\left({x}_{ik}^{\prime}-{x}_{jk}^{\prime}\right)}^2} $$

(2)

where $ \left({x}_{i1}^{\prime},{x}_{i2}^{\prime},\dots, {x}_{iq}^{\prime}\right) $ are the standardized values of the q auxiliary variables of cluster i, and the distance is defined for all pairs (i, j) of clusters. Standardization of the auxiliary variables guarantees that they have equal importance in the distance calculation (Grafström and Ringvall 2013).
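As a small illustration of Eq. 2, the following Python sketch standardizes each auxiliary variable to zero mean and unit standard deviation and then computes all pairwise distances. The function name `standardized_distances` and the use of the population standard deviation are assumptions; the study itself relied on the R package BalancedSampling.

```python
import numpy as np

def standardized_distances(aux):
    """Pairwise Euclidean distances (Eq. 2) between units in standardized auxiliary space.

    aux : (N, q) array with one row per cluster and one column per auxiliary variable
    """
    aux = np.asarray(aux, dtype=float)
    # Standardize each variable; a constant column would give a zero divisor.
    z = (aux - aux.mean(axis=0)) / aux.std(axis=0)
    diff = z[:, None, :] - z[None, :, :]      # shape (N, N, q)
    return np.sqrt((diff ** 2).sum(axis=2))   # shape (N, N)

# Toy example: the second variable is on a scale ten times that of the first,
# yet after standardization both contribute equally to the distance.
toy_aux = np.array([[0.0, 0.0], [1.0, 10.0], [2.0, 20.0]])
toy_d = standardized_distances(toy_aux)
```

The full N × N × q difference array is convenient for small examples but memory-hungry; for a population of several thousand clusters, distances would more sensibly be computed one row at a time.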

2.4 Stratified sampling

In stratified sampling, the population is divided into strata that are as homogeneous as possible using the auxiliary variables. We used equally spaced limits along the cumulative distribution of the square root of the density function of the auxiliary variable to define the strata (see Cochran 1977; section 5A.7). The sample size within each stratum was defined with optimal allocation, where the within-stratum variance of the auxiliary variable was weighted with the size of the stratum (Cochran 1977). In this study, we utilized the stratifications that proved to be most efficient and robust in our previous study (Räty et al. 2018).
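The two steps above can be sketched in Python as follows. This is an illustration under stated assumptions rather than the procedure used in the study: the density is approximated with a histogram of `n_bins` bins, the allocation is the textbook Neyman form with sample sizes proportional to the stratum size times the within-stratum standard deviation, and rounding uses the largest-remainder rule. The function names and the simulated auxiliary variable are hypothetical.

```python
import numpy as np

def cum_sqrt_f_limits(x, n_strata, n_bins=100):
    """Stratum limits placed at equal steps of the cumulative square root of frequency."""
    counts, edges = np.histogram(x, bins=n_bins)
    csf = np.cumsum(np.sqrt(counts))
    targets = csf[-1] * np.arange(1, n_strata) / n_strata
    idx = np.searchsorted(csf, targets)  # first bin whose cumulative value reaches the target
    return edges[idx + 1]                # upper edge of that bin

def neyman_allocation(x, strata, n):
    """Allocate n sample units to strata proportionally to N_h * S_h (assumed form)."""
    x = np.asarray(x, dtype=float)
    strata = np.asarray(strata)
    labels = np.unique(strata)
    N_h = np.array([(strata == h).sum() for h in labels])
    S_h = np.array([x[strata == h].std(ddof=1) if (strata == h).sum() > 1 else 0.0
                    for h in labels])
    raw = n * N_h * S_h / (N_h * S_h).sum()
    n_h = np.floor(raw).astype(int)
    rest = int(n - n_h.sum())
    # Largest-remainder rounding so that the stratum sample sizes sum exactly to n.
    n_h[np.argsort(-(raw - n_h))[:rest]] += 1
    return dict(zip(labels.tolist(), n_h.tolist()))

# Toy example: a skewed, simulated auxiliary variable for 1000 units and four strata.
toy_rng = np.random.default_rng(0)
toy_x = toy_rng.gamma(2.0, 50.0, size=1000)
toy_limits = cum_sqrt_f_limits(toy_x, 4)
toy_strata = np.digitize(toy_x, toy_limits)
toy_alloc = neyman_allocation(toy_x, toy_strata, 100)
```

The sketch does not check whether an allocated sample size exceeds the number of units in a stratum, which a real application would need to handle.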

The stratum of each cluster was defined prior to the sampling simulation (Table 3). No separation was made between the permanent and temporary clusters in the population when it was stratified, but in the sample selection, the permanent clusters were chosen first using LPM with spatial coordinates and equal inclusion probabilities, called geospatial spread from this point onwards in this paper. Then, the temporary clusters were selected using LPM with geospatial spread, taking into account how the chosen permanent clusters were distributed among the strata, so as to fill the sample size of each stratum (Table 4). If the number of sample clusters in any stratum exceeded the predefined sample size already after the allocation of the permanent clusters, strata were combined.

Table 3 Limits of strata in stratifications based on cluster-level auxiliary variables (see Table 2 for definitions)

Table 4 The stratifications performed and both the sizes, N_s, and sample sizes, n_s, of strata. For a more detailed definition of auxiliary variables, see Table 3

The result was a sample in which the permanent plots were spread spatially as evenly as possible over the whole test area and the temporary plots within each stratum. Standard stratified estimators were used to compute the estimates for the population parameters from each stratified sample.
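The bookkeeping behind this selection order can be sketched as follows. The Python functions below are a hypothetical illustration, not the authors' R code: `temporary_quota` counts how many temporary clusters each stratum still needs once the permanent clusters have been assigned to their strata, and `stratified_mean` is the textbook stratified estimator of a mean, with each stratum mean weighted by the stratum's share N_s / N of the population.

```python
import numpy as np

def temporary_quota(n_s, permanent_strata):
    """Temporary clusters still needed per stratum after the permanent ones are placed.

    n_s              : dict mapping stratum label -> predefined sample size
    permanent_strata : stratum labels of the already selected permanent clusters
    A negative value means the stratum is over-filled; in the study such strata
    were combined, which is not implemented in this sketch.
    """
    quota = {}
    for s, target in n_s.items():
        already = sum(1 for h in permanent_strata if h == s)
        quota[s] = target - already
    return quota

def stratified_mean(y, strata, N_s):
    """Stratified estimate of the population mean: sum over strata of (N_s / N) * mean_s."""
    y = np.asarray(y, dtype=float)
    strata = np.asarray(strata)
    N = sum(N_s.values())
    # Every stratum in N_s is assumed to contain at least one sampled unit.
    return float(sum(N_s[s] / N * y[strata == s].mean() for s in N_s))

# Toy example: two strata with target sample sizes 3 and 2; three permanent clusters chosen.
toy_quota = temporary_quota({0: 3, 1: 2}, [0, 0, 1])
toy_estimate = stratified_mean([1.0, 3.0, 10.0], [0, 0, 1], {0: 30, 1: 10})
```

In the toy example, each stratum still needs one temporary cluster, and the estimate is 0.75 × 2 + 0.25 × 10 = 4.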

2.5 Design efficiency and sampling simulations

The hypotheses were tested in sampling simulations with the real NFI field clusters. In the simulations, the permanent and temporary NFI field clusters were put in one and the same population, i.e., the sampling population was N = 5408 clusters and sample size n = 400. Further, the clusters chosen for either set were not excluded from the selection of the other set. This corresponds to the situation where the two different cluster sets are spread independently from each other.

The sample selection method was LPM with geospatial spread for permanent clusters. For temporary and semi-permanent clusters, LPM with auxiliary data was used. In the case of stratification, first permanent and then temporary clusters were selected using geospatial spread. Further, proportion of permanent clusters in the sample was changed to show its impact on the performance of the sampling design. Thus, the changing elements in the sampling simulation were the sample selection method, auxiliary variables (Tables 3, 4, and 5) in the selection method, and proportion of permanent clusters resulting in several sampling designs. Performance of each sampling design was measured by the mean squared error (MSE):

$$ MSE=\frac{1}{T}\sum \limits_{t=1}^T{\left({\widehat{y}}_t-y\right)}^2 $$

(3)

where y is the true value of the target parameter (from Table 1), $ {\widehat{y}}_t $ the estimate obtained from the tth replication of the design, and T = 5000 is the number of replications.

The comparison of sampling designs was based on RE, which is the ratio of the MSE of the reference to the MSE of the method (m = LPM or stratification):

$$ R{E}_{m,p}=\frac{MS{E}_{ref,p}}{MS{E}_{m,p}} $$

(4)

where p is the proportion of permanent clusters of the sample size, p = 0.1–0.7. The reference is a design where both permanent and temporary clusters are chosen with LPM with geospatial spread. It can be interpreted as a systematic sampling design which is close to the current sampling design of Finnish NFI. The reference was estimated separately for each level of proportion p. Values RE > 1 mean that the method under investigation is more effective than the reference.
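Equations 3 and 4 amount to only a few lines of code. The sketch below uses Python rather than the R environment of the study, and the replicate estimates are made-up numbers chosen only to show the arithmetic; they are not results from the simulations.

```python
import numpy as np

def mse(estimates, true_value):
    """Mean squared error over replications (Eq. 3)."""
    e = np.asarray(estimates, dtype=float)
    return float(np.mean((e - true_value) ** 2))

def relative_efficiency(mse_ref, mse_method):
    """Relative efficiency (Eq. 4): values above 1 favour the method over the reference."""
    return mse_ref / mse_method

# Hypothetical replicate estimates of a parameter whose true value is 100.
toy_ref = [98.0, 102.0, 104.0, 96.0]     # reference design
toy_method = [99.0, 101.0, 102.0, 98.0]  # method under investigation
toy_re = relative_efficiency(mse(toy_ref, 100.0), mse(toy_method, 100.0))
```

Here the reference has an MSE of 10 and the method an MSE of 2.5, so RE = 4: the method would reach the accuracy of the reference with a clearly smaller error, in the same sense as RE > 1 in the text.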

All simulations, analyses, and visualizations were made with R (R Core Team 2018). The LPM was performed with lpm1 function available in R package BalancedSampling (Grafström and Lisic 2018).

3 Results

In the reference method, no auxiliary information other than the spatial coordinates was utilized in sample selection. Further, the population from which the sample was chosen was always the same for both sets of clusters, comprising all NFI clusters. The reference MSEs of the target variables derived with simulation, as well as the real values of the population parameters, are shown in Table 1. The simulation was replicated T = 5000 times, which was a sufficiently large number for the estimates of the mean to settle (Fig. 2).

When the proportion of permanent clusters in the sample increased, the RE of LPM decreased as expected (Fig. 3). In particular, the RE of the forested land proportion decreased: it changed from 1.80 to 1.16 as the proportion of temporary clusters changed from 90 to 30%. In the REs of mean growing stock volume and total growing stock volume, the decreases were 0.20 and 0.34 units, respectively (Table 5).

Table 5 Relative efficiencies of sampling designs with local pivotal method and stratified sampling when the proportion of permanent clusters out of n = 400 sample clusters is 0%, 10%, 30%, and 60% and permanent clusters were distributed systematically. Result for p = 0 is from a previous study (Räty et al. 2018)

For the tree species–specific mean growing stock volumes, we observed three phenomena. First, the decreasing trend as a function of the increasing proportion of permanent clusters was not as obvious as for the other parameters. Second, for all tree species–specific mean growing stock volumes, the RE was larger if the auxiliary information included the tree species–specific variables. Third, the clear differences in the REs between the cases using different auxiliary variables, seen with small proportions of permanent clusters, vanished as the proportion of permanent clusters increased. In the end, the REs were the same regardless of the auxiliary variables included in the sample selection.

For the stratified sampling, the changes depended both on the stratification and on estimated population parameter. For example, in broadleaf mean growing stock volume estimation, the REs had somewhat decreasing trend whereas in total growing stock volume estimation, the REs of most of the stratifications were fluctuating at the same level (Fig. 4). When stratification included forested land proportion, its estimation was efficient, otherwise not (Fig. 4, top left). The same applied for the tree species–specific growing stock volumes. If the stratification included information on a given tree species, the RE of that species was larger than that for the other stratifications.

When sampling with LPM utilizing the same set of auxiliary information in both the temporary and the permanent cluster populations, the effect of an increasing proportion of these semi-permanent clusters was no longer evident for all population parameter estimates (Fig. 5). The REs of the forested land proportion and the mean growing stock volume of pine did not have any detectable trend. For the other parameters, there was a slight decrease, which seemed to turn into an increase before the last simulated proportion, 70%. On the variable level, the RE of the tree species–specific mean growing stock volumes depended on the chosen set of auxiliary variables similarly to the previous LPM case (Fig. 3).

4 Discussion

The aim of this study was to estimate the efficiency of spatially balanced and stratified sampling designs in a realistic NFI situation. Spatially balanced sampling used LPM in sample selection, and it was applied in two different setups: In the first setup, the field clusters were divided into permanent clusters being surveyed in the consecutive inventories and temporary clusters, which were measured only once. The location of permanent clusters was fixed and arranged spatially systematically in consecutive inventories, but the temporary clusters were reallocated inside the study region each simulation round with LPM utilizing remote sensing data from previous inventory. In the second setup, the cluster groups were semi-permanent and temporary; thus, both cluster populations were reallocated each simulation round with LPM utilizing similar auxiliary remote sensing data. In the first setup above, also the stratified sampling method was assessed. In all simulations, the sample size was fixed but different proportions of samples were allocated into the two cluster populations.

We estimated the sampling efficiencies using the fixed positions and designs of sample clusters from the previous inventories with total sampling intensity of 400/5408 ≈ 7.4%. A sampling design that is systematically placed should capture all the variation in the population, and the small sampling intensity guarantees that differences in design efficiency result from actual performance of the methods. Efficiencies of different sampling designs were studied in respect to the design where both cluster sub-populations were geospatially spread with LPM, which means that samples in sub-populations were close to a current systematic sampling design.

Our first hypothesis concerning the RE of LPM held. The RE of the sampling designs decreased as the proportion of permanent clusters in the sample increased in the sampling simulations (Fig. 3). In a previous study (Räty et al. 2018), where all the clusters were chosen with LPM from one population, the largest REs were 1.77 and 2.15 for total growing stock volume and forested land proportion estimation, respectively (Table 5). As the proportion of permanent clusters increased to 60% of the sample, the REs decreased by as much as 40% (Table 5). Nevertheless, as in the previous study (Räty et al. 2018), LPM produced similar results irrespective of the auxiliary variables chosen, whereas in stratification, the result depended heavily on the chosen stratification strategy (Figs. 3 and 4, Table 5). Still, with LPM, the estimation of a given tree species–specific mean growing stock volume was enhanced if the growing stock volume for that species was included in the auxiliary information given to LPM (Table 5).

The second hypothesis, that stratified sampling would become more efficient relative to LPM as the proportion of permanent clusters in the sample increases, also held for some stratifications (Fig. 6). In fact, stratified sampling was invariant with respect to the proportions (Table 5). However, the variation in performance between the different stratifications was large, and an enhancement of the RE of one or a few population parameters often meant inefficiency in the estimation of the other population parameters. In contrast, spatially balanced sampling reached about the same level of RE regardless of the set of auxiliary variables chosen. This means that stratification should always be based on experience, consideration, and knowledge, whereas with LPM, the RE is at least on the same level as with systematic sampling (Fig. 3) (Grafström et al. 2017b).

The proportion of permanent clusters in the Finnish NFI is 60% (Kangas et al. 2018). Based on this study, sampling with LPM would enhance the estimation compared with the current systematic sampling design, but the expected improvements are smaller than the previous studies (Grafström et al. 2017b; Räty et al. 2018) anticipated, being 5–25% for different population parameters when the proportion of permanent plots in the sample is 60%. The question is whether the improvements gained in the design phase with LPM would contribute enough compared with other existing methods, such as post-stratification or model-assisted estimation (Haakana et al. accepted; Särndal et al. 1992; Kangas et al. 2016; Myllymäki et al. 2017), which are applied in the estimation phase to the results from a systematic sampling design. Possibly, the most efficient approach would then be a combination of cluster-level LPM and plot-level post-stratification.

One possible way to mitigate the decrease in efficiency as the proportion of permanent plots in sample increases would be selecting both the permanent and temporary plots with LPM utilizing auxiliary information. Even though changes happening in the permanent plots during the years would impact also on their distribution, it would still probably match the distribution of auxiliary variables better than systematically chosen permanent clusters. If the permanent plots are mainly used for estimating short-range changes, semi-permanent plots that are measured, say two to three times, could be a useful compromise that would improve the design efficiency (Table 6). However, the strategies of using permanent and semi-permanent sample plots in NFI need to be studied further, because permanent plots produce time series data that are valuable. Naturally, taking into account the distribution of permanent plots when selecting the sample from temporary population with LPM would be an optimal solution.

Table 6 Relative efficiency of sampling designs when both temporary and permanent clusters were chosen with local pivotal method utilizing auxiliary information

Both datasets, MS-NFI10 and NFI11, had their own error sources and inaccuracies. For example, the location of the sample plots in NFI and MS-NFI has an inaccuracy of some meters, and therefore, the given center coordinates might have fallen into an adjacent pixel in the MS-NFI auxiliary data raster with its pixel size of 20 × 20 m (Tomppo et al. 2012). On top of that, sample plots with a varying radius (angle-gauge measurements) of at most 12.52 m usually extend into the adjacent pixels (Korhonen et al. 2017). Therefore, for the estimation of the auxiliary information, we chose not only the pixel in which the sample plot center falls but also the adjacent pixels. Thus, we could be more confident that we had chosen a pixel that describes the conditions of the sample plot, but at the same time, we may also have included pixels that describe the forest conditions of adjacent stands. These position errors may decrease the RE but do not cause bias.

The auxiliary variables were defined at cluster level which brought challenges to the simulation. Having all auxiliary information aggregated from single pixels to mean values for clusters faded the extremes of the multi-dimensional auxiliary variable distribution. This was compensated by adding an auxiliary variable which describes the amount of within cluster variation, i.e., variance of total growing stock volume, but the variation could also originate from distribution of land use or tree species compositions as well as from other site conditions like altitude, which were not included as auxiliaries in our study. Instead of mean values, different kinds of metrics to describe the distances between the clusters in the multi-dimensional auxiliary space or the variation and distribution of auxiliary variables within the clusters as well as variance estimator (Grafström and Schelin 2014; Grafström et al. 2017a; Grafström and Matei 2018) could also be studied further.

5 Conclusion

Increasing the proportion of permanent sample plots had no effect on the RE of the stratified sampling designs, although their REs depended on the variables used in stratification. In contrast, with spatially balanced sampling designs the REs decreased; the decrease was about 10% for the mean growing stock when the proportion of permanent plots in the sample increased to 60%. When permanent plots were replaced with semi-permanent plots that were allocated in the same manner as temporary plots, i.e., based on auxiliary information rather than systematic sampling, the loss in RE in spatially balanced designs disappeared. This result invites consideration of sampling strategies with shorter-term permanent sample plots, which, however, might not be optimal for monitoring long-term changes, e.g., the effect of forest management on forest structure. Alternatively, further development of spatially balanced sampling methods could solve the problem of how to account for the permanent sample when selecting the temporary sample.

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Cochran WG (1977) Sampling techniques, 3rd edn. Wiley, New York, NY
FAO (2012) Forest resources assessment 2015: Terms and Definitions. In: FAO Rep. http://www.fao.org/docrep/017/ap862e/ap862e00.pdf. Accessed 1 Feb 2019
Grafström A, Lisic J (2018) Package “BalancedSampling” [online]. http://www.antongrafstrom.se/balancedsampling
Grafström A, Lundström NLP (2013) Why well spread probability samples are balanced. Open J Stat 03:36–41. https://doi.org/10.4236/ojs.2013.31005
Grafström A, Matei A (2018) Spatially balanced sampling of continuous populations. Scand J Stat. https://doi.org/10.1111/sjos.12322
Grafström A, Ringvall AH (2013) Improving forest field inventories by using remote sensing data in novel sampling designs. Can J For Res 43:1015–1022. https://doi.org/10.1139/cjfr-2013-0123
Grafström A, Schelin L (2014) How to select representative samples. Scand J Stat 41:277–290. https://doi.org/10.1111/sjos.12016
Grafström A, Lundström NLP, Schelin L (2012) Spatially balanced sampling through the pivotal method. Biometrics 68:514–520. https://doi.org/10.1111/j.1541-0420.2011.01699.x
Grafström A, Saarela S, Ene LT (2014) Efficient sampling strategies for forest inventories by spreading the sample in auxiliary space. Can J For Res 44:1156–1164. https://doi.org/10.1139/cjfr-2014-0202
Grafström A, Schnell S, Saarela S et al (2017a) The continuous population approach to forest inventories and use of information in the design. Environmetrics 28:e2480. https://doi.org/10.1002/env.2480
Grafström A, Zhao X, Nylander M, Petersson H (2017b) A new sampling strategy for forest inventories applied to the temporary clusters of the Swedish NFI. Can J For Res 47:1161–1167. https://doi.org/10.1139/cjfr-2017-0095
Haakana H, Heikkinen J, Katila M, Kangas A (2019) Efficiency of post-stratification for a large-scale forest inventory – case Finnish NFI. Ann For Sci 76:9. https://doi.org/10.1007/s13595-018-0795-6
Kangas A, Myllymäki M, Gobakken T, Naesset E (2016) Model-assisted forest inventory with parametric, semiparametric, and nonparametric models. Can J For Res 46:855–868. https://doi.org/10.1139/cjfr-2015-0504
Kangas A, Astrup R, Breidenbach J et al (2018) Remote sensing and forest inventories in Nordic countries – roadmap for the future. Scand J For Res 33:394–412. https://doi.org/10.1080/02827581.2017.1416666
Köhl M, Scott CT, Zingg A (1995) Evaluation of permanent sample surveys for growth and yield studies: a Swiss example. For Ecol Manag 71(3):187–194
Köhl M, Scott CT, Lister AJ et al (2015) Avoiding treatment bias of REDD+ monitoring by sampling with partial replacement. Carbon Balance Manag 10(11):1–11. https://doi.org/10.1186/s13021-015-0020-y
Korhonen KT, Ihalainen A, Ahola A et al (2017) Suomen metsät 2009–2013 ja niiden kehitys 1921–2013 [online]. Luonnonvara- ja biotalouden tutkimus 59/2017. Luonnonvarakeskus, Helsinki, p 86
Lloyd C (2009) Spatial data analysis - an introduction for GIS users. Oxford University Press, Oxford
Matis KG, Hetherington JC, Kassab JY (1984) Sampling with partial replacement: a literature review. Commonw For Rev 63:193–206
Myllymäki M, Gobakken T, Naesset E, Kangas A (2017) The efficiency of poststratification compared with model-assisted estimation. Can J For Res 47:515–526. https://doi.org/10.1139/cjfr-2016-0383
Päivinen R, Yli-Kojola H (1989) Permanent sample plots in large-area forest inventory. Silva Fenn 23:243–252
Patterson HD (1950) Sampling on successive occasions with partial replacement of units. J R Stat Soc Ser B Methodol 12(2):241–255
R Core Team (2018) The R Project for Statistical Computing. https://www.r-project.org/. Accessed 14 Dec 2018
Räty M, Heikkinen J, Kangas AS (2018) Assessment of sampling strategies utilizing auxiliary information in large-scale forest inventory. Can J For Res 48:749–757. https://doi.org/10.1139/cjfr-2017-0414
Särndal C-E, Swensson B, Wretman J (1992) Model assisted survey sampling. Springer-Verlag Publishing, New York, NY
Scott CT (1998) Sampling methods for estimating change in forest resources. Ecol Appl 8:228–233. https://doi.org/10.1890/1051-0761(1998)008[0228:SMFECI]2.0.CO;2
Scott CT, Köhl M (1994) Sampling with partial replacement and stratification. For Sci 40:30–46. https://doi.org/10.1093/forestscience/40.1.30
Tomppo E, Haakana M, Katila M, Peräsaari J (2008) Multi-source national forest inventory - methods and applications. In: Series: Managing Forest Ecosystems 18. Springer, Berlin
Tomppo E, Gschwantner T, Lawrence M, McRoberts RE (eds) (2010) National forest inventories: pathways for common reporting. Springer, Berlin
Tomppo E, Heikkinen J, Henttonen HM et al (2011) Designing and conducting a forest inventory - case: 9th National Forest Inventory of Finland. Springer, Netherlands
Tomppo E, Katila M, Mäkisara K, Peräsaari J (2012) The Multi-source National Forest Inventory of Finland – methods and results 2007 [online]. Work Pap Finnish For Res Inst 227. http://www.metla.fi/julkaisut/workingpapers/2012/mwp227.pdf
Tomppo E, Malimbwi R, Katila M et al (2014) A sampling design for a large area forest inventory: case Tanzania. Can J For Res 44:931–948. https://doi.org/10.1139/cjfr-2013-0490
Vidal C, Alberdi IA, Hernández Mateo L, Redmond JJ (eds) (2016) National forest inventories - assessment of wood availability and use, 1st edn. Springer International Publishing, Cham

Acknowledgements

Open access funding provided by Natural Resources Institute Finland (LUKE).

Funding

This study was funded by the Ministry of Agriculture and Forestry of Finland key project “Puuta liikkeelle ja uusia tuotteita metsästä” (“Wood on the move and new products from forest”).

Author information

Authors and Affiliations

Natural Resources Institute Finland (Luke), PO Box 2, FI-00791, Helsinki, Finland
Minna Räty
Natural Resources Institute Finland (Luke), Yliopistokatu 6, FI-80100, Joensuu, Finland
Annika Susanna Kangas

Corresponding author

Correspondence to Minna Räty.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Handling Editor: John M. Lhotka

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contribution of the co-authors

MR did the analysis and wrote the original draft. ASK supervised and coordinated the research, and reviewed and edited the original draft.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

About this article

Cite this article

Räty, M., Kangas, A.S. Effect of permanent plots on the relative efficiency of spatially balanced sampling in a national forest inventory. Annals of Forest Science 76, 20 (2019). https://doi.org/10.1007/s13595-019-0802-6

Received: 18 July 2018
Accepted: 18 January 2019
Published: 21 February 2019
DOI: https://doi.org/10.1007/s13595-019-0802-6