DOI: 10.1016/j.ecolmodel.2019.108719
WOS accession number: WOS:000480664800006
Title:
Species distribution models can be highly sensitive to algorithm configuration
Authors: Hallgren, W.1; Santana, F.2; Low-Choy, S.3; Zhao, Y.2; Mackey, B.1
Corresponding author: Hallgren, W.
Journal: ECOLOGICAL MODELLING
ISSN: 0304-3800
EISSN: 1872-7026
Publication year: 2019
Volume: 408
Language: English
Keywords: Configuration option settings ; Provenance ; Transparency ; Koala ; Thorny devil ; MaxEnt ; ANN ; GLM ; GAM ; MARS ; FDA ; SRE ; CTA
WOS keywords: NEURAL-NETWORKS ; PREDICTION ; ECOLOGY
WOS subject category: Ecology
WOS research area: Environmental Sciences & Ecology
Abstract:

In pursuit of a more robust provenance in the field of species distribution modelling, an extensive literature search was undertaken to find the typical default values, and the range of values, for configuration settings of a large number of the most commonly used statistical algorithms available for constructing species distribution models (SDM), as implemented in the R script packages (such as Dismo and Biomod2) or other species distribution modelling programs like MaxEnt. We found that documentation of SDM algorithm configuration option settings in the SDM literature is, overall, very uncommon, and the justifications for these settings were minimal, when present. Such settings were often the R default values, or were the result of trial and error. This is potentially concerning since: (i) it detracts from the robustness of the provenance for such SDM studies; (ii) a lack of documentation of configuration option settings in a paper prevents the replication of an experiment, which contravenes one of the main tenets of the scientific method; (iii) inappropriate or uninformed configuration option settings are particularly concerning if they represent a poorly understood ecological variable or process, and if the algorithm is sensitive to such settings, this could result in erroneous and/or unrealistic SDMs. Therefore, this study sets out to comprehensively test the sensitivity of eight widely used SDM algorithms to variation in configuration option settings: MaxEnt, Artificial Neural Network (ANN), Generalized Linear Model (GLM), Generalized Additive Model (GAM), Multivariate Adaptive Regression Splines (MARS), Flexible Discriminant Analysis (FDA), Surface Range Envelope (SRE) and Classification Tree Analysis (CTA).


A process of expert elicitation was used to derive a range of appropriate values with which to test the sensitivity of our algorithms. We chose to use species occurrence records for two species - Koala (Phascolarctos cinereus) and Thorny Devil (Moloch horridus) - in order to investigate how algorithm sensitivity depends on the species being modelled. Results were assessed by comparing the modelled distribution of the control SDM (default settings) to the modelled distribution from each sensitivity test SDM (i.e. non-default configuration settings). This was done using the visual and statistical measures of predictive performance available in the Biodiversity and Climate Change Virtual Laboratory (BCCVL), including the area under the (receiver operating characteristic) curve. The aim of our study was to be able to draw conclusions as to how the sensitivity of SDM algorithms to their configuration option settings may detract from the reliability of SDM results, given the often unjustified and unscrutinized use of the default settings, and generally infrequent and largely perfunctory attendance to this issue in most of the published SDM literature. Our results indicate that all of the algorithms tested showed sensitivity to alternative (non-default) values for some of their configuration settings and that often this sensitivity is species-dependent. Therefore, we can conclude that the choice of configuration settings in these widely used SDM algorithms can have a large impact on the resulting projected distribution. This has important ramifications for decision-making and policy outcomes wherever SDMs are used to inform species and biodiversity management plans and policy settings. This study demonstrates that assigning suitable values for these settings is a very important consideration and as such should always be published along with the model. Documenting all configuration settings is necessary to increase the scientific robustness, transparency and reproducibility of species distribution modelling studies.
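The control-versus-sensitivity-test comparison described in the abstract can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the one-variable range envelope, the `quant` trimming parameter, the value 0.025 used as the "default", and the synthetic presence and background data. It does not reproduce the BCCVL workflow, the authors' code, or any package's implementation. It only shows the general pattern of fitting one model per configuration value, then reporting the area under the ROC curve and the share of background sites whose prediction differs from the control.

```python
import random


def auc(presence_scores, background_scores):
    """Rank-based AUC: chance that a presence site outscores a
    background site, with ties counted as half."""
    wins = 0.0
    for p in presence_scores:
        for b in background_scores:
            if p > b:
                wins += 1.0
            elif p == b:
                wins += 0.5
    return wins / (len(presence_scores) * len(background_scores))


def fit_envelope(values, quant):
    """Toy range envelope (hypothetical): trim a `quant` share of the
    training values from each tail and keep the remaining range."""
    ordered = sorted(values)
    n = len(ordered)
    lower = ordered[int(quant * (n - 1))]
    upper = ordered[int((1.0 - quant) * (n - 1))]
    return lower, upper


def predict(envelope, values):
    """Return 1.0 for sites inside the envelope and 0.0 otherwise."""
    lower, upper = envelope
    return [1.0 if lower <= v <= upper else 0.0 for v in values]


def sensitivity_test(presence, background, default_quant, test_quants):
    """Compare each test configuration against the control (default) model."""
    control_map = predict(fit_envelope(presence, default_quant), background)
    results = {}
    for quant in test_quants:
        envelope = fit_envelope(presence, quant)
        test_map = predict(envelope, background)
        changed = sum(1 for c, t in zip(control_map, test_map) if c != t)
        results[quant] = {
            "auc": auc(predict(envelope, presence), test_map),
            "changed_fraction": changed / len(background),
        }
    return results


# Synthetic data (assumption): one environmental variable, e.g. temperature.
rng = random.Random(42)
presence = [rng.gauss(20.0, 3.0) for _ in range(200)]
background = [rng.uniform(0.0, 40.0) for _ in range(1000)]

report = sensitivity_test(
    presence, background, default_quant=0.025, test_quants=[0.025, 0.1, 0.25]
)
for quant, row in report.items():
    print(f"quant={quant}: AUC={row['auc']:.3f}, "
          f"changed={row['changed_fraction']:.3f}")
```

Printing the configuration value next to each metric, as the loop does, is also the simplest form of the documentation the abstract calls for: the setting that produced a given map is recorded alongside the result.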


Resource type: Journal article
Identifier: http://119.78.100.158/handle/2HF3EXSE/146996
Appears in Collections: International Research Programs on Global Change



Author affiliations: 1.Griffith Univ, Griffith Climate Change Response Program, Southport, Qld, Australia
2.Univ Canberra, Fac Sci & Technol, Canberra, ACT, Australia
3.Griffith Univ, Griffith Social & Behav Res Coll, Southport, Qld, Australia

Recommended Citation:
Hallgren, W., Santana, F., Low-Choy, S., et al. Species distribution models can be highly sensitive to algorithm configuration[J]. ECOLOGICAL MODELLING, 2019-01-01, 408