globalchange  > 过去全球变化的重建
DOI: 10.1371/journal.pone.0149089
论文题名:
Selecting Optimal Random Forest Predictive Models: A Case Study on Predicting the Spatial Distribution of Seabed Hardness
作者: Jin Li; Maggie Tran; Justy Siwabessy
刊名: PLOS ONE
ISSN: 1932-6203
出版年: 2016
发表日期: 2016-2-18
卷: 11, 期:2
语种: 英语
英文关键词: Forecasting ; Ecology and environmental sciences ; Marine environments ; Marine and aquatic sciences ; Acoustics ; Machine learning ; Community ecology ; Marine ecology
英文摘要: Spatially continuous predictions of seabed hardness are important baseline environmental information for sustainable management of Australia’s marine jurisdiction. Seabed hardness is often inferred from multibeam backscatter data with unknown accuracy and can be inferred from underwater video footage at limited locations. In this study, we classified the seabed into four classes based on two new seabed hardness classification schemes (i.e., hard90 and hard70). We developed optimal predictive models to predict seabed hardness using random forest (RF) based on the point data of hardness classes and spatially continuous multibeam data. Five feature selection (FS) methods that are variable importance (VI), averaged variable importance (AVI), knowledge informed AVI (KIAVI), Boruta and regularized RF (RRF) were tested based on predictive accuracy. Effects of highly correlated, important and unimportant predictors on the accuracy of RF predictive models were examined. Finally, spatial predictions generated using the most accurate models were visually examined and analysed. This study confirmed that: 1) hard90 and hard70 are effective seabed hardness classification schemes; 2) seabed hardness of four classes can be predicted with a high degree of accuracy; 3) the typical approach used to pre-select predictive variables by excluding highly correlated variables needs to be re-examined; 4) the identification of the important and unimportant predictors provides useful guidelines for further improving predictive models; 5) FS methods select the most accurate predictive model(s) instead of the most parsimonious ones, and AVI and Boruta are recommended for future studies; and 6) RF is an effective modelling method with high predictive accuracy for multi-level categorical data and can be applied to ‘small p and large n’ problems in environmental sciences. Additionally, automated computational programs for AVI need to be developed to increase its computational efficiency and caution should be taken when applying filter FS methods in selecting predictive models.
URL: http://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0149089&type=printable
Citation statistics:
资源类型: 期刊论文
标识符: http://119.78.100.158/handle/2HF3EXSE/25137
Appears in Collections:过去全球变化的重建
影响、适应和脆弱性
科学计划与规划
气候变化与战略
全球变化的国际研究计划
气候减缓与适应
气候变化事实与影响

Files in This Item:
File Name/ File Size Content Type Version Access License
journal.pone.0149089.PDF(4346KB)期刊论文作者接受稿开放获取View Download

作者单位: Geoscience Australia, GPO Box 378, Canberra, ACT, 2601, Australia;Geoscience Australia, GPO Box 378, Canberra, ACT, 2601, Australia;Geoscience Australia, GPO Box 378, Canberra, ACT, 2601, Australia

Recommended Citation:
Jin Li,Maggie Tran,Justy Siwabessy. Selecting Optimal Random Forest Predictive Models: A Case Study on Predicting the Spatial Distribution of Seabed Hardness[J]. PLOS ONE,2016-01-01,11(2)
Service
Recommend this item
Sava as my favorate item
Show this item's statistics
Export Endnote File
Google Scholar
Similar articles in Google Scholar
[Jin Li]'s Articles
[Maggie Tran]'s Articles
[Justy Siwabessy]'s Articles
百度学术
Similar articles in Baidu Scholar
[Jin Li]'s Articles
[Maggie Tran]'s Articles
[Justy Siwabessy]'s Articles
CSDL cross search
Similar articles in CSDL Cross Search
[Jin Li]‘s Articles
[Maggie Tran]‘s Articles
[Justy Siwabessy]‘s Articles
Related Copyright Policies
Null
收藏/分享
文件名: journal.pone.0149089.PDF
格式: Adobe PDF
此文件暂不支持浏览
所有评论 (0)
暂无评论
 

Items in IR are protected by copyright, with all rights reserved, unless otherwise indicated.