globalchange  > 过去全球变化的重建
DOI: 10.1371/journal.pone.0157988
论文题名:
A Novel Clustering Methodology Based on Modularity Optimisation for Detecting Authorship Affinities in Shakespearean Era Plays
作者: Leila M. Naeni; Hugh Craig; Regina Berretta; Pablo Moscato
刊名: PLOS ONE
ISSN: 1932-6203
出版年: 2016
发表日期: 2016-8-29
卷: 11, 期:8
语种: 英语
英文关键词: Algorithms ; Optimization ; Computer networks ; Data mining ; Dolphins ; Probability distribution ; Software tools ; Sports
英文摘要: In this study we propose a novel, unsupervised clustering methodology for analyzing large datasets. This new, efficient methodology converts the general clustering problem into the community detection problem in graph by using the Jensen-Shannon distance, a dissimilarity measure originating in Information Theory. Moreover, we use graph theoretic concepts for the generation and analysis of proximity graphs. Our methodology is based on a newly proposed memetic algorithm (iMA-Net) for discovering clusters of data elements by maximizing the modularity function in proximity graphs of literary works. To test the effectiveness of this general methodology, we apply it to a text corpus dataset, which contains frequencies of approximately 55,114 unique words across all 168 written in the Shakespearean era (16th and 17th centuries), to analyze and detect clusters of similar plays. Experimental results and comparison with state-of-the-art clustering methods demonstrate the remarkable performance of our new method for identifying high quality clusters which reflect the commonalities in the literary style of the plays.
URL: http://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0157988&type=printable
Citation statistics:
资源类型: 期刊论文
标识符: http://119.78.100.158/handle/2HF3EXSE/23520
Appears in Collections:过去全球变化的重建
影响、适应和脆弱性
科学计划与规划
气候变化与战略
全球变化的国际研究计划
气候减缓与适应
气候变化事实与影响

Files in This Item:
File Name/ File Size Content Type Version Access License
journal.pone.0157988.PDF(2576KB)期刊论文作者接受稿开放获取View Download

作者单位: The Priority Research Centre of Bioinformatics and Information-Based Medicine, The University of Newcastle, Newcastle, New South Wales, Australia;School of Electrical Engineering and Computer Science, Faculty of Engineering and Built Environment, The University of Newcastle, Newcastle, New South Wales, Australia;School of Built Environment, Faculty of Design, Architecture and Building, University of Technology Sydney, Sydney, Australia;Centre for Literary and Linguistic Computing, School of Humanities and Social Science, The University of Newcastle, Newcastle, New South Wales, Australia;The Priority Research Centre of Bioinformatics and Information-Based Medicine, The University of Newcastle, Newcastle, New South Wales, Australia;School of Electrical Engineering and Computer Science, Faculty of Engineering and Built Environment, The University of Newcastle, Newcastle, New South Wales, Australia;The Priority Research Centre of Bioinformatics and Information-Based Medicine, The University of Newcastle, Newcastle, New South Wales, Australia;School of Electrical Engineering and Computer Science, Faculty of Engineering and Built Environment, The University of Newcastle, Newcastle, New South Wales, Australia

Recommended Citation:
Leila M. Naeni,Hugh Craig,Regina Berretta,et al. A Novel Clustering Methodology Based on Modularity Optimisation for Detecting Authorship Affinities in Shakespearean Era Plays[J]. PLOS ONE,2016-01-01,11(8)
Service
Recommend this item
Sava as my favorate item
Show this item's statistics
Export Endnote File
Google Scholar
Similar articles in Google Scholar
[Leila M. Naeni]'s Articles
[Hugh Craig]'s Articles
[Regina Berretta]'s Articles
百度学术
Similar articles in Baidu Scholar
[Leila M. Naeni]'s Articles
[Hugh Craig]'s Articles
[Regina Berretta]'s Articles
CSDL cross search
Similar articles in CSDL Cross Search
[Leila M. Naeni]‘s Articles
[Hugh Craig]‘s Articles
[Regina Berretta]‘s Articles
Related Copyright Policies
Null
收藏/分享
文件名: journal.pone.0157988.PDF
格式: Adobe PDF
此文件暂不支持浏览
所有评论 (0)
暂无评论
 

Items in IR are protected by copyright, with all rights reserved, unless otherwise indicated.