📞 +91-7667918914 | ✉️ iarjset@gmail.com
International Advanced Research Journal in Science, Engineering and Technology
International Advanced Research Journal in Science, Engineering and Technology A Monthly Peer-Reviewed Multidisciplinary Journal
ISSN Online 2393-8021ISSN Print 2394-1588Since 2014
IARJSET aligns to the suggestive parameters by the latest University Grants Commission (UGC) for peer-reviewed journals, committed to promoting research excellence, ethical publishing practices, and a global scholarly impact.
← Back to VOLUME 4, ISSUE 7, JULY 2017

AUTHORSHIP ATTRIBUTION USING UNSUPERVISED CLUSTERING ALGORITHMS ON ENGLISH C50 NEWS ARTICLES

Dr. O Srinivasa Rao, Dr. N V Ganapathi Raju, Dr. Y. Srilalitha, Mrs. P. Bharathi

👁 2 views📥 0 downloads
Share: 𝕏 f in

Abstract: The aim of the authorship attribution is identifying the author of an unknown/anonymous document. Many earlier researches used authorship attribution as a multi class single labelled text classifier problem. However, in several applications it is not easy or even possible to find such labeled data and it is necessary to build unsupervised attribution models that are able to estimate similarities/differences in personal style of authors. The present paper experimets authorship attribution as a clustering task using various unsupervised clustering algorithms like K-means, Mini Batch K-means and Ward Hierarchialclusterings and our authorship clustering algorithm achieves 97% of clustering accuracy in clustering C50 English news groups artcles.

Keywords: authorship clustering; unsupervised algorithms; C50 data set.

How to Cite:

[1] Dr. O Srinivasa Rao, Dr. N V Ganapathi Raju, Dr. Y. Srilalitha, Mrs. P. Bharathi, “AUTHORSHIP ATTRIBUTION USING UNSUPERVISED CLUSTERING ALGORITHMS ON ENGLISH C50 NEWS ARTICLES,” International Advanced Research Journal in Science, Engineering and Technology (IARJSET), DOI: 10.17148/IARJSET.2017.4747

Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 International License.