Pakistan Science Abstracts
Article details & metrics
No Detail Found!!
Classification of scientific publications using swarm intelligence.
Author(s):
1. Tariq Ali: Department of Computer Science, Mohammad Ali Jinnah University, Islamabad, Pakistan
2. Sohail Asghar: Department of Computer Science, Mohammad Ali Jinnah University, Islamabad, Pakistan
3. Naseer Ahmed Sajid: Department of Computer Science, Mohammad Ali Jinnah University, Islamabad, Pakistan
4. Munir Ahmad: Department of Computer Science, Mohammad Ali Jinnah University, Islamabad, Pakistan
Abstract:
Document classification is an important task in data mining. Currently, identifying category (i.e.,topic) of a scientific publication is a manual task. The Association for Computing Machinery Computing Classification System (ACM CCS) is most wildly used multi-level taxonomy for scientific document classification. Correct classification becomes difficult with an increase in number of levels as well as in number of categories. Domain overlapping aggravates this problem as a publication may belong to multiple domains. Thus manual classification to taxonomy becomes more difficult. Most of the existing text classification schemes are based on the Term Frequency and Inverse Document Frequency (TF-IDF) technique. Similar approaches become computationally inefficient for large datasets. Most of the techniques for text classification are not experimentally validated on scientific publication datasets. Also, multi-level and multi-class classification is missing in most of the existing schemes for document classification. The proposed approach is based on metadata (i.e., structural representation), in which only the title and keywords are considered. We reduced the features set by dropping some of the metadata, like abstract section of the scientific publication that diversifies the result accuracy. The proposed solution was inspired from the well-known evolutionary Particle Swarm Optimization (PSO). The proposed technique results in overall 84.71% accuracy on Journal of Universal Computer Science (J.UCS) dataset.
Page(s): 115-126
DOI: DOI not available
Published: Journal: Proceedings of Pakistan Academy of Sciences, Volume: 50, Issue: 2, Year: 2013
Keywords:
Keywords are not available for this article.
References:
References are not available for this document.
Citations
Citations are not available for this document.
0

Citations

0

Downloads

56

Views