Author(s):
1. Ali Arshad:
Northwest Normal University Lanzhou,China
2. Wanghu Chen:
Northwest Normal University Lanzhou,China
3. Yang Liu:
Northwest Normal University Lanzhou,China
4. Nauman Ali Khan:
University of Science and Technology,Hefei,China
Abstract:
Keywords are the list of valuable words present in a paragraph, that help in quickly understanding the context of the paragraph. These keywords hold the generic and overall meaning of the paragraph. Extraction of valid and meaningful keywords from scientific documents became one of the hot topics for researchers. Such research not only facilitates better comprehension of articles but also explores the scientific manner of understanding big repositories of scientific documents. In this study, we propose Semantic keyword extraction by adding a new feature that includes domain-specific grammar rules and deduction of adjectives. Our algorithm incorporates frequencies of keywords that are appearing repeatedly. The proposed framework extracts the keywords from the scientific paper abstract to support topic clustering. Such topic clustering benefits the new researchers to easily and quickly find their research topic in the concerned field of educational big data. We have selected the educational big dataset that includes 1028 published research papers regarding education learning, education management, students’ information system, etc. For evaluating the results and performance of a Semantic Keyword Extractor, we have used a general dataset. The proposed keyword extractor gives a precision of 76.8% which outperforms other keywords extractors. In our research, our proposed framework classified scientific papers into 3 meaningful groups by using an unsupervised machine learning clustering technique called k-means
Page(s):
1-1
DOI:
DOI not available
Published:
Journal: IEEE International Conference on Digital Futures and Transformative Technologies (ICoDT2) May 24-26, 2022 (Book of Abstracts), Volume: 1, Issue: 1, Year: 2022
Keywords:
Big Data
,
Keywords Extraction
,
Topic Clustering
References:
References are not available for this document.
Citations
Citations are not available for this document.