Comparing research trends through author-provided keywords with machine extracted terms: A ML algorithm approach using publications data on neurological disorders

نویسندگان

چکیده

Objective. This study aimed to identify the primary research areas, countries, and organizational involvement in publications on neurological disorders through an analysis of human-assigned keywords. These results were then compared with unsupervised machine-algorithm-based extracted terms from title abstract gain knowledge about deficiencies both techniques. has enabled us understand how far machine-derived titles abstracts can be a substitute for keywords scientific articles. Design/Methodology/Approach. While significant areas identified author-provided downloaded Web Science PubMed, these by based models like VOSviewer techniques YAKE CounterVectorizer. Results/Discussion. We observed that post-covid-19 era witnessed more various disorders, but authors still chose generic keyword list than specific ones. The extraction tool, VOSviewer, many other extraneous insignificant along However, our self-developed machine learning algorithm using CountVectorizer provided precise subject adding stop-words dictionary stop-word NLTK tool kit. Conclusion. although author play vital role as they are assigned broader sense increase readability, concept lacked specificity in-depth analysis. suggested ML being compatible unstructured data was valid alternative author-generated accurate results. Originality/Value. To knowledge, this is first-ever machine-extracted real datasets, which may essential lead domain. Replicating large datasets different fields valuable resource experts stakeholders.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Precision of Keywords Extracted From Persian Text Using Word2Vec Algorithm

Keywords can present the main concepts of the text without human intervention according to the model. Keywords are important vocabulary words that describe the text and play a very important role in accurate and fast understanding of the content. The purpose of extracting keywords is to identify the subject of the text and the main content of the text in the shortest time. Keyword extraction pl...

متن کامل

Data Clustring Using A New CGA(Chaotic-Generic Algorithm) Approach

Clustering is the process of dividing a set of input data into a number of subgroups. The members of each subgroup are similar to each other but different from members of other subgroups. The genetic algorithm has enjoyed many applications in clustering data. One of these applications is the clustering of images. The problem with the earlier methods used in clustering images was in selecting in...

متن کامل

Data Clustring Using A New CGA(Chaotic-Generic Algorithm) Approach

Clustering is the process of dividing a set of input data into a number of subgroups. The members of each subgroup are similar to each other but different from members of other subgroups. The genetic algorithm has enjoyed many applications in clustering data. One of these applications is the clustering of images. The problem with the earlier methods used in clustering images was in selecting in...

متن کامل

ML Confidential: Machine Learning on Encrypted Data

We demonstrate that, by using a recently proposed leveled homomorphic encryption scheme, it is possible to delegate the execution of a machine learning algorithm to a computing service while retaining confidentiality of the training and test data. Since the computational complexity of the homomorphic encryption scheme depends primarily on the number of levels of multiplications to be carried ou...

متن کامل

Identification Psychological Disorders Based on Data in Virtual Environments Using Machine Learning

Introduction: Psychological disorders is one of the most problematic and important issue in today's society. Early prognosis of these disorders matters because receiving professional help at the appropriate time could improve the quality of life of these patients. Recently, researches use social media as a form of new tools in identifying psychological disorder. It seems that through the use of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Iberoamerican journal of science measurement and communication

سال: 2023

ISSN: ['2709-7595', '2709-3158']

DOI: https://doi.org/10.47909/ijsmc.36