نتایج جستجو برای: synthetic minority over sampling technique
تعداد نتایج: 1974657 فیلتر نتایج به سال:
Since failure of steam turbines occurs frequently and can causes huge losses for thermal plants, it is important to identify a fault in advance. A novel clustering diagnosis method based on t-distribution stochastic neighborhood embedding (t-SNE) extreme gradient boosting (XGBoost) proposed this paper. First, the t-SNE algorithm was used map high-dimensional data low-dimensional space; K-means ...
The most important information about the content of a document is represented by the key phrases of that document. In this study an automatic key phrase extraction algorithm is devised using machine learning technique. The proposed method not only considers the document level statistics like TFxIDF, the linguistic features of the phrases are also incorporated. Experiment has been performed on N...
Authorship identification can be seen as a single-label multi-class text categorization problem. Very often, there are extremely few training texts at least for some of the candidate authors. In this paper, we present methods to handle imbalanced multi-class textual datasets. The main idea is to segment the training texts into sub-samples according to the size of the class. Hence, minority clas...
Twitter is one of the most popular social media used to interact online. Through Twitter, a person's personality can be determined based on that thoughts, feelings, and behavior patterns. A person has five main personalities likes Openness, Conscientiousness, Extraversion, Agreeableness, Neuroticism. This study will make predictions using Naïve Bayes method – Support Vector Machine, Synthetic M...
Delivery of justice with the help artificial intelligence is a current research interest. Machine learning natural language processing (NLP) can classify types sexual harassment experiences into quid pro quo (QPQ) and hostile work environments (HWE). However, imbalanced data are often present in classes classification on specific datasets. Data imbalance cause decrease classifier's performance ...
Cardiovascular diseases are considered as the most life-threatening syndromes with highest mortality rate globally. Over a period of time, they have become very common and now overstretching healthcare systems countries. The major factors cardiovascular high blood pressure, family history, stress, age, gender, cholesterol, Body Mass Index (BMI), unhealthy lifestyle. Based on these factors, rese...
Penyakit jantung merupakan penyakit paling mematikan didunia. Laporan WHO tahun 2019 menyebutkan sebagai penyebab kematian tertinggi didunia dengan persentase 16% dari jumlah atau 8.9 juta kematian. Tingginya yang disebabkan oleh ini terjadi karena biasanya timbul tanpa adanya gejala sehingga sulit untuk diketahui sejak dini penderita. Salah satu cara mengatasi permasalahan tersebut adalah pema...
Named Entity Recognition is an information extraction task that serves as a pre-processing step for other natural language processing tasks, such machine translation, retrieval, and question answering. entity recognition enables the identification of proper names well temporal numeric expressions in open domain text. For Semitic languages Arabic, Amharic, Hebrew, named more challenging due to h...
We propose use of Latin Hypercube Sampling to create a synthetic data set that reproduces many of the essential features of an original data set while providing disclosure protection. The synthetic micro data can also be used to create either additive or multiplicative noise which when merged with the original data can provide disclosure protection. The technique can also be used to create hybr...
in this thesis, a better reaction conditions for the synthesis of spirobarbiturates catalyzed by task-specific ionic liquid (2-hydroxy-n-(2-hydroxyethyl)-n,n-dimethylethanaminium formate), calcium hypochlorite ca(ocl)2 or n-bromosuccinimide (nbs) in the presence of water at room temperature by ultrasonic technique is provided. the design and synthesis of spirocycles is a challenging task becaus...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید