after preprocessing

نتایج جستجو برای: after preprocessing

تعداد نتایج: 1675770 فیلتر نتایج به سال:

An Evaluation of Sampling on Filter-Based Feature Selection Methods

2010

Kehan Gao Taghi M. Khoshgoftaar Jason Van Hulse

Feature selection and data sampling are two of the most important data preprocessing activities in the practice of data mining. Feature selection is used to remove less important features from the training data set, while data sampling is an effective means for dealing with the class imbalance problem. While the impacts of feature selection and class imbalance have been frequently investigated ...

متن کامل

Space-time trade-offs for some ranking and searching queries

Journal: :Inf. Process. Lett. 2001

Adrian Dumitrescu William L. Steiger

We study space/time tradeoffs for querying some combinatorial structures. In the first, given an arrangement of n lines in general position in the plane, a query for a real number t asks about Rank(t), the number of vertices of the arrangement with x-coordinates ≤ t. We show that for K = O(n/logn), after a preprocessing step that uses space S = O(n/(K logK)) the query can be answered in time O(...

متن کامل

Optimising Sentiment Classification using Preprocessing Techniques

2015

Kranti Vithal Ghag Ketan Shah

Sentiment Classification refers to the computational techniques for classifying whether the sentiments of text are positive or negative. Sentiment Classification being a specialized domain of text mining is expected to benefit after preprocessing. In this paper we propose various models with selective combinations of preprocessing techniques and Sentiment Classifiers, to optimize Sentiment Clas...

متن کامل

Shortest Paths in Digraphs of Small Treewidth. Part I: Sequential Algorithms Shortest Paths in Digraphs of Small Treewidth. Part I: Sequential Algorithms

1995

Shiva Chaudhuri Christos D. Zaroliagis

We consider the problem of preprocessing an n-vertex digraph with real edge weights so that subsequent queries for the shortest path or distance between any two vertices can be eeciently answered. We give algorithms that depend on the treewidth of the input graph. When the treewidth is a constant, our algorithms can answer distance queries in O((n)) time after O(n) preprocessing. This improves ...

متن کامل

English-French Document Alignment Based on Keywords and Statistical Translation

2016

Marek Medved Milos Jakubícek Vojtech Kovár

In this paper we present our approach to the Bilingual Document Alignment Task (WMT16), where the main goal was to reach the best recall on extracting aligned pages within the provided data. Our approach consists of tree main parts: data preprocessing, keyword extraction and text pairs scoring based on keyword matching. For text preprocessing we use the TreeTagger pipeline that contains the Uni...

متن کامل

Shortest Path Queries in Digraphs of Small Treewidth

1995

Shiva Chaudhuri Christos D. Zaroliagis

We consider the problem of preprocessing an n-vertex digraph with real edge weights so that subsequent queries for the shortest path or distance between any two vertices can be efficiently answered. We give algorithms that depend on the treewidth of the input graph. When the treewidth is a constant, our algorithms can answer distance queries in O(α(n)) time after O(n) preprocessing. This improv...

متن کامل

Improved Implementation of Point Location in General Two-Dimensional Subdivisions

2012

Michael Hemmer Michal Kleinbort Dan Halperin

We present a major revamp of the point-location data structure for general two-dimensional subdivisions via randomized incremental construction, implemented in Cgal, the Computational Geometry Algorithms Library. We can now guarantee that the constructed directed acyclic graph G is of linear size and provides logarithmic query time. Via the construction of the Voronoi diagram for a given point ...

متن کامل

On Concurrent Zero-Knowledge with Pre-processing

1999

Giovanni Di Crescenzo Rafail Ostrovsky

Concurrent Zero-Knowledge protocols remain zero-knowledge even when many sessions of them are executed together. These protocols have applications in a distributed setting, where many executions of the same protocol must take place at the same time by many parties, such as the Internet. In this paper, we are concerned with the number of rounds of interaction needed for such protocols and their ...

متن کامل

Affymetrix GeneChip microarray preprocessing for multivariate analyses

Journal: :Briefings in Bioinformatics 2011

متن کامل

Using Entropy in Web Usage Data Preprocessing

Journal: :Entropy 2018

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید