نتایج جستجو برای: batch data processing

تعداد نتایج: 2759647  

2002
Maciej Zakrzewicz Marek Wojciechowski

Data mining is a useful decision support technique, which can be used to find trends and regularities in warehouses of corporate data. A serious problem of its practical applications is long processing time required by data mining algorithms. Current systems consume minutes or hours to answer single requests, while typically batches of the requests are delivered the systems. In this paper we pr...

2016
Timo Bingmann Michael Axtmann Emanuel Jöbstl Sebastian Lamm Huyen Chau Nguyen Alexander Noe Sebastian Schlag Matthias Stumpp Tobias Sturm Peter Sanders

We present the design and a first performance evaluation of Thrill – a prototype of a general purpose big data processing framework with a convenient data-flow style programming interface. Thrill is somewhat similar to Apache Spark and Apache Flink with at least two main differences. First, Thrill is based on C++ which enables performance advantages due to direct native code compilation, a more...

2015
Raul Castro Fernandez Peter R. Pietzuch Jay Kreps Neha Narkhede Jun Rao Joel Koshy Dong Lin Chris Riccomini Guozhang Wang

With more sophisticated data-parallel processing systems, the new bottleneck in data-intensive companies shifts from the back-end data systems to the data integration stack, which is responsible for the pre-processing of data for back-end applications. The use of back-end data systems with different access latencies and data integration requirements poses new challenges that current data integr...

2014
Mingjie Chen R Shyama Prasad Rao Yiming Zhang Cathy Xiaoyan Zhong Jay J Thelen

The goal of metabolomics data pre-processing is to eliminate systematic variation, such that biologically-related metabolite signatures are detected by statistical pattern recognition. Although several methods have been developed to tackle the issue of batch-to-batch variation, each method has its advantages and disadvantages. In this study, we used a reference sample as a normalization standar...

2004
Shuguang Li Guojun Li Shaoqiang Zhang

We consider the problem of scheduling n jobs with release dates on m identical parallel batch processing machines so as to minimize the maximum lateness. Each batch processing machine can process up to B (B < n) jobs simultaneously as a batch, and the processing time of a batch is the largest processing time among the jobs in the batch. Jobs processed in the same batch start and complete at the...

2006
Alexandre Preti

This paper deals with unsupervised model adaptation for speaker recognition. Two adaptation schemes are proposed, the first one is based on a test by test model adaptation and the second one proposes a batch mode, where the adaptation is performed using a set of tests before computing the decision score for each of them. The experiments are conducted thanks to the NIST SRE 2005 database. This p...

Journal: :EURASIP J. Wireless Comm. and Networking 2009
Woo-Yong Choi

We propose the efficient reliable multicast MAC protocol based on the connectivity information among the recipients. Enhancing the BMMM (Batch Mode Multicast MAC) protocol, the reliable multicast MAC protocol significantly reduces the RAK (Request for ACK) frame transmissions in a reasonable computational time and enhances the MAC performance. By the analytical performance analysis, the through...

2012
Stefano Ermon Ronan Le Bras Carla P. Gomes Bart Selman R. Bruce van Dover

In combinatorial materials discovery, one searches for new materials with desirable properties by obtaining measurements on hundreds of samples in a single high-throughput batch experiment. As manual data analysis is becoming more and more impractical, there is a growing need to develop new techniques to automatically analyze and interpret such data. We describe a novel approach to the phase ma...

2017
Matteo Negri Marco Turchi Rajen Chatterjee Gebremedhen Gebremelak

Automatic post-editing (APE) for machine translation (MT) aims to fix recurrent errors made by the MT decoder by learning from correction examples. In controlled evaluation scenarios, the representativeness of the training set with respect to the test data is a key factor to achieve good performance. Real-life scenarios, however, do not guarantee such favorable learning conditions. Ideally, to ...

1992
Raymond J. Mooney

Most existing theory re nement systems are not incremental. However, any theory re nement system whose input and output theories are compatible can be used to incrementally assimilate data into an evolving theory. This is done by continually feeding its revised theory back in as its input theory. An incremental batch approach, in which the system assimilates a batch of examples at each step, se...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید