mapreduce

Search results for: mapreduce

Number of results: 3018

A Parallel Military-Dog-Based Algorithm for Clustering Big Data in Cognitive Industrial Internet of Things

Journal: IEEE Transactions on Industrial Informatics 2021

With the advancement of wireless communication, the Internet of Things (IoT), and big data, high-performance data analytic tools and algorithms are required. Data clustering, a promising technique, is widely used to solve IoT big-data-based problems, since it does not require labeled datasets. Recently, metaheuristic algorithms have been efficiently applied to various clustering problems. However, to handle datasets produced from ...

MapReduce System over Heterogeneous Mobile Devices

2009

Peter R. Elespuru Sagun Shakya Shivakant Mishra

MapReduce is a distributed processing algorithm which breaks up large problem sets into small pieces, such that a large cluster of computers can work on those small pieces in an efficient, timely manner. MapReduce was created and popularized by Google, and is widely used as a means of processing large amounts of textual data for the purpose of indexing it for search later on. This paper examine...
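As a rough illustration of the "small pieces" idea in the abstract above, here is a minimal single-process sketch of the map, shuffle, and reduce steps applied to word counting. It is not the system described in the paper; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical and chosen only for this example, and a real deployment would run the map and reduce calls on separate machines.

```python
from collections import defaultdict


def map_phase(document):
    """Emit a (word, 1) pair for every word in one input chunk."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Group emitted values by key, as a framework does between the phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values seen for one key into a single count."""
    return key, sum(values)


def word_count(documents):
    """Run map over every chunk, shuffle by key, then reduce each group."""
    pairs = [pair for doc in documents for pair in map_phase(doc)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
```

Each document stands in for one input split; because `map_phase` only looks at its own chunk and `reduce_phase` only looks at one key, both steps could be distributed without changing the logic.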

Infusing Parallelism into Introductory Computer Science Curriculum using MapReduce

2007

Matthew Johnson Robert H. Liao Alexander Rasmussen Ramesh Sridharan Dan Garcia Brian K. Harvey

We have incorporated cluster computing fundamentals into the introductory computer science curriculum at UC Berkeley. For the first course, we have developed coursework and programming problems in Scheme centered around Google’s MapReduce. To allow students only familiar with Scheme to write and run MapReduce programs, we designed a functional interface in Scheme and implemented software to all...

OS4M: Achieving Global Load Balance of MapReduce Workload by Scheduling at the Operation Level

Journal: CoRR 2014

Liya Fan Bo Gao Fa Zhang Zhiyong Liu

The efficiency of MapReduce is closely related to its load balance. Existing works on MapReduce load balance focus on coarse-grained scheduling. This study concerns fine-grained scheduling on MapReduce operations, with each operation representing one invocation of the Map or Reduce function. By default, MapReduce adopts the hash-based method to schedule Reduce operations, which often leads to po...
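The hash-based default mentioned in the abstract above can be sketched as follows. This is only an illustration of why hashing keys to reducers can leave the load uneven; it is not the OS4M scheduler. The helper names and the sample key sizes are hypothetical, and `zlib.crc32` is used as a stand-in hash purely so that the result is reproducible across runs.

```python
import zlib


def hash_partition(key, num_reducers):
    """Assign a key to a reducer index by hashing the key."""
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def reducer_loads(key_sizes, num_reducers):
    """Sum the number of values routed to each reducer."""
    loads = [0] * num_reducers
    for key, size in key_sizes.items():
        loads[hash_partition(key, num_reducers)] += size
    return loads


# Hypothetical workload: one very frequent key and a few rare ones.
key_sizes = {"the": 1000, "a": 10, "b": 10, "c": 10}
loads = reducer_loads(key_sizes, num_reducers=2)
print(loads)
```

Because all values for one key go to the same reducer, whichever reducer receives the frequent key carries at least 1000 of the 1030 values, no matter how the remaining keys hash.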

ETLMR: A Highly Scalable Dimensional ETL Framework Based on MapReduce

2011

Xiufeng Liu Christian Thomsen Torben Bach Pedersen

Extract-Transform-Load (ETL) flows periodically populate data warehouses (DWs) with data from different source systems. An increasing challenge for ETL flows is processing huge volumes of data quickly. MapReduce is establishing itself as the de-facto standard for large-scale data-intensive processing. However, MapReduce lacks support for high-level ETL specific constructs, resulting in low ETL ...

The Limitation of MapReduce: A Probing Case and a Lightweight Solution

2010

Zhiqiang Ma Lin Gu

MapReduce is arguably the most successful parallelization framework especially for processing large data sets in datacenters comprising commodity computers. However, difficulties are observed in porting sophisticated applications to MapReduce, albeit the existence of numerous parallelization opportunities. Intrinsically, the MapReduce design allows a program to scale up to handle extremely larg...

MapReduce for Integer Factorization

Journal: CoRR 2009

Javier Tordable

Integer factorization is a very hard computational problem. Currently no efficient algorithm for integer factorization is publicly known. However, this is an important problem on which the security of many real-world cryptographic systems relies. I present an implementation of a fast factorization algorithm on MapReduce. MapReduce is a programming model for high performance applications develo...

Course 7001 Mini Project Performance Evaluation of Hadoop on Virtual Machines

2009

Yuzhe Tang

MapReduce[1] is a popular programming framework that is intended for automatic parallelization of computation in the cloud. MapReduce deals with data-intensive applications; a huge amount of data is first loaded from a remote DFS, then copied as intermediate results from Mapper to Reducer, and finally written back to the DFS. Along with this large amount of data transfer, many I/O operations are incu...

Document Selection Using Mapreduce

Journal: International Journal of Security, Privacy and Trust Management 2015

Large Scale Temporal RDFS Reasoning Using MapReduce

2012

Chang Liu Guilin Qi Yong Yu

In this work, we build a large scale reasoning engine under temporal RDFS semantics using MapReduce. We identify the major challenges of applying MapReduce framework to reason over temporal information, and present our solutions to tackle them.
