hadoop

Search results for: hadoop

Number of results: 2553

Efficient Big Data Processing in Hadoop MapReduce

Journal: PVLDB 2012

Jens Dittrich, Jorge-Arnulfo Quiané-Ruiz

This tutorial is motivated by the clear need of many organizations, companies, and researchers to deal with big data volumes efficiently. Examples include web analytics applications, scientific applications, and social networks. A popular data processing engine for big data is Hadoop MapReduce. Early versions of Hadoop MapReduce suffered from severe performance problems. Today, this is becoming...

HADOOP: A Comparative Study between Single-Node and Multi-Node Cluster

Journal: International Journal of Advanced Computer Science and Applications 2021

Data analysis has become a challenge in recent years as the volume of data generated has become difficult to manage; therefore, more hardware and software resources are needed to store and process this huge amount of data. Apache Hadoop is a free framework, widely used thanks to the Hadoop Distributed File System (HDFS) and its ability to relate to other data processing components such as MapReduce for processing data, Spark for in-memory processing, Drill for SQL ...

Hadoop and MapReduce

Journal: Journal of the Korean Data and Information Science Society 2013

Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce

Journal: Proceedings of the VLDB Endowment. International Conference on Very Large Data Bases 2013

Ablimit Aji, Fusheng Wang, Hoang Vo, Rubao Lee, Qiaoling Liu, Xiaodong Zhang, Joel H. Saltz

Support of high performance queries on large volumes of spatial data becomes increasingly important in many application domains, including geospatial problems in numerous fields, location based services, and emerging scientific applications that are increasingly data- and compute-intensive. The emergence of massive scale spatial data is due to the proliferation of cost effective and ubiquitous ...

A MR Simulator in Facilitating Cloud Computing

2013

Palson Kennedy, T. V. Gopal

MapReduce is an enabling technology in support of Cloud Computing. Hadoop, which is a MapReduce implementation, has been widely used in developing MapReduce applications. This paper presents the Hadoop simulator HaSim, a MapReduce simulator which builds on top of Hadoop. HaSim models a large number of parameters that can affect the behaviors of MapReduce nodes, and thus it can be used to tune the performa...

Developing System Performance Metrics for Cloud Computing Based on Hadoop

2012

Rema Hariharan, Gabriele Jost, Sanjiv Lakhanpal, Dave Raddatz

This short white paper describes our efforts to establish techniques and tools to identify optimization opportunities for Hadoop workloads. Suitable performance metrics and relevant benchmark use cases are a crucial component to achieve these goals. We discuss efforts to define suitable metrics for cloud computing in general, briefly describe hardware and software components that impact Hadoop ...

Scheduling Job Queue on Hadoop using Hybrid Hadoop Fair Sojourn Protocol

Journal: Indian Journal of Science and Technology 2016

Pilot-Abstraction: A Valid Abstraction for Data-Intensive Applications on HPC, Hadoop and Cloud Infrastructures?

Journal: CoRR 2015

André Luckow, Pradeep Kumar Mantha, Shantenu Jha

HPC environments have traditionally been designed to meet the compute demand of scientific applications and data has only been a second order concern. With science moving toward data-driven discoveries relying more and more on correlations in data to form scientific hypotheses, the limitations of existing HPC approaches become apparent: Architectural paradigms such as the separation of storage ...

What is Hadoop?

Journal: مجلة الجمعیة المصریة لنظم المعلومات وتکنولوجیا الحاسبات 2017

Hadoop Superlinear Scalability

Journal: Queue 2015
