hadoop

Search results for: hadoop

Number of results: 2553

Scaling Beyond One Rack and Sizing of Hadoop Platform

Journal: Scalable Computing: Practice and Experience 2015

Wieslawa Litke Marcin Budka

This paper focuses on two aspects of configuration choices of the Hadoop platform. Firstly we are looking to establish performance implications of expanding an existing Hadoop cluster beyond a single rack. In the second part of the testing we are focusing on performance differences when deploying clusters of different sizes. The study also examines constraints of the disk latency found on the t...

Survey Based on Hadoop

Journal: International Journal for Research in Applied Science and Engineering Technology 2017

OEHadoop: Accelerate Hadoop Applications by Co-Designing Hadoop With Data Center Network

Journal: IEEE Access 2018

Tutorial: SQL-on-Hadoop Systems

Journal: PVLDB 2015

Daniel J. Abadi Shivnath Babu Fatma Özcan Ippokratis Pandis

Enterprises are increasingly using Apache Hadoop, more specifically HDFS, as a central repository for all their data; data coming from various sources, including operational systems, social media and the web, sensors and smart devices, as well as their applications. At the same time many enterprise data management tools (e.g. from SAP ERP and SAS to Tableau) rely on SQL and many enterprise user...

Big Data Using Hadoop

2017

17ANSP-BD-001 Hadoop Performance Modeling for Job Estimation and Resource Provisioning. MapReduce has become a major computing model for data-intensive applications. Hadoop, an open source implementation of MapReduce, has been adopted by an increasingly growing user community. Cloud computing service providers such as Amazon EC2 Cloud offer the opportunities for Hadoop users to lease a certain amou...

FP-Hadoop: Efficient processing of skewed MapReduce jobs

Journal: Inf. Syst. 2016

Miguel Liroz-Gistau Reza Akbarinia Divyakant Agrawal Patrick Valduriez

Nowadays, we are witnessing the fast production of very large amounts of data, particularly by the users of online systems on the Web. However, processing this big data is very challenging since both space and computational requirements are hard to satisfy. One solution for dealing with such requirements is to take advantage of parallel frameworks, such as MapReduce or Spark, that allow to make ...

MR-Tandem: parallel X!Tandem using Hadoop MapReduce on Amazon Web Services

Journal: Bioinformatics 2012

Brian Pratt J. Jeffry Howbert Natalie I. Tasman Erik J. Nilsson

SUMMARY: MR-Tandem adapts the popular X!Tandem peptide search engine to work with Hadoop MapReduce for reliable parallel execution of large searches. MR-Tandem runs on any Hadoop cluster but offers special support for Amazon Web Services for creating inexpensive on-demand Hadoop clusters, enabling search volumes that might not otherwise be feasible with the compute resources a researcher has at ...

Diagnosing Heterogeneous Hadoop Clusters

2013

Shekhar Gupta Christian Fritz Johan de Kleer Cees Witteveen

We present a data-driven approach for diagnosing performance issues in heterogeneous Hadoop clusters. Hadoop is a popular and extremely successful framework for horizontally scalable distributed computing over large data sets based on the MapReduce framework. In its current implementation, Hadoop assumes a homogeneous cluster of compute nodes. This assumption manifests in Hadoop’s scheduling al...

myHadoop - Hadoop-on-Demand on Traditional HPC Resources

2004

Sriram Krishnan Mahidhar Tatineni Chaitanya Baru

Traditional High Performance Computing (HPC) resources, such as those available on the TeraGrid, support batch job submissions using Distributed Resource Management Systems (DRMS) like TORQUE or the Sun Grid Engine (SGE). For large-scale data intensive computing, programming paradigms such as MapReduce are becoming popular. A growing number of codes in scientific domains such as Bioinformatics ...

The Cooperative Study Between the Hadoop Big Data Platform and the Traditional Data Warehouse

2015

Ping Hu

Based on the application conditions of existing traditional data warehouses and the forecast future of the Hadoop big data platform, this paper proposes a new framework for the cooperation of Hadoop and the traditional data warehouse, which focuses on the cooperation between the traditional data warehouse and the Hadoop technique to solve the problem that the traditional data wareho...
