apache spark

Search results for: apache spark

Number of results: 18089

Scaling Spark in the Real World: Performance and Usability

Journal: PVLDB 2015

Michael Armbrust, Tathagata Das, Aaron Davidson, Ali Ghodsi, Andrew Or, Josh Rosen, Ion Stoica, Patrick Wendell, Reynold Xin, Matei Zaharia

Apache Spark is one of the most widely used open source processing engines for big data, with rich language-integrated APIs and a wide range of libraries. Over the past two years, our group has worked to deploy Spark to a wide range of organizations through consulting relationships as well as our hosted service, Databricks. We describe the main challenges and requirements that appeared in takin...


Towards Large Scale Environmental Data Processing with Apache Spark

2016

Diego Ferrón, Sebastián Villarroya, José Ramon Rios Viqueira, Tomás F. Pena

Currently available environmental datasets are either manually constructed by professionals or automatically generated from the observations provided by sensing devices. Usually, the former are modelled and recorded with traditional general-purpose relational technologies, whereas the latter require more specific scientific array formats and tools. Declarative data processing technologies are a...


Machine Learning Supported Diabetes Prediction with Apache Spark

Journal: Düzce Üniversitesi bilim ve teknoloji dergisi 2022

Diabetes is one of the critical health problems affecting the organs of the human body. For this reason, diabetes is regarded as a global problem of the 21st century. To avoid the complications that result from the disease and to treat them before they worsen, there is a need for a system that can predict and handle diabetes. In recent years, many early diagnos...


The STARK Framework for Spatio-Temporal Data Analytics on Spark

2017

Stefan Hagedorn, Philipp Götze, Kai-Uwe Sattler

Big Data sets can contain all types of information: from server log files to tracking information of mobile users with their location at a point in time. Apache Spark has been widely accepted for Big Data analytics because of its very fast processing model. However, Spark has no native support for spatial or spatio-temporal data. Spatial filters or joins using, e.g., a contains predicate are no...


Challenges in Predicting Disease State with Apache Spark

Journal: MOJ Proteomics & Bioinformatics 2016


Analisis Sentimen Pembelajaran Tatap Muka dengan Apache SPARK (Sentiment Analysis of Face-to-Face Learning with Apache Spark)

Journal: JURTI (Jurnal Rekayasa Teknologi Informasi) 2022

The Minister of Education (Mendikbud Ristek), Nadiem Makarim, stated that teachers and teaching staff would be prioritized for vaccination, so that by the second and third weeks of July, at the start of the new school year, all schools would be expected to hold limited face-to-face learning while still observing health protocols. However, according to statistics from covid19.go.id...


Apache Spark SVM for Predicting Obstructive Sleep Apnea

Journal: Big Data and Cognitive Computing 2020


Big Data in metagenomics: Apache Spark vs MPI

Journal: PLOS ONE 2020


Multi-objective Big Data Optimization with jMetal and Spark

2017

Cristóbal Barba-González, José García-Nieto, Antonio J. Nebro, José Francisco Aldana Montes

Big Data Optimization is the term used to refer to optimization problems which have to manage very large amounts of data. In this paper, we focus on the parallelization of metaheuristics with the Apache Spark cluster computing system for solving multi-objective Big Data Optimization problems. Our purpose is to study the influence of accessing data stored in the Hadoop File System (HDFS) in each...


Big Data Approaches for the Analysis of Large-Scale fMRI Data Using Apache Spark and GPU Processing: A Demonstration on Resting-State fMRI Data from the Human Connectome Project

Journal: Frontiers in neuroscience 2015

Roland N. Boubela, Klaudius Kalcher, Wolfgang Huf, Christian Našel, Ewald Moser

Technologies for scalable analysis of very large datasets have emerged in the domain of internet computing, but are still rarely used in neuroimaging despite the existence of data and research questions in need of efficient computation tools especially in fMRI. In this work, we present software tools for the application of Apache Spark and Graphics Processing Units (GPUs) to neuroimaging datase...

