نتایج جستجو برای: apache spark
تعداد نتایج: 18089 فیلتر نتایج به سال:
Abstract Reasoning is the process of inferring new knowledge and identifying inconsistencies within ontologies. Traditional techniques often prove inadequate when reasoning over large Knowledge Bases containing millions or billions facts. This article introduces NORA, a persistent scalable OWL reasoner built on top Apache Spark, designed to address challenges extensive complex NORA exploits sca...
Feature selection (FS) techniques generally require repeatedly training and evaluating models to assess theimportance of each feature for a particular task. However, due the increasing size currently availabledatabases, distributed processing has become necessity many tasks. In this context, Apache SparkML library is one most widely used libraries performing classification other tasks with larg...
Over the past few decades, volume of data has increased significantly in both scientific institutions and universities, with a large number students enrolled high related data. Furthermore, network traffic post-pandemic use online learning. Therefore, processing is complex challenging task that increases possibility intrusions anomalies. Traditional security systems cannot deal such high-speed ...
Aiming at the problem of spatial query processing in distributed computing systems, design and implementation new algorithms is a current challenge. Apache Spark memory-based framework suitable for real-time batch processing. Spark-based systems allow users to work on in-memory data, without worrying about data distribution mechanism fault-tolerance. Given two datasets points (called Query Trai...
Computing power has now become abundant with multi-core machines, grids and clouds, but it remains a challenge to harness the available power and move towards gracefully handling web-scale datasets. Several researchers have used automatically distributed computing frameworks, notably Hadoop and Spark, for processing multimedia material, but mostly using small collections on small clusters. In t...
Primal and dual block coordinate descent methods are iterative methods for solving regularized and unregularized optimization problems. Distributed-memory parallel implementations of these methods have become popular in analyzing large machine learning datasets. However, existing implementations communicate at every iteration which, on modern data center and supercomputing architectures, often ...
While cluster computing frameworks are continuously evolving to provide real-time data analysis capabilities, Apache Spark has managed to be at the forefront of big data analytics for being a unified framework for both, batch and stream data processing. However, recent studies on micro-architectural characterization of in-memory data analytics are limited to only batch processing workloads. We ...
Due to their promise of delivering real-time network insights, today’s streaming analytics platforms are increasingly being used in the communications networks where the impact of the insights go beyond sentiment and trend analysis to include real-time detection of security attacks and prediction of network state (i.e., is the network transitioning towards an outage). Current streaming analytic...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید