نتایج جستجو برای: hadoop
تعداد نتایج: 2553 فیلتر نتایج به سال:
Hadoop is a powerful open cloud computing platform, using MapReduce hardled data seg mentation and merging. Personal data in the flat without any protection, could be attacked at any time, as a result, cloud platform on the personal fast search algorithm problem is very important. In this paper, a Hadoop cloud data search platform to have safety rules was proposed, it increase the safety of the...
Due to advances in 10GBASE-T technology, faster interconnects are now more affordable. This gives organizations the opportunity of deploying a cost-effective, high performance Hadoop cluster based on 10 GbE interconnects. As part of this paper, we have partnered with the Texas Advanced Computing Center (TACC) to examine how their unique implementation of Hadoop can benefi t from Intel’s cutting...
Customization of Recommendation System Using Collaborative Filtering Algorithm on Cloud Using Mahout
Recommendation System helps people in decision making regarding an item/person. Growth of World Wide Web and E-commerce are the catalyst for recommendation system. Due to large size of data, recommendation system suffers from scalability problem. Hadoop is one of the solutions for this problem. Collaborative filtering is a machine learning algorithm and Mahout is an open source java library whi...
With the growing maturity of SQL-on-Hadoop engines such as Hive, Impala, and Spark SQL, many enterprise customers are deploying new and legacy SQL applications on them to reduce costs and exploit the storage and computing power of large Hadoop clusters. On the enterprise data warehouse (EDW) front, customers want to reduce operational overhead of their legacy applications by processing portions...
The performance of three Hadoop applications is reported for several virtual configurations on VMware vSphere 5 and compared to native configurations. A well-balanced seven-node AMAX ClusterMax system was used to show that the average performance difference between native and the simplest virtualized configurations is only 4%. Further, the flexibility enabled by virtualization to create multipl...
Large scale data set provides the better opportunity to find out much better data relationship in the area of business intelligence. In the paper, we implement our systems using Hadoop that has been popular to store and compute Big Data. However, it is not easy to write Hadoop Map Reduce code. Therefore, we use Hive and Hive QL codes to understand the relationships between ratings and the users...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید