apache hadoop (98 results)
-
Webcasts
Big Data technology: Are you in over your head?
April 26, 2012 12:00am PDT
Big Data technology is popular in startups and academia but, despite all the chatter, is it really ready for the enterprise? Yes, organizations are collecting data, but for now it mostly just...
Provided by: ZDNet
-
White Papers
Understanding the Effects and Implications of Compute Node Related Failures in Hadoop
April 17, 2012 12:00am PDT
Hadoop has become a critical component in today's cloud environment. Ensuring good performance for Hadoop is paramount for the wide range of applications built on top of it. In this paper, the...
Provided by: Association for Computing Machinery
-
White Papers
Oracle In-Database Hadoop: When MapReduce Meets RDBMS
March 14, 2012 12:00am PDT
Big data is the tar sands of the data world: vast reserves of raw gritty data whose valuable information content can only be extracted at great cost. MapReduce is a popular parallel programming...
Provided by: Association for Computing Machinery
-
Blog Post
Where are all the cloud-based Hadoop services?
December 8, 2011 9:00am PST
Ian Hardenburgh looks at Apache Hadoop's software framework as the best possibility for harnessing the amorphous data collected by social media marketing efforts.
-
White Papers
On the Duality of Data-Intensive File System Design: Reconciling HDFS and PVFS
November 18, 2011 12:00am PST
Data-intensive applications fall into two computing styles: Internet services (cloud computing) or High-Performance Computing (HPC). In both categories, the underlying file system is a key...
Provided by: Association for Computing Machinery
-
Blog Post
Hadoop: Cheat Sheet
November 15, 2011 8:22am PST
An elephant-themed, open-source way to tackle big data...
-
White Papers
Analysis of Hadoop's Performance Under Failures
November 14, 2011 12:00am PST
Failures are common in today's data center environment and can significantly impact the performance of important jobs running on top of large scale computing frameworks. In this paper, the authors...
Provided by: Rice University
-
White Papers
Management of Data Replication for PC Cluster Based Cloud Storage System
November 1, 2011 12:00am PDT
Storage systems are essential building blocks for cloud computing infrastructures. Although high performance storage servers are the ultimate solution for cloud storage, the implementation of...
Provided by: University of Computer Studies
-
White Papers
A Front-End, Hadoop-Based Data Management Service for Efficient Federated Clouds
October 6, 2011 12:00am PDT
In recent years, cloud computing has emerged as the new IT paradigm that promises elastic resources on a pay-per-use basis. The challenges of cloud computing are focused around massive data...
Provided by: University of Athens
-
Blog Post
Oracle Big Data Appliance: Data so big it's scary?
October 3, 2011 10:38am PDT
With the new Oracle Big Data Appliance, Oracle will commercialize NoSQL and Hadoop for cloud-powered big data analytics.
-
White Papers
BotCloud: Detecting Botnets Using MapReduce
September 28, 2011 12:00am PDT
Botnets are a major threat to the current Internet. Understanding the novel generation of botnets relying on peer-to-peer networks is crucial for mitigating this threat. Nowadays, botnet traffic...
Provided by: University of Luxembourg
-
Webcasts
Trends in Business Analytics
August 31, 2011 12:00am PDT
In his presentation "Trends in Business Analytics", Colin White (Founder, BI Research) will explore the impact that trends such as analytic RDBMSes, Hadoop and MapReduce, the NoSQL movement,...
Provided by: IBM
-
White Papers
Static Scheduling in Clouds
August 8, 2011 12:00am PDT
Cloud computing aims to give users virtually unlimited pay-per-use computing resources without the burden of managing the underlying infrastructure. The authors present a new job execution...
Provided by: Institute of Science and Technology
-
White Papers
Comparing High Level MapReduce Query Languages
July 13, 2011 12:00am PDT
The MapReduce parallel computational model is of increasing importance. A number of High Level Query Languages (HLQLs) have been constructed on top of the Hadoop MapReduce realization, primarily...
Provided by: Heriot-Watt University
-
White Papers
Towards Peer-to-Peer Virtualized Service Hosting, Discovery and Delivery
July 11, 2011 12:00am PDT
This paper introduces a peer-to-peer framework for providing, locating and consuming distributed services that are encapsulated within virtual machines. The authors believe that the decentralized...
Provided by: University of Malta
-
White Papers
Cloud Computing for Online Visualization of GIS Applications in Ubiquitous City
July 11, 2011 12:00am PDT
Cloud computing can be used to generate 3D noise maps in ubiquitous cities. In this paper, the authors present their cloud computing approach, its performance and a performance comparison...
Provided by: IARIA
-
White Papers
Performance Evaluation of MapReduce Using Full Virtualisation on a Departmental Cloud
June 16, 2011 12:00am PDT
This paper analyses the performance of Hadoop, an implementation of the MapReduce programming model for distributed parallel computing, executing on a virtualisation environment comprised of 1+16...
Provided by: Robert Gordon University
-
White Papers
XML Query Optimization in Map-Reduce
June 12, 2011 12:00am PDT
The authors present a novel query language for large-scale analysis of XML data on a map-reduce environment, called MRQL, that is expressive enough to capture most common data analysis tasks and...
Provided by: University of Texas
-
White Papers
Hadoop's Overload Tolerant Design Exacerbates Failure Detection and Recovery
June 12, 2011 12:00am PDT
Data processing frameworks like Hadoop need to efficiently address failures, which are common occurrences in today's large-scale data center environments. Failures have a detrimental effect on the...
Provided by: Association for Computing Machinery
-
White Papers
Shared Cluster Scheduling: A Fair and Efficient Protocol
June 10, 2011 12:00am PDT
In this paper, the authors focus on the problem of resource allocation in a shared cluster used for data-intensive scalable computing. Specifically, they target the open-source implementation of...
Provided by: Association for Computing Machinery