- Subscribe to this page:
- RSS
- Email Alert
VLDB Endowment
(391 results)-
Whitepapers
MapReduce Programming and Cost based Optimization? Crossing This Chasm With Starfish
September 3, 2011, 12:00am PDT
MapReduce has emerged as a viable competitor to database systems in big data analytics. MapReduce programs are being written for a wide variety of application domains including business data...
Provided by VLDB Endowment
-
Whitepapers
Proactive Detection and Repair of Data Corruption: Towards a Hasslefree Declarative Approach With Amulet
September 3, 2011, 12:00am PDT
Occasional corruption of stored data is an unfortunate byproduct of the complexity of modern systems. Hardware errors, software bugs, and mistakes by human administrators can corrupt important...
Provided by VLDB Endowment
-
Whitepapers
Scalable SPARQL Querying of Large RDF Graphs
September 3, 2011, 12:00am PDT
The generation of RDF data has accelerated to the point where many data sets need to be partitioned across multiple machines in order to achieve reasonable performance when querying the data....
Provided by VLDB Endowment
-
White Papers
Linkage Query Writer
August 28, 2009, 12:00am PDT
The authors present Linkage Query Writer (LinQuer), a system for generating SQL queries for semantic link discovery over relational data. The LinQuer framework consists of LinQL, a language for...
Provided by VLDB Endowment
-
White Papers
MEET DB2: Automated Database Migration Evaluation
October 1, 2009, 12:00am PDT
Commercial databases compete for market share, which is composed of not only net-new sales to those purchasing a database for the first time, but also competitive \win-backs" and migrations....
Provided by VLDB Endowment
-
White Papers
Index Interactions in Physical Design Tuning: Modeling, Analysis, and Applications
August 24, 2009, 12:00am PDT
One of the key tasks of a database administrator is to optimize the set of materialized indices with respect to the current workload. To aid administrators in this challenging task, commercial...
Provided by VLDB Endowment
-
White Papers
From Regular Expressions to Nested Words: Unifying Languages and Query Execution for Relational and XML Sequences
October 1, 2009, 12:00am PDT
There is growing interest in query language extensions for pattern matching over event streams and stored database sequences, due to the many important applications that such extensions make...
Provided by VLDB Endowment
-
White Papers
Tuning Database Configuration Parameters With iTuned
August 28, 2009, 12:00am PDT
Database systems have a large number of configuration parameters that control memory distribution, I/O optimization, costing of query plans, parallelism, many aspects of logging, recovery, and...
Provided by VLDB Endowment
-
White Papers
Xplus: A SQL-Tuning-Aware Query Optimizer
October 1, 2009, 12:00am PDT
The need to improve a suboptimal execution plan picked by the query optimizer for a repeatedly run SQL query arises routinely. Complex expressions, skewed or correlated data, and changing...
Provided by VLDB Endowment
-
White Papers
UPI: A Primary Index for Uncertain Databases
July 12, 2010, 12:00am PDT
Uncertain data management has received growing attention from industry and academia. Many efforts have been made to optimize uncertain databases, including the development of special index data...
Provided by VLDB Endowment
-
White Papers
CORADD: Correlation Aware Database Designer for Materialized Views and Indexes
July 13, 2010, 12:00am PDT
The authors describe an automatic database design tool that exploits correlations between attributes when recommending Materialized Views (MVs) and indexes. Although there is a substantial body of...
Provided by VLDB Endowment
-
White Papers
Fast Sparse MatrixVector Multiplication on GPUs: Implications for Graph Mining
February 20, 2011, 12:00am PST
Scaling up the sparse matrix-vector multiplication kernel on modern Graphics Processing Units (GPU) has been at the heart of numerous studies in both academia and industry. In this paper the...
Provided by VLDB Endowment
-
White Papers
Automatic Wrappers for Large Scale Web Extraction
February 20, 2011, 12:00am PST
The authors present a generic framework to make wrapper induction algorithms tolerant to noise in the training data. This enables one to learn wrappers in a completely unsupervised manner from...
Provided by VLDB Endowment
-
White Papers
High Throughput Transaction Executions on Graphics Processors
February 24, 2011, 12:00am PST
OLTP (On-Line Transaction Processing) is an important business system sector in various traditional and emerging online services. Due to the increasing number of users, OLTP systems require high...
Provided by VLDB Endowment
-
White Papers
Incrementally Maintaining Classification Using an RDBMS
February 24, 2011, 12:00am PST
The proliferation of imprecise data has motivated both researchers and the database industry to push statistical techniques into Relational DataBase Management Systems (RDBMSes). The authors study...
Provided by VLDB Endowment
-
White Papers
Distributed Inference and Query Processing for RFID Tracking and Monitoring
February 24, 2011, 12:00am PST
In this paper, the authors present the design of a scalable, distributed stream processing system for RFID tracking and monitoring. Since RFID data lacks containment and location information that...
Provided by VLDB Endowment
-
White Papers
Nearest Neighbor Search With Strong Location Privacy
September 17, 2010, 12:00am PDT
The tremendous growth of the Internet has significantly reduced the cost of obtaining and sharing information about individuals, raising many concerns about user privacy. Spatial queries pose an...
Provided by VLDB Endowment
-
White Papers
Secure Personal Data Servers: A Vision Paper
September 17, 2010, 12:00am PDT
An increasing amount of personal data is automatically gathered and stored on servers by administrations, hospitals, insurance companies, etc. Citizen themselves often count on internet companies...
Provided by VLDB Endowment
-
White Papers
Explore or Exploit? Effective Strategies for Disambiguating Large Databases
October 1, 2009, 12:00am PDT
Data ambiguity is inherent in applications such as data integration, location-based services, and sensor monitoring. In many situations, it is possible to "Clean", or remove, ambiguities from...
Provided by VLDB Endowment
-
White Papers
Swarm: Mining Relaxed Temporal Moving Object Clusters
October 1, 2009, 12:00am PDT
Recent improvements in positioning technology make massive moving object data widely available. One important analysis is to find the moving objects that travel together. Existing methods put a...
Provided by VLDB Endowment
-
White Papers
Database Replication: A Tale of Research Across Communities
October 1, 2009, 12:00am PDT
Replication is a key mechanism to achieve scalability and fault-tolerance in databases. Its importance has recently been further increased because of the role it plays in achieving elasticity at...
Provided by VLDB Endowment
-
White Papers
Techniques for Similarity Searching in Multimedia Databases
October 1, 2009, 12:00am PDT
The representation of multidimensional points and objects, and the development of appropriate indexing methods that enable them to be retrieved efficiently is a well-studied subject. Most of these...
Provided by VLDB Endowment
-
White Papers
Big Data and Cloud Computing: New Wine or Just New Bottles?
October 1, 2009, 12:00am PDT
Cloud computing is an extremely successful paradigm of service oriented computing and has revolutionized the way computing infrastructure is abstracted and used. Three most popular cloud paradigms...
Provided by VLDB Endowment
-
White Papers
Fragments and Loose Associations: Respecting Privacy in Data Publishing
August 11, 2010, 12:00am PDT
The authors propose a modeling of the problem of privacy-compliant data publishing that captures confidentiality constraints on one side and visibility requirements on the other side....
Provided by VLDB Endowment
-
White Papers
Global Detection of Complex Copying Relationships Between Sources
October 1, 2009, 12:00am PDT
Web technologies have enabled data sharing between sources but also simplified copying (and often publishing without proper attribution). The copying relationships can be complex: some sources...
Provided by VLDB Endowment
-
White Papers
Interesting-Phrase Mining for Ad-Hoc Text Analytics
October 1, 2009, 12:00am PDT
Large text corpora with news, customer mail and reports, or Web 2.0 contributions offer a great potential for enhancing business-intelligence applications. The authors propose a framework for...
Provided by VLDB Endowment
-
White Papers
Annotating and Searching Web Tables Using Entities, Types and Relationships
October 1, 2009, 12:00am PDT
Tables are a universal idiom to present relational data. Billions of tables on Web pages express entity references, at-tributes and relationships. This representation of relational world knowledge...
Provided by VLDB Endowment
-
White Papers
TRAMP: Understanding the Behavior of Schema Mappings Through Provenance
October 1, 2009, 12:00am PDT
Though partially automated, developing schema mappings remains a complex and potentially error-prone task. In this paper, the authors present TRAMP (TRAnsformation Mapping Provenance), an...
Provided by VLDB Endowment
-
White Papers
Achieving High Output Quality Under Limited Resources Through Structure based Spilling in XML Streams
October 1, 2009, 12:00am PDT
Because of high volumes and unpredictable arrival rates, stream processing systems are not always able to keep up with input data - resulting in buffer overflow and uncontrolled loss of data. To...
Provided by VLDB Endowment
-
White Papers
Efficient RkNN Retrieval With Arbitrary NonMetric Similarity Measures
October 1, 2009, 12:00am PDT
A RkNN query returns all objects whose nearest k neighbors contain the query object. In this paper, the authors consider RkNN query processing in the case where the distances between attribute...
Provided by VLDB Endowment
-
White Papers
VoRTree: Rtrees With Voronoi Diagrams for Efficient Processing of Spatial Nearest Neighbor Queries
October 1, 2009, 12:00am PDT
A very important class of spatial queries consists of Nearest-Neighbor (NN) query and its variations. Many studies in the past decade utilize R-trees as their underlying index structures to...
Provided by VLDB Endowment
-
White Papers
Tree Indexing on Solid State Drives
October 1, 2009, 12:00am PDT
Large flash disks, or Solid State Drives (SSDs), have become an attractive alternative to magnetic hard disks, due to their high random read performance, low energy consumption and other features....
Provided by VLDB Endowment
-
White Papers
On the Stability of Plan Costs and the Costs of Plan Stability
October 1, 2009, 12:00am PDT
Predicate selectivity estimates are subject to considerable run-time variation relative to their compile-time estimates, often leading to poor plan choices that cause inflated response times. The...
Provided by VLDB Endowment
-
White Papers
Regret Minimizing Representative Databases
October 1, 2009, 12:00am PDT
The authors propose the k-representative regret minimization query (k-regret) as an operation to support multi-criteria decision making. Like top-k, the k-regret query assumes that users have some...
Provided by VLDB Endowment
-
White Papers
Foundations of Uncertain Data Integration
October 1, 2009, 12:00am PDT
Probabilistic databases hold promise of being a viable means for large-scale uncertainty management, increasingly needed in a number of real world applications domains. However, query evaluation...
Provided by VLDB Endowment
-
White Papers
ReadOnce Functions and Query Evaluation in Probabilistic Databases
October 1, 2009, 12:00am PDT
Probabilistic databases hold promise of being a viable means for large-scale uncertainty management, increasingly needed in a number of real world applications domains. However, query evaluation...
Provided by VLDB Endowment
-
White Papers
Querying Probabilistic Information Extraction
October 1, 2009, 12:00am PDT
Recently, there has been increasing interest in extending relational query processing to include data obtained from unstructured sources. A common approach is to use stand-alone Information...
Provided by VLDB Endowment
-
White Papers
Efficient Proximity Detection Among Mobile Users Via Self-Tuning Policies
October 1, 2009, 12:00am PDT
Given a set of users, their friend relationships, and a distance threshold per friend pair, the proximity detection problem is to find each pair of friends such that the Euclidean distance between...
Provided by VLDB Endowment
-
White Papers
Towards the Web of Concepts: Extracting Concepts From Large Datasets
October 1, 2009, 12:00am PDT
Concepts are sequences of words that represent real or imaginary entities or ideas that users are interested in. As a first step towards building a web of concepts that will form the backbone of...
Provided by VLDB Endowment
-
White Papers
Exploiting Content Redundancy for Web Information Extraction
October 1, 2009, 12:00am PDT
The authors propose a novel extraction approach that exploits content redundancy on the web to extract structured data from template-based web sites. They start by populating a seed database with...
Provided by VLDB Endowment
-
Whitepapers
Scalable SPARQL Querying of Large RDF Graphs
September 3, 2011, 12:00am PDT
The generation of RDF data has accelerated to the point where many data sets need to be partitioned across multiple machines in order to achieve reasonable performance when querying the data....
Provided by VLDB Endowment
-
Whitepapers
Proactive Detection and Repair of Data Corruption: Towards a Hasslefree Declarative Approach With Amulet
September 3, 2011, 12:00am PDT
Occasional corruption of stored data is an unfortunate byproduct of the complexity of modern systems. Hardware errors, software bugs, and mistakes by human administrators can corrupt important...
Provided by VLDB Endowment
-
Whitepapers
MapReduce Programming and Cost based Optimization? Crossing This Chasm With Starfish
September 3, 2011, 12:00am PDT
MapReduce has emerged as a viable competitor to database systems in big data analytics. MapReduce programs are being written for a wide variety of application domains including business data...
Provided by VLDB Endowment
-
White Papers
SECRET: A Model for Analysis of the Execution Semantics of Stream Processing Systems
August 11, 2010, 12:00am PDT
There are many academic and commercial Stream Processing Engines (SPEs) today, each of them with its own execution semantics. This variation may lead to seemingly inexplicable differences in query...
Provided by VLDB Endowment
-
White Papers
Summary Graphs for Relational Database Schemas
September 3, 2011, 12:00am PDT
Increasingly complex databases need ever more sophisticated tools to help users understand their schemas and interact with the data. Existing tools fall short of either providing the "Big...
Provided by VLDB Endowment
-
White Papers
Online Data Fusion
September 3, 2011, 12:00am PDT
The Web contains a significant volume of structured data in various domains, but a lot of data are dirty and erroneous, and they can be propagated through copying. While data integration...
Provided by VLDB Endowment
-
White Papers
DivDB: A System for Diversifying Query Results
September 3, 2011, 12:00am PDT
With the availability of very large databases, an exploratory query can easily lead to a vast answer set, typically based on an answer's relevance (i.e., top-k, tf-idf) to the user query....
Provided by VLDB Endowment
-
White Papers
RemusDB: Transparent High Availability for Database Systems
September 3, 2011, 12:00am PDT
In this paper, the authors present a technique for building a High-Availability (HA) DataBase Management System (DBMS). The proposed technique can be applied to any DBMS with little or no...
Provided by VLDB Endowment
-
White Papers
The End of an Architectural Era (It's Time for a Complete Rewrite)
September 28, 2007, 12:00am PDT
In previous papers [SC05, SBC+07], some of one predicted the end of "One size fits all" as a commercial relational DBMS paradigm. These papers presented reasons and experimental evidence that...
Provided by VLDB Endowment
-
White Papers
Efficient Processing of Top-k Spatial Preference Queries
November 8, 2010, 12:00am PST
Top-k spatial preference queries return a ranked set of the k best data objects based on the scores of feature objects in their spatial neighborhood. Despite the wide range of location-based...
Provided by VLDB Endowment
-
White Papers
Monitoring Reverse Top-k Queries Over Mobile Devices
June 12, 2011, 12:00am PDT
Location-based queries are widely employed to retrieve useful information based on the user's geographical position. For example, a tourist that walks around a city may seek points of interest...
Provided by VLDB Endowment
-
White Papers
Efficient Probabilistic Reverse Nearest Neighbor Query Processing on Uncertain Data
July 21, 2011, 12:00am PDT
Given a query object q, a Reverse Nearest Neighbor (RNN) query in a common certain database returns the objects having q as their nearest neighbor. A new challenge for databases is dealing with...
Provided by VLDB Endowment
-
White Papers
PIQL: Success-Tolerant Query Processing in the Cloud
November 30, 2011, 12:00am PST
Newly-released web applications often succumb to a "Success Disaster," where overloaded database machines and resulting high response times destroy a previously good user experience....
Provided by VLDB Endowment
-
White Papers
PAO: Power-Efficient Attribution of Outliers in Wireless Sensor Networks
September 13, 2010, 12:00am PDT
Sensor nodes constitute inexpensive, disposable devices that are often scattered in harsh environments of interest so as to collect and communicate desired measurements of monitored quantities....
Provided by VLDB Endowment
-
White Papers
Efficient B-Tree Based Indexing for Cloud Data Processing
September 17, 2010, 12:00am PDT
There has been an increasing interest in deploying a storage system on Cloud to support applications that require massive scalability and high throughput in storage layer. Examples of such systems...
Provided by VLDB Endowment
-
White Papers
A Framework for Supporting DBMS-Like Indexes in the Cloud
September 3, 2011, 12:00am PDT
To support "Database as a service" (DaaS) in the cloud, the database system is expected to provide similar functionalities as in centralized DBMS such as efficient processing of ad hoc queries....
Provided by VLDB Endowment
-
White Papers
Data Markets in the Cloud: An Opportunity for the Database Community
November 1, 2008, 12:00am PDT
Cloud-computing is transforming many aspects of data management. Most recently, the cloud is seeing the emergence of digital markets for data and associated services. The authors observe that...
Provided by VLDB Endowment
-
White Papers
Graph Indexing of Road Networks for Shortest Path Queries With Label Restrictions
January 15, 2011, 12:00am PST
The current widespread use of location-based services and GPS technologies has revived interest in very fast and scalable shortest path queries. The authors introduce a new shortest path query...
Provided by VLDB Endowment
-
White Papers
Albatross: Lightweight Elasticity in Shared Storage Databases for the Cloud Using Live Data Migration
April 26, 2011, 12:00am PDT
Database systems serving cloud platforms must serve large numbers of applications (or tenants). In addition to managing tenants with small data footprints, different schemas, and variable load...
Provided by VLDB Endowment
-
White Papers
Supporting Realworld Activities in Database Management Systems
August 28, 2009, 12:00am PDT
Databases are integral to many application domains in which the cycle of processing the data is complex and may involve real-world activities that are external to the database, e.g., wet-lab...
Provided by VLDB Endowment
-
White Papers
Dynamic Join Optimization in Multi-Hop Wireless Sensor Networks
September 17, 2010, 12:00am PDT
To enable smart environments and self-tuning data centers, the authors are developing the Aspen system for integrating physical sensor data, as well as stream data coming from machine logical...
Provided by VLDB Endowment
-
White Papers
CoPhy: A Scalable, Portable, and Interactive Index Advisor for Large Workloads
March 9, 2011, 12:00am PST
Index tuning, i.e., selecting the indexes appropriate for a workload, is a crucial problem in database system tuning. In this paper, the authors solve index tuning for large problem instances that...
Provided by VLDB Endowment
-
White Papers
On Dense Pattern Mining in Graph Streams
September 17, 2010, 12:00am PDT
Many massive web and communication network applications create data which can be represented as a massive sequential stream of edges. For example, conversations in a telecommunication network or...
Provided by VLDB Endowment
-
White Papers
Read-Once Functions and Query Evaluation in Probabilistic Databases
September 17, 2010, 12:00am PDT
Probabilistic databases hold promise of being a viable means for large-scale uncertainty management, increasingly needed in a number of real world applications domains. However, query evaluation...
Provided by VLDB Endowment
-
White Papers
Community-Driven Data Grids
August 28, 2008, 12:00am PDT
Beyond already existing huge data volumes, e-science communities face major challenges in managing the anticipated data deluge of forthcoming projects. Community-driven data grids target at...
Provided by VLDB Endowment
-
White Papers
XML-Document-Filtering Automaton
August 28, 2008, 12:00am PDT
In a publish-subscribe system based on filtering of XML documents subscribers specify their interests with profiles expressed in the XPath language. The system processes a stream of XML documents...
Provided by VLDB Endowment
-
White Papers
Querying Web-Based Applications Under Models of Uncertainty
August 28, 2008, 12:00am PDT
Many businesses offer their services to customers via Web-based application interfaces. Reasoning about execution flows of such applications is extremely valuable for companies. Such reasoning...
Provided by VLDB Endowment
-
White Papers
Incompleteness in Information Integration
August 28, 2008, 12:00am PDT
Information integration is becoming a critical problem for both businesses and individuals. The data, especially the one that comes from the Web, is naturally incomplete, that is, some data values...
Provided by VLDB Endowment
-
White Papers
GS-TMS: A Global Stream-Based Threat Monitor System
August 28, 2008, 12:00am PDT
Computer networks have become ubiquitous and integral part of the nation's critical infrastructure. How to grasp the real-time overall situation of the network security is very noteworthy to...
Provided by VLDB Endowment
-
White Papers
Privacy Preserving Document Indexing Infrastructure for a Distributed Environment
August 28, 2008, 12:00am PDT
To carry out work assignments, small groups distributed within a larger enterprise or collaborative community often need to share documents among themselves while shielding those documents from...
Provided by VLDB Endowment
-
White Papers
Adaptive Workflow Scheduling Under Resource Allocation Constraints and Network Dynamics
August 28, 2008, 12:00am PDT
Workflow concepts are well suited for scenarios where many distributed entities work collaboratively together to achieve a common goal. Today, workflows are mostly used as computerized model for...
Provided by VLDB Endowment
-
White Papers
Implementing Filesystems by Tree-Aware DBMSs
August 28, 2008, 12:00am PDT
With the rise of XML, the database community has been challenged by semi-structured data processing. Since the data type behind XML is the tree, state-of-the-art RDBMSs have learned to deal with...
Provided by VLDB Endowment
-
White Papers
Towards Efficient Main-Memory Use For Optimum Tree Index Update
August 28, 2008, 12:00am PDT
An emerging class of database applications is characterized by frequent updates of low-dimensional data, e.g. coming from sensors that sample continuous real world phenomena. Traditional...
Provided by VLDB Endowment
-
White Papers
Mining Patterns and Rules for Software Specification Discovery
August 28, 2008, 12:00am PDT
Software specifications are often lacking, incomplete and outdated in the industry. Lack and incomplete specifications cause various software engineering problems. Studies have shown that program...
Provided by VLDB Endowment
-
White Papers
Studying Interaction Methodologies in Video Retrieval
August 28, 2008, 12:00am PDT
So far, several approaches have been studied to bridge the problem of the Semantic Gap, the bottleneck in image and video retrieval. However, no approach is successful enough to increase retrieval...
Provided by VLDB Endowment
-
White Papers
Challenges and Techniques for Effective and Efficient Similarity Search in Large Video Databases
August 28, 2008, 12:00am PDT
Searching relevant visual information based on content features in large databases is an interesting and changeling topic that has drawn lots of attention from both the research community and...
Provided by VLDB Endowment
-
White Papers
Dwarfs in the Rearview Mirror: How Big Are They Really?
August 28, 2008, 12:00am PDT
OnLine-Analytical Processing (OLAP) has been a field of competing technologies for the past ten years. One of the still unsolved challenges of OLAP is how to provide quick response times on any...
Provided by VLDB Endowment
-
White Papers
A Benchmark for Evaluating Moving Object Indexes
August 28, 2008, 12:00am PDT
Progress in science and engineering relies on the ability to measure, reliably and in detail, pertinent properties of artifacts under design. Progress in the area of database-index design thus...
Provided by VLDB Endowment
-
White Papers
Prefix Based Numbering Schemes for XML: Techniques, Applications and Performances
August 28, 2008, 12:00am PDT
Commonly used in network routing, programming, classification and knowledge representation systems, labeling schemes have also interested the XML community. The authors thus motivate and describe...
Provided by VLDB Endowment
-
White Papers
ColumnStore Support for RDF Data Management: Not All Swans Are White
August 28, 2008, 12:00am PDT
This paper reports on the results of an independent evaluation of the techniques presented in the VLDB 2007 paper "Scalable Semantic Web Data Management Using Vertical Partitioning", authored by...
Provided by VLDB Endowment
Keep Up with TechRepublic
Submit a Paper
Get your content listed in our directory!
Our directory is the largest library of vendor-supplied technical content on the Web. It’s also the first place IT decision makers turn to when researching technology solutions. Our members are already finding your competitors’ papers here - shouldn’t they find yours, too? It's FREE so click here and submit your white paper, case study, data sheet, research report, or other document today!



