Indiana University

Displaying 1-40 of 160 results

  • White Papers // Jun 2014

    SPARQL Query Optimization for Structural Indexed RDF Data

    Resource description framework, RDF, is a standard language model for representing semantic data. As the concept of semantic Web becomes more viable, the ability to retrieve and exchange semantic data will become increasingly more important. Efficient management of RDF data is one of the key research issues in semantic Web;...

    Provided By Indiana University

  • White Papers // Jun 2014

    The FutureGrid Testbed for Big Data

    In this paper the authors will be introducing FutureGrid, which provides a testbed to conduct research for cloud, grid, and high performance computing. Although FutureGrid has only a modest number of compute cores (about 4500 regular cores and 14000 GPU cores) it provides an ideal playground to test out various...

    Provided By Indiana University

  • White Papers // Mar 2014

    Towards Understanding Cloud Usage through Resource Allocation Analysis on XSEDE

    In shared resource environments, usage data is necessary to identify utilization of the infrastructure by users. Many cloud platforms recently started to collect measurements for use of resources that can be applied to billing and monitoring. Understanding utilization and performance through these measurements is crucial in the infrastructure in order...

    Provided By Indiana University

  • White Papers // Mar 2014

    Evaluation of Java Message Passing in High Performance Data Analytics

    In the last few years, Java gain popularity in processing \"Big data\" mostly with Apache big data stack - a collection of open source frameworks dealing with abundant data, which includes several popular systems such as Hadoop, Hadoop Distributed File System (HDFS), and spark. Efforts have been made to introduce...

    Provided By Indiana University

  • White Papers // Mar 2014

    Converting Data to Task-Parallelism by Rewrites: Purely Functional Programs Across Multiple GPUs and CPUs

    High-level domain-specific-languages for array processing on the GPU are increasingly common, but to date they run only on a single GPU. The authors argue that languages will need to target multiple devices, even simultaneous combinations of GPU/GPU and CPU/GPU. Increased flexibility may be key to making these languages more easily...

    Provided By Indiana University

  • White Papers // Feb 2014

    Supporting Queries and Analyses of Large-Scale Social Media Data with Customizable and Scalable Indexing Techniques over NoSQL Databases

    Social media data analysis demonstrates two special characteristics in big data processing. First, most analyses focus on data subsets related to specific social events or activities, instead of the whole data set. Second, analysis workflows consist of multiple stages, and algorithms applied in each stage may use different computation and...

    Provided By Indiana University

  • White Papers // Feb 2014

    Advanced Virtualization Techniques for High Performance Cloud Cyberinfrastructure

    With the advent of virtualization and Infrastructure-as-a-Service (IaaS), the broader scientific computing community is considering the use of clouds for their scientific computing needs. This is due to the relative scalability, ease of use, advanced user environment customization abilities, and the many novel computing paradigms available for data-intensive applications. However,...

    Provided By Indiana University

  • White Papers // Jan 2014

    Towards a Collective Layer in the Big Data Stack

    During the last decade three largely industry-driven disruptive trends have altered the landscape of scalable parallel computing, which has long been dominated by the HPC applications. These disruptions are the emergence of data intensive computing (aka big data), the new emergence of commodity cluster-based execution & storage frameworks such as...

    Provided By Indiana University

  • White Papers // Oct 2013

    Parallel Deterministic Annealing Clustering and its Application to LC-MS Data Analysis

    The authors present a scalable parallel deterministic annealing formalism for clustering with cutoffs and position dependent variances. They apply it to the \"Peak matching\" problem of the precise identification of the common LC-MS peaks across a cohort of multiple biological samples in proteomic biomarker discovery. They find reliably and automatically...

    Provided By Indiana University

  • White Papers // Sep 2013

    Optimizing OpenCL Kernels for Iterative Statistical Applications on GPUs

    The authors present a study of three important kernels that occur frequently in iterative statistical applications: k-means, Multi-Dimensional Scaling (MDS), and PageRank. They implemented each kernel using OpenCL and evaluated their performance on an NVIDIA Tesla GPGPU card. By examining the underlying algorithms and empirically measuring the performance of various...

    Provided By Indiana University

  • White Papers // Jul 2013

    An Overview of Present NoSQL Solutions and Features

    NoSQL database is an emerging research topic as the amount of stored digital information is dramatically growing each minute. In the authors' current era of extreme data scales, NoSQL meets the requirements of the large-scale distributed computing environment, which provides scalability, high availability, high performance and reliability. NoSQL solutions share...

    Provided By Indiana University

  • White Papers // Jun 2013

    Bootstrapping Trust in Online Dating: Social Verification of Online Dating Profiles

    Online dating is an increasingly thriving business which boasts billion dollar revenues and attracts users in the tens of millions. Notwithstanding its popularity, online dating is not impervious to worrisome trust and privacy concerns raised by the disclosure of potentially sensitive data as well as the exposure to self-reported (and...

    Provided By Indiana University

  • White Papers // May 2013

    A Robust and Scalable Solution for Interpolative Multidimensional Scaling With Weighting

    Advances in modern bio-sequencing techniques have led to a proliferation of raw genomic data that enables an unprecedented opportunity for data mining. To analyze such large volume and high-dimensional scientific data, many high performance dimension reduction and clustering algorithms have been developed. Among the known algorithms, the authors use Multi-Dimensional...

    Provided By Indiana University

  • White Papers // Apr 2013

    Mammoth Data in the Cloud: Clustering Social Images

    Social image datasets have grown to dramatic size with images classified in vector spaces with high dimension (512-2048) and with potentially billions of images and corresponding classification vectors. The authors study the challenging problem of clustering such sets into millions of clusters using iterative MapReduce. They introduce a new K-means...

    Provided By Indiana University

  • White Papers // Apr 2013

    Co-processing SPMD Computation on GPUs and CPUs on Shared Memory System

    Heterogeneous parallel system with multi processors and accelerators are becoming ubiquitous due to better cost-performance and energy-efficiency. These heterogeneous processor architectures have different instruction sets and are optimized for either task-latency or throughput purposes. Challenges occur in regard to programmability and performance when executing SPMD computations on heterogeneous architectures simultaneously....

    Provided By Indiana University

  • White Papers // Mar 2013

    Android Provenance: Diagnosing Device Disorders

    Mobile devices are a ubiquitous part of the people daily lives. Smartphones are being used in many areas where data privacy and integrity are a concern. One threat to integrity and privacy is the existence of bugs in operating system code. Little has been done to provide tools for system-wide...

    Provided By Indiana University

  • White Papers // Nov 2012

    Survey on High Productivity Computing Systems (HPCS) Languages

    Parallel languages have been focused towards performance, but it alone is not be sufficient to overcome the barrier of developing software that exploits the power of evolving architectures. DARPA initiated High Productivity Computing Systems (HPCS) languages project as a solution which addresses software productivity goals through language design. The resultant...

    Provided By Indiana University

  • White Papers // Nov 2012

    FRIEDA: Flexible Robust Intelligent Elastic Data Management in Cloud Environments

    Scientific applications are increasingly using cloud resources for their data analysis workflows. However, managing data effectively and efficiently over these cloud resources is challenging due to the myriad storage choices with different performance, cost trade-offs, complex application choices and complexity associated with elasticity, failure rates in these environments. The explosion...

    Provided By Indiana University

  • White Papers // Sep 2012

    PlaceRaider: Virtual Theft in Physical Spaces with Smartphones

    As Smartphone become more pervasive, they are increasingly targeted by malware. At the same time, each new generation of Smartphone features increasingly powerful onboard sensor suites. A new strain of 'Sensor malware' has been developing that leverages these sensors to steal information from the physical environment - e.g., researchers have...

    Provided By Indiana University

  • White Papers // Aug 2012

    Performance Model for Parallel Matrix Multiplication with Dryad: Dataflow Graph Runtime

    In order to meet the big data challenge of today's society, several parallel execution models on distributed memory architectures have been proposed: MapReduce, Iterative MapReduce, graph processing, and dataflow graph processing. Dryad is a distributed data-parallel execution engine that model program as dataflow graphs. In this paper, the authors evaluated...

    Provided By Indiana University

  • White Papers // Aug 2012

    The Design and Implementation of a Multi-Level Content-Addressable Checkpoint File System

    Long-running HPC applications guard against node failures by writing checkpoints to parallel file systems. Writing these checkpoints with petascale class machines has proven difficult and the increased concurrency demands of exascale computing will exacerbate this problem. To meet check-pointing demands and sustain application-perceived throughput at exascale, multi-tiered hierarchical storage architectures...

    Provided By Indiana University

  • White Papers // Jul 2012

    Bridging the Gap Between HPC and IaaS

    With the advent of virtualization and Infrastructure-as-a-Service (IaaS), the broader scientific computing community is considering the use of clouds for their technical computing needs. This is due to the relative scalability, ease of use, advanced user environment customization abilities, and the many novel computing paradigms available for data-intensive applications. However,...

    Provided By Indiana University

  • White Papers // Jul 2012

    Design of a Dynamic Provisioning System for a Federated Cloud and Bare-Metal Environment

    The authors present the design of a dynamic provisioning system that is able to manage the resources of a federated cloud environment by focusing on their utilization. With their framework, it is not only possible to allocate resources at a particular time to a specific Infrastructure as a service framework,...

    Provided By Indiana University

  • White Papers // Jun 2012

    My Privacy Policy: Exploring End-User Specification of Free-Form Location Access Rules

    The increasing inclusion of location and other contextual information in social media applications requires users to be more aware of what their location disclosures reveal. As such, it is important to consider whether existing access-control mechanisms for managing location sharing meet the needs of today's users. The authors report on...

    Provided By Indiana University

  • White Papers // Jun 2012

    Scalable Parallel Computing on Clouds Using Twister4Azure Iterative MapReduce

    Recent advances in data intensive computing for science discovery are fueling a dramatic growth in the use of data-intensive iterative computations. The utility computing model introduced by cloud computing, combined with the rich set of cloud infrastructure and storage services, offers a very attractive environment in which scientists can perform...

    Provided By Indiana University

  • White Papers // Jun 2012

    Large Scale Classification Based on Combination of Parallel SVM and Interpolative MDS

    With the development of information technology, the scale of electronic data becomes larger and larger. Data deluge occurs in many kinds of application fields. How to explore the useful information from the large scale dataset is a very important issue. Data mining is just to take on the task. Support...

    Provided By Indiana University

  • White Papers // Jun 2012

    A Parallel Clustering Method Study Based on MapReduce

    Clustering is considered as the most important task in data mining. The goal of clustering is to determine the intrinsic grouping in a set of unlabeled data. Many practical application problems should be solved with clustering method. It has been widely applied into all kinds of areas, such marketing, biology,...

    Provided By Indiana University

  • White Papers // Jun 2012

    Study on Parallel SVM Based on MapReduce

    Support Vector Machines (SVM) is powerful classification and regression tools. They have been widely studied by many scholars and applied in many kinds of practical fields. But their compute and storage requirements increase rapidly with the number of training vectors, putting many problems of practical interest out of their reach....

    Provided By Indiana University

  • White Papers // May 2012

    Improving Resource Utilization in MapReduce

    MapReduce has been adopted widely in both academia and industry to run large-scale data parallel applications. In MapReduce, each slave node hosts a number of task slots to which tasks can be assigned. So they limit the maximum number of tasks that can execute concurrently on each node. When all...

    Provided By Indiana University

  • White Papers // May 2012

    Hiding in Plain Sight: Exploiting Broadcast for Practical Host Anonymity

    Users are being tracked on the Internet more than ever before as Web sites and search engines gather pieces of information sufficient to identify and study their behavior. While many existing schemes provide strong anonymity, they are inappropriate when high bandwidth and low latency are required. In this paper, the...

    Provided By Indiana University

  • White Papers // May 2012

    Sensitive Data Requests: Do Sites Ask Correctly?

    The Web offers unprecedented opportunities for ecommerce. The security of such transactions is commonly provided through the use of the Transport Layer Security (TLS) protocol, the standards track successor of the Secure Sockets Layer (SSL) protocol. TLS allows clients to verify the authenticity of the servers they access and ensures...

    Provided By Indiana University

  • White Papers // May 2012

    Exploitable Redirects on the Web: Identification, Prevalence, and Defense

    Web sites on the Internet often use redirection. Unfortunately, without additional security, many of the redirection links can be manipulated and abused to mask phishing attacks. In this paper, the authors prescribe a set of heuristics to identify redirects that can be exploited. Using these heuristics, they examine the prevalence...

    Provided By Indiana University

  • White Papers // May 2012

    Trust Management Framework for Social Networks

    Inspired by the similarities between human trust and physical measurements, the authors propose a new system of trust metrics, composed by impression and confidence, which captures both human trust level and its uncertainty, while being intuitive and user friendly. Furthermore, based on measurement error propagation theory, they propose an evaluation...

    Provided By Indiana University

  • White Papers // May 2012

    Abstract Image Management and Universal Image Registration for Cloud and HPC Infrastructures

    Cloud computing has become an important driver for delivering Infrastructure as a Service (IaaS) to users with on-demand requests for customized environments and sophisticated software stacks. Within the FutureGrid (FG) project, the authors offer different IaaS frameworks as well as high performance computing infrastructures by allowing users to explore them...

    Provided By Indiana University

  • White Papers // May 2012

    Comparison of Multiple Cloud Frameworks

    Today, many cloud Infrastructure as a Service (IaaS) frameworks exist. Users, developers, and administrators have to make a decision about which environment is best suited for them. Unfortunately, the comparison of such frameworks is difficult because either users do not have access to all of them or they are comparing...

    Provided By Indiana University

  • White Papers // May 2012

    Accelerating Data Transfers In Iterative MapReduce Framework

    MapReduce has become popular in recent years due to its attractive programming interface with scalability and reliability in processing big data problems. Recently several iterative MapReduce frameworks including their Twister system have emerged to improve the performance on many important data mining applications. Utilizing local memory on each compute node...

    Provided By Indiana University

  • White Papers // Mar 2012

    Seed and Grow: An Attack Against Anonymized Social Networks

    Digital traces left by a user of an online social networking service can be abused by a malicious party to compromise the person's privacy. This is exacerbated by the increasing overlap in user-bases among various services. In this paper, the authors propose an algorithm, Seed and Grow, to identify users...

    Provided By Indiana University

  • White Papers // Mar 2012

    Qualitative Comparison of Multiple Cloud Frameworks

    Many cloud infrastructure as a service frameworks exist and users, developers and administrators have to make a decision, which environment is best suited for them. Unfortunately, the comparison of such frameworks is difficult as users may not have access to all of them, or are comparing the performance of such...

    Provided By Indiana University

  • White Papers // Mar 2012

    Towards Cloud Deployments Using FutureGrid

    In this paper, the authors briefly outline some differences between IaaS frameworks Eucalyptus, OpenNebula, OpenStack and Nimbus. They provide also an overview how platforms such as Amazon, Azure, and Google provide additional services to provide more convenient platforms for its users. They then present an overview of what FutureGrid currently...

    Provided By Indiana University

  • White Papers // Mar 2012

    Synchronization Level Specification and Matching of Software Components

    Generating distributed systems from independently developed and deployed components is a promising alternative for today's dynamic and interconnected world. If such components have to collaborate with each other effectively, they must indicate their contracts explicitly. Traditionally, only the syntactical interfaces are depicted for such components. Recent attempts have argued for...

    Provided By Indiana University

  • White Papers // Jan 2005

    Creating An Effective Resume

    Pick up your resume. Where are your thumbs? They are probably at about the midpoint of the page. Most employers spend 15 - 30 seconds when first examining resumes, and only seeing the top half of your resume. Therefore, everything above your thumbs should be the most important information that...

    Provided By Indiana University

  • White Papers // Feb 2010

    High Performance Dimension Reduction and Visualization for Large High-Dimensional Data Analysis

    Large high dimension datasets are of growing importance in many fields and it is important to be able to visualize them for understanding the results of data mining approaches or just for browsing them in a way that distance between points in visualization (2D or 3D) space tracks that in...

    Provided By Indiana University

  • White Papers // Jan 2010

    Semantic Web Portal: A Platform for Better Browsing and Visualizing Semantic Data

    One of the major complaints about Semantic Web technologies is that there are few user-friendly ways for displaying, browsing and querying semantic data. Lacking of effective interfaces for end users can significantly hinder further deployment of the Semantic Web. This paper proposes the Semantic Web Portal (SWP) as a light-weight...

    Provided By Indiana University

  • White Papers // Jun 2009

    High Performance Parallel Computing With Clouds and Cloud Technologies

    Infrastructure services (Infrastructure-as-a-service), provided by cloud vendors, allow any user to provision a large number of compute instances fairly easily. Whether leased from public clouds or allocated from private clouds, utilizing these virtual resources to perform data/compute intensive analyses requires employing different parallel runtimes to implement such applications. Among many...

    Provided By Indiana University

  • White Papers // Aug 2009

    Granules: A Lightweight, Streaming Runtime for Cloud Computing With Support for Map-Reduce

    Cloud computing has gained significant traction in recent years. The Map-Reduce framework is currently the most dominant programming model in cloud computing settings. This paper describes Granules, a lightweight, streaming-based runtime for cloud computing which incorporates support for the Map-Reduce framework. Granules provides rich lifecycle support for developing scientific applications...

    Provided By Indiana University

  • White Papers // Aug 2009

    Supporting Cloud Computing With the Virtual Block Store System

    The fast development of cloud computing systems stimulate the needs for a standalone block storage system, which can provide persistent block storage services to the virtual machines maintained by the clouds. This paper presents the Virtual Block Store (VBS) System, a standalone block storage system built on the basis of...

    Provided By Indiana University

  • White Papers // Oct 2009

    Multicores in Cloud Computing: Research Challenges for Applications

    With modern techniques that allow billions of transistors on a chip, microprocessor design is going to a world of multicore. A cluster of multicores will be commonly used as an efficient computational platform for high performance computing in the near future. Correspondingly, the resource providers, who share their computing elements...

    Provided By Indiana University

  • White Papers // Sep 2009

    Control Motivations And Capital Structure Decisions

    This paper investigates the use of leverage as one channel through which control-motivated blockholders can defend their corporate control. Such blockholders face a trade-off between raising external finance and losing their control over the firm. Debt offers a solution while equity does not because the former does not dilute the...

    Provided By Indiana University

  • White Papers // Jan 2010

    Privacy-Preserving History Mining for Web Browsers

    The authors introduce a new technique that permits servers to harvest selected Internet browsing history from visiting clients. Privacy-Preserving History Mining (PPHM) requires no installation of special-purpose client-side executables. Paradoxically, it exploits a feature in most browsers (IE, Firefox and Safari) regarded for years as a privacy vulnerability. PPHM enables...

    Provided By Indiana University

  • White Papers // Jan 2010

    Querying XML Data: Does One Query Language Fit All?

    This paper describes the characteristics of two different query languages designed to query XML data: DSQL, a declarative SQL like language and XQuery, a procedural language that is fast becoming the defacto language for XML querying. This paper then describes the design of an experiment aimed at comparing the accuracy...

    Provided By Indiana University

  • White Papers // Jul 2010

    Horizontal Mergers Of Online Firms: Structural Estimation And Competitive Effects

    This paper presents a general model of online price competition, shows how to structurally estimate the underlying parameters of the model when the number of competing firms is unknown or in dispute, estimates these parameters based on UK data for personal digital assistants, and uses these estimates to simulate the...

    Provided By Indiana University

  • White Papers // Aug 2010

    Personalized Modeling for SaaS Based on Extended WSCL

    Software as a Service (SaaS) is an emerging software framework in which business data and logic typically integrate with other applications. It requires a unified subscriber to describe SaaS to make for easy integration; however, SaaS provides services to different tenants by running only one instance. In order to satisfy...

    Provided By Indiana University

  • White Papers // Dec 2010

    Trade Costs And Economic Development

    The authors test the hypothesis of the circular causality between trade costs and degree of economic development using data on Italian provinces. Using different methods to control for multilateral resistance, they apply a gravity equation to estimate sectoral exports to 188 countries over the period 1995-2004. Provincial trade costs are...

    Provided By Indiana University

  • White Papers // Aug 2010

    The Gravity Equation In International Economics And International Business Research: A Note

    This paper discusses methodological issues and practical concerns for international economists and international business scholars who apply the gravity equation in their research. The most important message of the note is that this equation should correct for multilateral resistance factors. The authors propose a relatively low-cost specification and estimation to...

    Provided By Indiana University

  • White Papers // Jan 2010

    Environmental Inspection Proclivity And State Manufacturing Growth: The US Experience From The 1990s

    In this paper, the authors construct a ranking of states based on their proclivity to inspect facilities for environmental compliance. The measure utilizes state-level inspections data supplied by the US Environmental Protection Agency. After developing the ranking, they use it to predict state-level growth in manufacturing establishments. In doing so,...

    Provided By Indiana University

  • White Papers // Jan 2011

    Do Minimum Quality Standards Improve Quality Of Care? A Case Study Of The Nursing Home Industry

    This paper estimates the impact of minimum staffing requirements on the nursing home market using a unique national panel over the 1996-2005 period. This paper reveals that, given a half-hour increase in the minimum nursing hours per resident day for licensed nurses, quality of patient care increases by 25 percent....

    Provided By Indiana University

  • White Papers // Jun 2009

    International Monies, Special Drawing Rights, And Supernational Money

    The current International Monetary System (IMS) is fragile because the dollar standard is rapidly deteriorating. The dual role the dollar as the dominant international money and national money cannot be easily reconciled because the US monetary authorities face a conflict between pursuing domestic objectives of employment and inflation and maintaining...

    Provided By Indiana University

  • White Papers // Sep 2009

    Implications Of Grade Inflation: Knowledge Illusion And Economic Inefficiency In The Knowledge Market

    In this paper, the authors adopt the neoclassical model of consumer choice and view students as a utility maximizer to investigate two implied issues caused by grade inflation - knowledge illusion and economic inefficiency in the knowledge market. These issues are important because they negatively impact the quality of higher...

    Provided By Indiana University

  • White Papers // Apr 2010

    Teaching Collaborative Leadership: Ideas And Lessons For The Field

    This paper describes and analyzes a new approach to teaching collaborative leadership to masters of public administration students at the Maxwell School of Syracuse University. The three year old course teaches students how to design a collaborative network with the necessary players at the table; structure governance for a collaborative...

    Provided By Indiana University

  • White Papers // Jan 2011

    Schema-Independent and Schema-Friendly Scientific Metadata Management

    Computational science is creating a deluge of data, and being able to reuse this data requires detailed descriptive metadata. Scientific communities have developed detailed metadata schemas to describe data products, but this metadata must be captured as workflows execute. The authors' research has identified characteristics of scientific schemas that can...

    Provided By Indiana University

  • White Papers // Apr 2011

    Sigiri: Uniform Abstraction for Large-Scale Compute Resource Interactions

    Scientists who conduct mid-range computationally heavy modeling and analysis often scramble to find sufficient computational resources to test and run their codes. The science they carry out is not petascale or even terascale science but the computational needs often go beyond what can be satisfied by their university. With the...

    Provided By Indiana University

  • White Papers // Sep 2009

    Inducing Relatedness Graphs for Data Integration

    In this paper, the authors present the AbsMatcher system for schema matching which uses a graph based approach. AbsMatcher creates a graph of related attributes within a schema, mines similarity between attributes in different schemas, and then combines all information using the ABSURDIST graph matching algorithm. The focus of this...

    Provided By Indiana University

  • White Papers // Aug 2009

    Sigiri: Towards A Light-Weight Job Management System for Large Scale Systems

    e-Science applications are often compute and data intensive, requiring large-scale compute systems for execution. Large-scale systems, however, support a variety of resource management interfaces that an end user must adapt to for compute job submission and management. Grid middleware solutions abstract these heterogeneous resource managers and offer a single unified...

    Provided By Indiana University

  • White Papers // Jul 2009

    Modularizing Data Mining: A Case Study Framework

    This paper presents the fundamental concepts underpinning MoLS, a framework for exploring and applying many variations of algorithms for one datamining problem: mining a database relation for Approximate Functional Dependencies (AFDs). An engineering approach to AFD mining suggests a framework which can be customized with plug-ins, yielding targetability and improved...

    Provided By Indiana University

  • White Papers // Apr 2011

    A Study of RDB-Based RDF Data Management Techniques

    RDF has gained great interest in both academia and industry as an important language to describe graph data. Several approaches have been proposed for storing and querying RDF data efficiently; each work best under certain circumstances, e.g. certain types of data and/or queries. However, there was lack of a thorough...

    Provided By Indiana University

  • White Papers // Jan 2010

    Electronic Data Interchange and Enterprise Resource Planning Technology in Supply-Chain Contracts

    The authors examine a model of Information-Sharing contracting with a purchaser that desires to acquire as much of a product as possible at as low a price as possible. The supplier in their model has private information about its limited capacity. They compare two models of information. In the first,...

    Provided By Indiana University

  • White Papers // Oct 2010

    Pythia: A Privacy Aware, Peer-to-Peer Network for Social Search

    Social search systems such as Aardvark and Facebook Questions allow users to pose questions to their social network in real time. Upon receiving a question on a particular topic, Aardvark for example forwards the question to available "Experts" close to the querier in the social network to facilitate immediate, relevant...

    Provided By Indiana University

  • White Papers // Sep 2009

    A Unified Approach to Intra-Domain Security

    While a variety of mechanisms have been developed for securing individual intra-domain protocols, none address the issue in a holistic manner. The authors develop a unified framework to secure prominent networking protocols within a single domain. They begin with a secure version of the DHCP protocol, which has the additional...

    Provided By Indiana University

  • White Papers // Jan 2011

    Malicious Hubs: Detecting Abnormally Malicious Autonomous Systems

    While many attacks are distributed across botnets, investigators and network operators have recently targeted malicious networks through high profile Autonomous System (AS) de-peerings and network shut-downs. In this paper, the authors explore whether some ASes indeed are safe havens for malicious activity. They look for ISPs and ASes that exhibit...

    Provided By Indiana University

  • White Papers // Aug 2010

    An Internet Without the Internet Protocol

    The growth of the Internet has brought about many challenges for its critical infrastructure. The DNS infrastructure, which translates mnemonic host names into IP addresses understood by the routers, is frequently the target of cache poisoning attacks. Internet routers are also experiencing alarming growth in their routing table sizes, which...

    Provided By Indiana University

  • White Papers // Feb 2011

    Prevalence and Mitigation of Forum Spamming

    Forums on the Web are increasingly spammed by miscreants in order to attract visitors to their (often malicious) websites. In this paper, the authors study the prevalence of forum spamming and find that Internet users are at a high risk of encountering forums with spam links posted on them. To...

    Provided By Indiana University

  • White Papers // Mar 2011

    Measured Characteristics of FutureGrid Clouds for Scalable Collaborative Sensor-Centric Grid Applications

    The emergence of cloud technology has raised a renewed emphasis on the issue of scalable on-demand computing. Cloud back-end support of small devices such as sensors and mobile phones is one important application. The authors report the preliminary study of measured characteristics of distributed cloud computing infrastructure for collaboration of...

    Provided By Indiana University

  • White Papers // Mar 2011

    Toward Securing Sensor Clouds

    The authors aim to secure smart sensor networks, where computationally powerful sensing devices such as smartphones or cognitive radios interact with the cloud. In previous work, they have proposed a large-scale brokering framework, and they are researching several facets of securing sensors in the context of this framework. In this...

    Provided By Indiana University

  • White Papers // Feb 2011

    A Hierarchical Framework for Cross-Domain MapReduce Execution

    The MapReduce programming model provides an easy way to execute pleasantly parallel applications. Many data-intensive life science applications fit this programming model and benefit from the scalability that can be delivered using this model. One such application is AutoDock, which consists of a suite of automated tools for predicting the...

    Provided By Indiana University

  • White Papers // May 2011

    FutureGrid Image Repository: A Generic Catalog and Storage System for Heterogeneous Virtual Machine Images

    FutureGrid (FG) is an experimental, high-performance testbed that supports HPC, cloud and grid computing experiments for both application and computer scientist. FutureGrid includes the use of virtualization technology to allow the support of a wide range of operating systems in order to include a testbed for various cloud computing infrastructure...

    Provided By Indiana University

  • White Papers // Mar 2010

    Analysis of Virtualization Technologies for High Performance Computing Environments

    As Cloud computing emerges as a dominant paradigm in distributed systems, it is important to fully understand the underlying technologies that make clouds possible. One technology, and perhaps the most important, is virtualization. Recently virtualization, through the use of hypervisors, has become widely used and well understood by many. However,...

    Provided By Indiana University

  • White Papers // Dec 2009

    OASIS: A Service-Oriented Architecture for Dynamic Instrumentation of Enterprise Distributed Real-Time and Embedded Systems

    Performance analysis tools for enterprise Distributed Real-time and Embedded (DRE) systems require instrumenting heterogeneous sources (such as application- and system-level hardware and software resources). Traditional techniques for software instrumentation of such systems, however, are tightly coupled to system design and metrics of interest. It is therefore hard for system testers...

    Provided By Indiana University

  • White Papers // Nov 2010

    A Survery of Open-Source Cloud Infrastructure Using FutureGrid Testbed

    Cloud infrastructure management is now an emerging problem in cloud computing. Open source projects such as Nimbus, Eucalyptus and OpenNebula provide the great opportunity for industries, researchers and scientists to understand the modern cloud infrastructure model. Here, the authors' project aims to survey among these three open source cloud infrastructure...

    Provided By Indiana University

  • White Papers // Aug 2011

    Portable Parallel Programming on Cloud and HPC: Scientific Applications of Twister4Azure

    Recent advancements in data-intensive computing for science discovery are fueling a dramatic growth in use of data-intensive iterative computations. The utility computing model introduced by cloud computing combined with the rich set of cloud infrastructure services offers a very attractive environment for scientists to perform such data intensive computations. The...

    Provided By Indiana University

  • White Papers // Sep 2009

    Rescuing Banks From The Effects Of The Financial Crisis

    This paper examines government policies aimed at rescuing banks from the effects of the great financial crisis of 2007-2009. To delimit the scope of the analysis, the authors concentrate on the fiscal side of interventions and ignore, by design, the monetary policy reaction to the crisis. The policy response to...

    Provided By Indiana University