Worcester Polytechnic Institute Electronic Theses and Dissertations Collection

Title page for ETD etd-050409-131957


Document Typethesis
Author NameWang, Di
Email Address wangdi
URNetd-050409-131957
TitleQuery Optimization for Database Federation Systems
DegreeMS
DepartmentComputer Science
Advisors
  • Murali Mani, Advisor
  • Elke A. Rundensteiner , Reader
  • Keywords
  • database federation
  • query optimization
  • Date of Presentation/Defense2009-04-30
    Availability unrestricted

    Abstract

    Database federation is one approach to data integration, in which a middleware, called mediator, provides uniform access to a number of heterogeneous data sources. In this thesis, we focus on the query optimization for distributed joins over database federation. One important observation in query optimization over distributed database system is that run-time conditions (namely available buffer size, CPU utilization in machine and network environment) can significantly affect the execution cost of a query plan. However, in existing database federation systems, very few studies have addressed run-time conditions. It is a challenging problem, because usually the mediator is not able to know the run-time conditions of remote sites and considering run-time conditions will bring about extra complexity to the optimizer.

    This thesis proposes the Cluster-and-Conquer algorithm for query optimization over database federation while efficiently considering run-time conditions. This algorithm has three-fold benefits. Firstly, the run-time conditions of machines are now available for cluster mediator. Secondly, each cluster mediator can deal with its own sub query concurrently, so the complexity of processing query plan is decreased. Thirdly, the algorithm outperforms other related approaches in terms of “cost of costing”, because it removes unnecessary inter-cluster operations in the early stage.

    I have implemented a prototype data federation system with Cluster-and-Conquer algorithm. The experimental results showed the capabilities and efficiency of our algorithm and described the target scenarios where the algorithm performs better than other related approaches.

    Files
  • dwang.pdf

  • Browse by Author | Browse by Department | Search all available ETDs

    [WPI] [Library] [Home] [Top]

    Questions? Email etd-questions@wpi.edu
    Maintained by webmaster@wpi.edu