Preview

Researc Paper

Powerful Essays
Open Document
Open Document
38991 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Researc Paper
Efficient Workload and Resource Management in Datacenters

by

Hong Xu

A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy
Graduate Department of Electrical and Computer Engineering
University of Toronto

c Copyright 2013 by Hong Xu

Abstract
E cient Workload and Resource Management in Datacenters
Hong Xu
Doctor of Philosophy
Graduate Department of Electrical and Computer Engineering
University of Toronto
2013
This dissertation focuses on developing algorithms and systems to improve the e ciency of operating mega datacenters with hundreds of thousands of servers. In particular, it seeks to address two challenges: First, how to distribute the workload among the set of datacenters geographically deployed across the wide area? Second, how to manage the server resources of datacenters using virtualization technology?
In the first part, we consider the workload management problem in geo-distributed datacenters. We first present a novel distributed workload management algorithm that jointly considers request mapping, which determines how to direct user requests to an appropriate datacenter for processing, and response routing, which decides how to select a path among the set of ISP links of a datacenter to route the response packets back to a user. In the next chapter, we study some key aspects of cost and workload in geodistributed datacenters that have not been fully understood before. Through extensive empirical studies of climate data and cooling systems, we make a case for temperature aware workload management, where the geographical diversity of temperature and its impact on cooling energy e ciency can be used to reduce the overall cooling energy.
Moreover, we advocate for holistic workload management for both interactive and batch jobs, where the delay-tolerant elastic nature of batch jobs can be exploited to further reduce the energy cost. A consistent 15% to 20% cooling energy



Bibliography: NSDI, 2010. placing jobs at cooling-e cient locations in the data center,” in Proc. USENIX ATC, 2007. [11] D. P. Bertsekas and J. N. Tsitsiklis, Parallel and Distributed Computation: Numerical Methods. Athena Scientific, 1997. [14] S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge University Press, 2004. and services research,” in Proc. USENIX HotCloud, 2009. [18] Z. Cao, Z. Wang, and E. Zegura, “Performance of hashing-based schemes for Internet load balancing,” in Proc. IEEE INFOCOM, 2000. Inte. Netw. Manag. (IM), 2009. Manag. (IM), 2011. [22] Y. Chen, A. Das, W. Qin, A. Sivasubramaniam, Q. Wang, and N. Gautam, “Managing server energy and operational costs in hosting centers,” in Proc. ACM Sigmetrics, 2005. [30] J. Dean, “Underneath the covers at google: Current systems and future directions,” In Google I/O, 2008. [31] J. Dean and S. Ghemawat, “MapReduce: Simplified data processing on large clusters,” in Proc. USENIX OSDI, 2004. Dropbox: Understanding personal cloud storage services,” in Proc. ACM IMC, 2012. precision cooling system sales brochure,” http://tinyurl.com/c7e8qxz, 2012.

You May Also Find These Documents Helpful

  • Good Essays

    The above problem refers to problem of load balancing where time of execution of each task varies at random. Dynamic mapping technique can be used for solving the above problem. In dynamic mapping technique, tasks are managed by Master node and all other nodes that depend on Master for work are called slave nodes.…

    • 496 Words
    • 2 Pages
    Good Essays
  • Better Essays

    References: Attiya, H., & Welch, J. (2004). Distributed Computing: Fundamentals, Simulations, and Advanced Topics . : Wiley-Interscience.…

    • 3954 Words
    • 16 Pages
    Better Essays
  • Powerful Essays

    Mis 535 Course Paper

    • 2829 Words
    • 12 Pages

    A main server that is big enough to route files in it for storage can help a computer and website run faster. Just as an external hard drive can store data on it away from a computer to free up the main hard drive can help a computer run faster. Cloud storage operates in the same concept, instead of having a hard drive, there is a huge server with a lot of storage. In this paper, the company that I have chosen has issues with server space and need a newer solution to developing a better service to the company and its production facilities. I will explain what it will take to boost service speed and help the company to ship products out faster with just a solution in server space.…

    • 2829 Words
    • 12 Pages
    Powerful Essays
  • Satisfactory Essays

    A group of MapReduce jobs G= {0, 1,……g} and a group of Task-Trackers SS = {0,1,…..s}. We also state m and SS to index into the sets of jobs and Task-Trackers. For each TaskTracker S we correlate a series of resources, P = {0,1,….p}. Every resource of Task-Tracker S contains a correlated capacity V. We also take into account the disk bandwidth, memory and CPU capacities for each TaskTracker and our algorithm is designed to contain other resources such as storage capacity. A MapReduce job, (m) contains a group of tasks, called as offering time, that can be shared into map tasks and reduce tasks. Each TaskTracker S gives the cluster a group of job-slots in which tasks can execute. Each job-slot is given a specific job, and the scheduler will…

    • 197 Words
    • 1 Page
    Satisfactory Essays
  • Better Essays

    seabreeze

    • 1203 Words
    • 3 Pages

    Dynamic load balancing and the disaster recovery: “the workloads of the server changes, the virtualization gives the capability for the virtual machines that are more utilizing the resources of the server to be navigated to underutilized servers”. (Burger, 2012) The disaster recovery is the serious element for the IT in sea breeze, because the system crashes could make the large economic losses.…

    • 1203 Words
    • 3 Pages
    Better Essays
  • Good Essays

    Regan, P. (2004). Chapter 12: Introduction to Routers and Routing in Wide Area Networks. Prentice-Hall.…

    • 554 Words
    • 3 Pages
    Good Essays
  • Powerful Essays

    Akamai Case

    • 1562 Words
    • 7 Pages

    a. First-mile infrastructure having to cope with heavy traffic loads as a Web site’s audience grew.…

    • 1562 Words
    • 7 Pages
    Powerful Essays
  • Good Essays

    Mary Kay Case Study

    • 970 Words
    • 4 Pages

    Mary Kay is a cosmetics business which faced the problem that every company wished they had. They were growing so quickly that the system they have in place was no longer a viable option to process the orders they had coming in, which ranged from 25-30,000 per day. At the outset of the company, they had a decentralized system. This meant that the orders were scattered across 4 different mediums: web, phone, mail, and desktop computers, and many mistakes were made as a result. With over 850,000 IBCs each having the capability to create their websites (150,000 created), the burdens on the decentralized servers was becoming too large. So, Mary Kay switched to a centralized system which could handle the increased need for storage and could manage the network. With the change from Compaq Alpha servers to the Unisys Enterprise Server ES7000 mainframe with 32 Intel processors, Mary Kay found that IBCs could enter orders in half the time and was set up to simplify the management of online operations as sales grew. But, there is always the underlying question of whether implementing a centralized system was the best option for Mary Kay. There is no doubt that the decentralized system needed to be changed, but Galen Shreck states that “a lot of data centers can’t be broken up to run multiple tasks at the same time.” Also, the system was chosen not necessarily because it was best, but because it was convenient to stay with the Windows 2000 operating system with the transition from servers to mainframe. Instead of choosing the centralized mainframe system, which is already outdated in today’s technology world, I would have chosen to implement a distributive network infrastructure. For many reasons that I will illustrate later, the centralized system will prove to be inefficient for the tasks that Mary Kay wishes to perform in the future.…

    • 970 Words
    • 4 Pages
    Good Essays
  • Powerful Essays

    Cloud Computing

    • 1763 Words
    • 7 Pages

    IT departments and infrastructure providers are under increasing pressure to provide computing infrastructure at the lowest possible cost. In order to do this, the concepts of resource pooling, virtualization, dynamic provisioning, utility and commodity computing must be leveraged to create a public or private cloud that meets these needs. Cloud computing is a general term for anything that involves delivering hosted services over the Internet. This provides the smaller companies or individuals who couldn’t able to buy costly software or any other resources. This becomes easy because of cloud computing. Cloud promises real costs savings and agility to customers. It’s a ‘Pay for Usage’ plan. We pay the money based on our usage only. Through cloud computing, a company can rapidly deploy applications where the underlying technology components. Access to applications and data anywhere, any Time, from any device is the potential outcome of cloud computing. This is suitable technology for limited budgets and a highly dynamic market with minimal resources.…

    • 1763 Words
    • 7 Pages
    Powerful Essays
  • Powerful Essays

    Business Trend Memo

    • 1299 Words
    • 6 Pages

    Erenben, C. (2009), “Cloud computing: the economic imperative”, eSchool News, 13, 9-26. Retrieved from http://www.eschoolnews.com/emails/esntoday/esntoday061509.htm.…

    • 1299 Words
    • 6 Pages
    Powerful Essays
  • Powerful Essays

    It is stated that the target market is seeking four basic servers. For providing consistency in the following pricing overview, the pricing shall be following that of four basic servers. Making use of the conservative method, this equals two Atlantic Bundles. As it does not improve performance by four times on every application, I shall make use of the ‘conservative method’ (two Tronn servers equal four Zink servers).…

    • 1316 Words
    • 6 Pages
    Powerful Essays
  • Good Essays

    After reading the case study of Guardian Life’s data center consolidation, there are a few elements that are worthy of discussion; the first is bandwidth and high-speed communication, the second is cloud computing as a consolidation option, and third, mobile or “pod” datacenters as consolidation options.…

    • 839 Words
    • 3 Pages
    Good Essays
  • Powerful Essays

    Data Center

    • 1549 Words
    • 7 Pages

    The goal of any data center is to provide continuous availability of all network services. Therefore, whether discussing the network design strategy or the facility design strategy the overall philosophy remains the same: Keep it as simple as possible Design for scalability Utilize modularity wherever possible Be flexible and adaptable to change…

    • 1549 Words
    • 7 Pages
    Powerful Essays
  • Good Essays

    Because of ET’s highly reliable infrastructure, ET clients are assured of speedy, reliable, and consistency first delivery attempt. ET is unparalleled in getting the mail out the door…

    • 676 Words
    • 3 Pages
    Good Essays
  • Good Essays

    Grid Computing is a technique in which the idle systems in the Network and their “ wasted “ CPU cycles can be efficiently used by uniting pools of servers, storage systems and networks into a single large virtual system for resource sharing dynamically at runtime. Grid computing combines computers from multiple administrative domains to reach a common goal [1] These systems can be distributed across the globe; they're heterogeneous (some PCs, some servers, maybe mainframes and supercomputers); somewhat autonomous (a Grid can potentially access resources in different organizations).…

    • 2100 Words
    • 9 Pages
    Good Essays