Distributed computing is the process of aggregating the power of several computing entities to collaboratively run a single computational task in a transparent and coherent way, so that they appear as a single, centralized system.
Introduction

Distributed computing differs from cluster computing in that computers in a distributed computing environment are typically not exclusively running 'group' tasks, whereas clustered computers are usually much more tightly coupled. This difference makes distributed computing attractive because, when properly configured, it can use computational resources that would otherwise go unused. It can also make available computing capacity that would otherwise be out of reach. For example, the SETI@home project uses 'idle time' on many thousands of computers throughout the world to analyze received signals, an analysis that would have been impossible otherwise. Such arrangements permit handling of data that would otherwise require the power of expensive supercomputers.

Distributed computing is very attractive in part because interactive operation leaves most computers idle most of the time. The process that implements the distributed aspect (i.e., the one running on a machine normally devoted to other work) is usually designed to be a low-priority process, using only computing power that would be 'wasted' anyway. However, a low-priority process that runs constantly prevents operating system power management routines from putting the processor into a low-power mode, which increases electricity consumption. For some (typically recent, high-speed) CPUs, the difference can be on the order of tens of watts.

Distributed computing also often involves competition with other distributed systems. This competition may be for prestige, or it may be a means of enticing users to donate processing power to a specific project. For example, there is the so-called "stat race": a measure of which project has performed the most distributed work over the past day or week.
This competition has proven so important in practice that virtually all distributed computing projects offer online statistical analyses of their performance, updated at least daily, if not in real time.

Distributed computing is also an active area of research with an abundant literature. The best-known distributed computing conferences are the International Conference on Dependable Systems and Networks [1] (http://www.dsn.org/) and the ACM Symposium on Principles of Distributed Computing [2] (http://www.podc.org). Journals include the Journal of Parallel and Distributed Computing [3] (http://www.academicpress.com/jpdc), IEEE Transactions on Parallel and Distributed Systems [4] (http://www.computer.org/tpds/about.htm), and Distributed Computing [5] (http://www.springeroline.com).

The rendering of 3D computer images is often spread across several computers to speed up the process. Such groups of computers are often referred to as render farms.

Goals

There are many different types of distributed computing systems, and many challenges to overcome in successfully architecting one. The main goal of a distributed operating system is to connect users and resources in a transparent, open, and scalable way.

Transparency

Transparency means that a distributed system should hide its distributed nature from its users, appearing and functioning as a normal centralized system. There are many types of transparency:
The degree to which these properties can or should be achieved may vary widely. Not every system can or should hide everything from its users. For instance, because the speed of light is finite, there will always be more latency when accessing resources distant from the user. If one expects real-time interaction with the distributed system, this may be very noticeable.

Openness

Openness is the property of a distributed system that measures the extent to which it offers a standardized interface that allows it to be extended and scaled. A system that easily allows more computing entities to be plugged into it and more features to be added to it has a clear advantage over a perfectly closed and self-contained system. This is usually achieved by using an Interface Definition Language (IDL) that captures the syntax of all services offered by the system.

Scalability

A scalable system is one that can easily be altered to accommodate changes in the number of users, resources and computing entities assigned to it. Scalability can be measured in three different dimensions:
Some loss of performance may occur in a system that allows itself to scale in one or more of these dimensions.

Architecture

Various hardware and software architectures are used for distributed computing. At a lower level, it is necessary to interconnect multiple CPUs with some sort of network, regardless of whether that network is printed onto a circuit board or made up of several loosely coupled devices and cables. At a higher level, it is necessary to interconnect processes running on those CPUs with some sort of communication system.

Hardware

Multiprocessor systems

A multiprocessor system is simply a computer that has more than one CPU on its motherboard. If the operating system is built to take advantage of this, it can run different processes on different CPUs, or different threads belonging to the same process. Over the years, many different multiprocessing options have been explored for use in distributed computing. CPUs can be connected by bus or switch networks, and can use shared memory, their own private RAM, or a hybrid approach. These days, multiprocessor systems are available commercially for end users, and mainstream operating systems like Windows and Linux already have built-in support for them. Additionally, recent Intel CPUs have begun to employ a technology called Hyper-Threading that allows more than one thread to run on the same CPU.

Multicomputer systems

A multicomputer system is a system made up of several independent computers interconnected by a telecommunications network. Multicomputer systems can be homogeneous or heterogeneous. A homogeneous multicomputer is one where all CPUs are similar and are connected by a single type of network. Such systems are often used for parallel computing, a kind of distributed computing in which every computer works on a different part of a single problem.
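The idea of every computer working on a different part of a single problem can be sketched in a few lines of Python. This is only an illustration under stated assumptions: worker threads on one machine stand in for the separate computers, the "problem" is a sum of squares chosen purely for simplicity, and the names `split_into_parts`, `work_on_part` and `solve_in_parallel` are hypothetical, not taken from any particular distributed system.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_parts(items, n_parts):
    """Split items into n_parts contiguous chunks of nearly equal size."""
    size, extra = divmod(len(items), n_parts)
    parts, start = [], 0
    for i in range(n_parts):
        # The first `extra` chunks each take one additional item.
        end = start + size + (1 if i < extra else 0)
        parts.append(items[start:end])
        start = end
    return parts


def work_on_part(part):
    """The work one 'computer' does on its share (assumed: sum of squares)."""
    return sum(x * x for x in part)


def solve_in_parallel(items, n_workers=4):
    """Partition the problem, process each part on a worker, combine results."""
    parts = split_into_parts(items, n_workers)
    # Threads are a stand-in for independent computers on a network.
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partial_results = list(pool.map(work_on_part, parts))
    return sum(partial_results)
```

Calling `solve_in_parallel(list(range(10)), n_workers=3)` splits the ten numbers into three parts, computes each partial sum independently, and adds the partial results together. In a real multicomputer the parts would travel over the network, so the cost of communication would have to be weighed against the time saved.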
Software
Distributed Operating Systems
Network Operating Systems
Middleware
Distributed computing infrastructure
Proprietary
See also
External links
Copyright 2008 WordIQ.com. This article is licensed under the GNU Free Documentation License. It uses material from the Wikipedia article "Distributed processing".