
Distributed computing is a field of computer science that studies distributed systems, defined as computer systems whose inter-communicating components are located on different networked computers.[1][2]

The components of a distributed system communicate and coordinate their actions by passing messages to one another in order to achieve a common goal. Three significant challenges of distributed systems are: maintaining concurrency of components, overcoming the lack of a global clock, and managing the independent failure of components.[1] When a component of one system fails, the entire system does not fail.[3] Examples of distributed systems vary from SOA-based systems to microservices to massively multiplayer online games to peer-to-peer applications. Distributed systems cost significantly more than monolithic architectures, primarily due to increased needs for additional hardware, servers, gateways, firewalls, new subnets, proxies, and so on.[4] Distributed systems are also prone to the fallacies of distributed computing. On the other hand, a well-designed distributed system is more scalable, more durable, more changeable and more fine-tuned than a monolithic application deployed on a single machine.[5] According to Marc Brooker, "a system is scalable in the range where marginal cost of additional workload is nearly constant." Serverless technologies fit this definition, but the total cost of ownership, not just the infrastructure cost, must be considered.[6]

A computer program that runs within a distributed system is called a distributed program,[7] and distributed programming is the process of writing such programs.[8] There are many different types of implementations for the message passing mechanism, including pure HTTP, RPC-like connectors and message queues.[9]

Distributed computing also refers to the use of distributed systems to solve computational problems. In distributed computing, a problem is divided into many tasks, each of which is solved by one or more computers,[10] which communicate with each other via message passing.[11]

Introduction


The word distributed in terms such as "distributed system", "distributed programming", and "distributed algorithm" originally referred to computer networks where individual computers were physically distributed within some geographical area.[12] The terms are nowadays used in a much wider sense, even referring to autonomous processes that run on the same physical computer and interact with each other by message passing.[11]

While there is no single definition of a distributed system,[13] the following defining properties are commonly used:

  • There are several autonomous computational entities (computers or nodes), each of which has its own local memory.[14]
  • The entities communicate with each other by message passing.[15]

A distributed system may have a common goal, such as solving a large computational problem;[16] the user then perceives the collection of autonomous processors as a unit. Alternatively, each computer may have its own user with individual needs, and the purpose of the distributed system is to coordinate the use of shared resources or provide communication services to the users.[17]

Other typical properties of distributed systems include the following:

  • The system has to tolerate failures in individual computers.[18]
  • The structure of the system (network topology, network latency, number of computers) is not known in advance, the system may consist of different kinds of computers and network links, and the system may change during the execution of a distributed program.[19]
  • Each computer has only a limited, incomplete view of the system. Each computer may know only one part of the input.[20]

Patterns


Common architectural patterns used for distributed computing include the following:[21]

Events vs. Messages


In distributed systems, events represent a fact or state change (e.g., OrderPlaced) and are typically broadcast asynchronously to multiple consumers, promoting loose coupling and scalability. While events generally don't expect an immediate response, acknowledgment mechanisms are often implemented at the infrastructure level (e.g., Kafka commit offsets, SNS delivery statuses) rather than being an inherent part of the event pattern itself.[22][23]

In contrast, messages serve a broader role, encompassing commands (e.g., ProcessPayment), events (e.g., PaymentProcessed), and documents (e.g., DataPayload). Both events and messages can support various delivery guarantees, including at-least-once, at-most-once, and exactly-once, depending on the technology stack and implementation. However, exactly-once delivery is often achieved through idempotency mechanisms rather than true, infrastructure-level exactly-once semantics.[22][23]

Delivery patterns for both events and messages include publish/subscribe (one-to-many) and point-to-point (one-to-one). While request/reply is technically possible, it is more commonly associated with messaging patterns rather than pure event-driven systems. Events excel at state propagation and decoupled notifications, while messages are better suited for command execution, workflow orchestration, and explicit coordination.[22][23]

Modern architectures commonly combine both approaches, leveraging events for distributed state change notifications and messages for targeted command execution and structured workflows based on specific timing, ordering, and delivery requirements.[22][23]
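
The distinction can be made concrete in code. Below is a minimal, illustrative sketch, not any particular broker's API: an in-memory bus fans an OrderPlaced event out to several subscribers, while a ProcessPayment command goes to exactly one handler, which is made idempotent so that at-least-once redelivery behaves like exactly-once processing. All names (InMemoryBus, PaymentService) are hypothetical.

```python
import uuid
from collections import defaultdict

class InMemoryBus:
    """Toy broker: topics fan events out to every subscriber (publish/subscribe),
    while queues deliver each command to exactly one handler (point-to-point)."""
    def __init__(self):
        self.subscribers = defaultdict(list)  # topic -> callbacks (one-to-many)
        self.handlers = {}                    # queue -> single callback (one-to-one)

    def subscribe(self, topic, callback):
        self.subscribers[topic].append(callback)

    def publish(self, topic, event):          # event: a fact; no reply is expected
        for callback in self.subscribers[topic]:
            callback(event)

    def register(self, queue, callback):
        self.handlers[queue] = callback

    def send(self, queue, message):           # message: a command for one handler
        self.handlers[queue](message)

class PaymentService:
    """Under at-least-once delivery, redeliveries happen; remembering processed
    message IDs makes the consumer idempotent (exactly-once in effect)."""
    def __init__(self):
        self.processed = set()

    def handle(self, message):
        if message["id"] in self.processed:   # duplicate redelivery: ignore it
            return
        self.processed.add(message["id"])
        print("processing payment for", message["order"])

bus = InMemoryBus()
payments = PaymentService()
bus.register("ProcessPayment", payments.handle)
bus.subscribe("OrderPlaced", lambda e: print("shipping notified:", e["order"]))
bus.subscribe("OrderPlaced", lambda e: print("analytics notified:", e["order"]))

bus.publish("OrderPlaced", {"order": "A-1"})  # event: broadcast to all subscribers
command = {"id": str(uuid.uuid4()), "order": "A-1"}
bus.send("ProcessPayment", command)           # command: routed to one handler
bus.send("ProcessPayment", command)           # simulated redelivery is a no-op
```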

Parallel and distributed computing

(a), (b): a distributed system.
(c): a parallel system.

Distributed systems are groups of networked computers which share a common goal for their work. The terms "concurrent computing", "parallel computing", and "distributed computing" have much overlap, and no clear distinction exists between them.[24] The same system may be characterized both as "parallel" and "distributed"; the processors in a typical distributed system run concurrently in parallel.[25] Parallel computing may be seen as a particularly tightly coupled form of distributed computing,[26] and distributed computing may be seen as a loosely coupled form of parallel computing.[13] Nevertheless, it is possible to roughly classify concurrent systems as "parallel" or "distributed" using the following criteria:

  • In parallel computing, all processors may have access to a shared memory to exchange information between processors.[27]
  • In distributed computing, each processor has its own private memory (distributed memory). Information is exchanged by passing messages between the processors.[28] (A minimal sketch contrasting the two models follows this list.)
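
As a rough illustration of the two criteria, the following Python sketch contrasts workers that coordinate through shared memory with a worker that owns private state and receives only messages. Thread-safe queues stand in for network links here; in a real distributed system the workers would run on separate machines.

```python
import threading, queue

# Parallel style: workers communicate through shared memory.
shared = {"total": 0}
lock = threading.Lock()

def add_shared(n):
    with lock:                      # shared memory requires synchronization
        shared["total"] += n

# Distributed style: the worker owns private state; only messages are exchanged.
def adder(inbox: queue.Queue, outbox: queue.Queue):
    total = 0                       # private memory, invisible to other workers
    while True:
        msg = inbox.get()
        if msg is None:             # sentinel: report the result and stop
            outbox.put(total)
            return
        total += msg

inbox, outbox = queue.Queue(), queue.Queue()
t1 = threading.Thread(target=add_shared, args=(5,))
t2 = threading.Thread(target=adder, args=(inbox, outbox))
t1.start(); t2.start()
for n in (1, 2, 3):
    inbox.put(n)                    # "send a message" instead of writing shared state
inbox.put(None)
t1.join()
print("shared-memory total:", shared["total"])   # 5
print("message-passing total:", outbox.get())    # 6
t2.join()
```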

The figure on the right illustrates the difference between distributed and parallel systems. Figure (a) is a schematic view of a typical distributed system; the system is represented as a network topology in which each node is a computer and each line connecting the nodes is a communication link. Figure (b) shows the same distributed system in more detail: each computer has its own local memory, and information can be exchanged only by passing messages from one node to another by using the available communication links. Figure (c) shows a parallel system in which each processor has a direct access to a shared memory.

The situation is further complicated by the traditional uses of the terms parallel and distributed algorithm that do not quite match the above definitions of parallel and distributed systems (see below for more detailed discussion). Nevertheless, as a rule of thumb, high-performance parallel computation in a shared-memory multiprocessor uses parallel algorithms while the coordination of a large-scale distributed system uses distributed algorithms.[29]

History


The use of concurrent processes which communicate through message-passing has its roots in operating system architectures studied in the 1960s.[30] The first widespread distributed systems were local-area networks such as Ethernet, which was invented in the 1970s.[31]

ARPANET, one of the predecessors of the Internet, was introduced in the late 1960s, and ARPANET e-mail was invented in the early 1970s. E-mail became the most successful application of ARPANET,[32] and it is probably the earliest example of a large-scale distributed application. In addition to ARPANET (and its successor, the global Internet), other early worldwide computer networks included Usenet and FidoNet from the 1980s, both of which were used to support distributed discussion systems.[33]

The study of distributed computing became its own branch of computer science in the late 1970s and early 1980s. The first conference in the field, Symposium on Principles of Distributed Computing (PODC), dates back to 1982, and its counterpart International Symposium on Distributed Computing (DISC) was first held in Ottawa in 1985 as the International Workshop on Distributed Algorithms on Graphs.[34]

Architectures


Various hardware and software architectures are used for distributed computing. At a lower level, it is necessary to interconnect multiple CPUs with some sort of network, regardless of whether that network is printed onto a circuit board or made up of loosely coupled devices and cables. At a higher level, it is necessary to interconnect processes running on those CPUs with some sort of communication system.[35]

Whether these CPUs share resources or not determines a first distinction between three types of architecture: shared memory, shared disk, and shared nothing.

Distributed programming typically falls into one of several basic architectures: client–server, three-tier, n-tier, or peer-to-peer; or categories: loose coupling, or tight coupling.[36]

  • Client–server: architectures where smart clients contact the server for data then format and display it to the users. Input at the client is committed back to the server when it represents a permanent change. (A minimal socket sketch follows this list.)
  • Three-tier: architectures that move the client intelligence to a middle tier so that stateless clients can be used. This simplifies application deployment. Most web applications are three-tier.
  • n-tier: architectures that refer typically to web applications which further forward their requests to other enterprise services. This type of application is the one most responsible for the success of application servers.
  • Peer-to-peer: architectures where there are no special machines that provide a service or manage the network resources.[37]: 227  Instead all responsibilities are uniformly divided among all machines, known as peers. Peers can serve both as clients and as servers.[38] Examples of this architecture include BitTorrent and the bitcoin network.
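
A minimal client–server exchange can be sketched with standard sockets. The host, port, and payload below are illustrative assumptions, and a production server would loop over many connections:

```python
import socket, threading

HOST, PORT = "127.0.0.1", 50007             # illustrative address and port

srv = socket.socket()
srv.bind((HOST, PORT))
srv.listen(1)                               # listening before any client connects

def serve_one():
    conn, _ = srv.accept()
    with conn:
        data = conn.recv(1024)              # input the client commits to the server
        conn.sendall(b"stored: " + data)    # server acknowledges the permanent change

t = threading.Thread(target=serve_one)
t.start()

with socket.socket() as cli:                # the client contacts the server for data
    cli.connect((HOST, PORT))
    cli.sendall(b"order A-1")
    print(cli.recv(1024).decode())          # -> stored: order A-1

t.join()
srv.close()
```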

Another basic aspect of distributed computing architecture is the method of communicating and coordinating work among concurrent processes. Through various message passing protocols, processes may communicate directly with one another, typically in a main/sub relationship. Alternatively, a "database-centric" architecture can enable distributed computing to be done without any form of direct inter-process communication, by utilizing a shared database.[39] Database-centric architecture in particular provides relational processing analytics in a schematic architecture allowing for live environment relay. This enables distributed computing functions both within and beyond the parameters of a networked database.[40]
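
As a toy illustration of the database-centric style, the following sketch coordinates a producer and a worker purely through a shared SQLite database, with no direct inter-process communication. The table layout and file name are assumptions made for the example:

```python
import sqlite3

# Two "processes" (here, two connections to one on-disk database) coordinate
# through the database alone: the producer inserts work, the worker claims it.
db = "shared.db"                            # illustrative file name
producer = sqlite3.connect(db)
producer.execute(
    "CREATE TABLE IF NOT EXISTS tasks (id INTEGER PRIMARY KEY, payload TEXT, state TEXT)")
producer.execute(
    "INSERT INTO tasks (payload, state) VALUES ('resize image 42', 'pending')")
producer.commit()

worker = sqlite3.connect(db)
row = worker.execute(
    "SELECT id, payload FROM tasks WHERE state = 'pending' LIMIT 1").fetchone()
if row:
    task_id, payload = row
    # Claiming the row is the coordination step; no direct IPC occurs.
    worker.execute(
        "UPDATE tasks SET state = 'done' WHERE id = ? AND state = 'pending'", (task_id,))
    worker.commit()
    print("worker handled:", payload)
```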

Cell-based architecture


Cell-based architecture is a distributed computing approach in which computational resources are organized into self-contained units called cells. Each cell operates independently, processing requests while maintaining scalability, fault isolation, and availability.[41][42][43]

A cell typically consists of multiple services or application components and functions as an autonomous unit. Some implementations replicate entire sets of services across multiple cells, while others partition workloads between cells. In replicated models, requests may be rerouted to an operational cell if another experiences a failure. This design is intended to enhance system resilience by reducing the impact of localized failures.[44][45][46]

Some implementations employ circuit breakers within and between cells. Within a cell, circuit breakers may be used to prevent cascading failures among services, while inter-cell circuit breakers can isolate failing cells and redirect traffic to those that remain operational.[47][48][49]

Cell-based architecture has been adopted in some large-scale distributed systems, particularly in cloud-native and high-availability environments, where fault isolation and redundancy are key design considerations. Its implementation varies depending on system requirements, infrastructure constraints, and operational objectives.[50][51][52]
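
A minimal sketch of the replicated-cell idea follows: a router hashes each request to a home cell and fails over to the next healthy cell. All class names are illustrative, and a real implementation would add health probes and proper circuit-breaker state:

```python
import hashlib

class Cell:
    def __init__(self, name):
        self.name, self.healthy = name, True
    def handle(self, request):
        if not self.healthy:
            raise RuntimeError(f"cell {self.name} is down")
        return f"{self.name} served {request}"

class CellRouter:
    """Replicated-cell sketch: requests hash to a home cell for locality, and a
    simple breaker reroutes them when that cell is marked unhealthy."""
    def __init__(self, cells):
        self.cells = cells
    def route(self, request):
        home = int(hashlib.md5(request.encode()).hexdigest(), 16) % len(self.cells)
        order = self.cells[home:] + self.cells[:home]
        for cell in order:              # fall through to the next operational cell
            if cell.healthy:
                return cell.handle(request)
        raise RuntimeError("all cells are down")

cells = [Cell("cell-a"), Cell("cell-b"), Cell("cell-c")]
router = CellRouter(cells)
print(router.route("user-123"))
cells[0].healthy = False                # a localized failure stays inside one cell
print(router.route("user-123"))         # rerouted if user-123's home cell failed
```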

Applications


Reasons for using distributed systems and distributed computing may include:

  • The very nature of an application may require the use of a communication network that connects several computers: for example, data produced in one physical location and required in another location.
  • There are many cases in which the use of a single computer would be possible in principle, but the use of a distributed system is beneficial for practical reasons. For example:
    • It can allow for much larger storage and memory, faster compute, and higher bandwidth than a single machine.
    • It can provide more reliability than a non-distributed system, as there is no single point of failure. Moreover, a distributed system may be easier to expand and manage than a monolithic uniprocessor system.[53]
    • It may be more cost-efficient to obtain the desired level of performance by using a cluster of several low-end computers, in comparison with a single high-end computer.

Examples


Examples of distributed systems and applications of distributed computing include the following:[54]

Reactive distributed systems


According to the Reactive Manifesto, reactive distributed systems are responsive, resilient, elastic and message-driven; as a result, reactive systems are more flexible, loosely coupled and scalable. The Reactive Principles are a set of principles and patterns that help make both cloud native and edge native applications more reactive.[56]

Theoretical foundations


Models


Many tasks that we would like to automate by using a computer are of question–answer type: we would like to ask a question and the computer should produce an answer. In theoretical computer science, such tasks are called computational problems. Formally, a computational problem consists of instances together with a solution for each instance. Instances are questions that we can ask, and solutions are desired answers to these questions.

Theoretical computer science seeks to understand which computational problems can be solved by using a computer (computability theory) and how efficiently (computational complexity theory). Traditionally, it is said that a problem can be solved by using a computer if we can design an algorithm that produces a correct solution for any given instance. Such an algorithm can be implemented as a computer program that runs on a general-purpose computer: the program reads a problem instance from input, performs some computation, and produces the solution as output. Formalisms such as random-access machines or universal Turing machines can be used as abstract models of a sequential general-purpose computer executing such an algorithm.[57][58]

The field of concurrent and distributed computing studies similar questions in the case of either multiple computers, or a computer that executes a network of interacting processes: which computational problems can be solved in such a network and how efficiently? However, it is not at all obvious what is meant by "solving a problem" in the case of a concurrent or distributed system: for example, what is the task of the algorithm designer, and what is the concurrent or distributed equivalent of a sequential general-purpose computer?[citation needed]

The discussion below focuses on the case of multiple computers, although many of the issues are the same for concurrent processes running on a single computer.

Three viewpoints are commonly used:

Parallel algorithms in shared-memory model
  • All processors have access to a shared memory. The algorithm designer chooses the program executed by each processor.
  • One theoretical model is the parallel random-access machine (PRAM).[59] However, the classical PRAM model assumes synchronous access to the shared memory.
  • Shared-memory programs can be extended to distributed systems if the underlying operating system encapsulates the communication between nodes and virtually unifies the memory across all individual systems.
  • A model that is closer to the behavior of real-world multiprocessor machines and takes into account the use of machine instructions, such as Compare-and-swap (CAS), is that of asynchronous shared memory. There is a wide body of work on this model, a summary of which can be found in the literature.[60][61] (A small CAS sketch follows this list.)
Parallel algorithms in message-passing model
  • The algorithm designer chooses the structure of the network, as well as the program executed by each computer.
  • Models such as Boolean circuits and sorting networks are used.[62] A Boolean circuit can be seen as a computer network: each gate is a computer that runs an extremely simple computer program. Similarly, a sorting network can be seen as a computer network: each comparator is a computer.
Distributed algorithms in message-passing model
  • The algorithm designer only chooses the computer program. All computers run the same program. The system must work correctly regardless of the structure of the network.
  • A commonly used model is a graph with one finite-state machine per node.
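
To make the asynchronous shared-memory model concrete, the following sketch emulates a compare-and-swap cell and uses the classic retry loop to increment a counter from several threads. Real CPUs provide CAS as one atomic instruction; the lock here merely stands in for that atomicity:

```python
import threading

class AtomicCell:
    """Emulates a compare-and-swap (CAS) word; the lock stands in for the
    hardware atomicity of a real CAS instruction."""
    def __init__(self, value):
        self._value = value
        self._lock = threading.Lock()

    def load(self):
        with self._lock:
            return self._value

    def compare_and_swap(self, expected, new):
        with self._lock:
            if self._value == expected:
                self._value = new
                return True
            return False

counter = AtomicCell(0)

def increment(times):
    for _ in range(times):
        while True:                 # classic CAS retry loop
            old = counter.load()
            if counter.compare_and_swap(old, old + 1):
                break               # success: no other thread interfered
            # another thread updated the cell first; reread and retry

threads = [threading.Thread(target=increment, args=(1000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(counter.load())               # 4000, despite the concurrent updates
```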

In the case of distributed algorithms, computational problems are typically related to graphs. Often the graph that describes the structure of the computer network is the problem instance. This is illustrated in the following example.[63]

An example


Consider the computational problem of finding a coloring of a given graph G. Different fields might take the following approaches:

Centralized algorithms[63]
  • The graph G is encoded as a string, and the string is given as input to a computer. The computer program finds a coloring of the graph, encodes the coloring as a string, and outputs the result.
Parallel algorithms
  • Again, the graph G is encoded as a string. However, multiple computers can access the same string in parallel. Each computer might focus on one part of the graph and produce a coloring for that part.
  • The main focus is on high-performance computation that exploits the processing power of multiple computers in parallel.
Distributed algorithms
  • The graph G is the structure of the computer network. There is one computer for each node of G and one communication link for each edge of G. Initially, each computer only knows about its immediate neighbors in the graph G; the computers must exchange messages with each other to discover more about the structure of G. Each computer must produce its own color as output. (A simulation sketch follows this list.)
  • The main focus is on coordinating the operation of an arbitrary distributed system.[63]
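
The distributed viewpoint can be simulated in a few lines: each node knows only its neighbours, and in synchronous rounds a node colours itself once it holds the highest identifier among its still-uncoloured neighbours, then announces the colour. This is a toy round-by-round simulation on one machine, not a networked implementation, and the graph is illustrative:

```python
# Each node acts on local information only. A node whose ID is largest among
# its still-uncoloured neighbours picks the smallest colour unused by already
# coloured neighbours; this yields a proper (degree + 1)-colouring.
graph = {1: [2, 3], 2: [1, 3, 4], 3: [1, 2], 4: [2]}   # node -> neighbours
colour = {}                                             # output, one entry per node

while len(colour) < len(graph):
    # One synchronous round: every uncoloured local-maximum node acts "in parallel".
    # Adjacent nodes can never both qualify, so no conflicts arise within a round.
    deciders = [v for v in graph if v not in colour
                and all(u in colour or u < v for u in graph[v])]
    for v in deciders:
        taken = {colour[u] for u in graph[v] if u in colour}  # "messages" received
        colour[v] = min(c for c in range(len(graph[v]) + 1) if c not in taken)

print(colour)   # {3: 0, 4: 0, 2: 1, 1: 2} -- neighbours never share a colour
```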

While the field of parallel algorithms has a different focus than the field of distributed algorithms, there is much interaction between the two fields. For example, the Cole–Vishkin algorithm for graph coloring[64] was originally presented as a parallel algorithm, but the same technique can also be used directly as a distributed algorithm.

Moreover, a parallel algorithm can be implemented either in a parallel system (using shared memory) or in a distributed system (using message passing).[65] The traditional boundary between parallel and distributed algorithms (choose a suitable network vs. run in any given network) does not lie in the same place as the boundary between parallel and distributed systems (shared memory vs. message passing).

Complexity measures


In parallel algorithms, yet another resource in addition to time and space is the number of computers. Indeed, often there is a trade-off between the running time and the number of computers: the problem can be solved faster if there are more computers running in parallel (see speedup). If a decision problem can be solved in polylogarithmic time by using a polynomial number of processors, then the problem is said to be in the class NC.[66] The class NC can be defined equally well by using the PRAM formalism or Boolean circuits—PRAM machines can simulate Boolean circuits efficiently and vice versa.[67]

In the analysis of distributed algorithms, more attention is usually paid to communication operations than to computational steps. Perhaps the simplest model of distributed computing is a synchronous system where all nodes operate in a lockstep fashion. This model is commonly known as the LOCAL model. During each communication round, all nodes in parallel (1) receive the latest messages from their neighbours, (2) perform arbitrary local computation, and (3) send new messages to their neighbours. In such systems, a central complexity measure is the number of synchronous communication rounds required to complete the task.[68]

This complexity measure is closely related to the diameter of the network. Let D be the diameter of the network. On the one hand, any computable problem can be solved trivially in a synchronous distributed system in approximately 2D communication rounds: simply gather all information in one location (D rounds), solve the problem, and inform each node about the solution (D rounds).

On the other hand, if the running time of the algorithm is much smaller than D communication rounds, then the nodes in the network must produce their output without having the possibility to obtain information about distant parts of the network. In other words, the nodes must make globally consistent decisions based on information that is available in their local D-neighbourhood. Many distributed algorithms are known with the running time much smaller than D rounds, and understanding which problems can be solved by such algorithms is one of the central research questions of the field.[69] Typically an algorithm which solves a problem in polylogarithmic time in the network size is considered efficient in this model.
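
The "gather everything in about D rounds" argument is easy to reproduce in simulation. In the sketch below, every node forwards all identifiers it has heard of to its neighbours in each synchronous round; on a path of diameter 3 the loop takes exactly 3 rounds before every node knows the whole input. The topology is illustrative:

```python
# Round-counting sketch of the synchronous LOCAL model: in every round each
# node forwards everything it knows to its neighbours, and we count the rounds
# until every node knows every node's input.
graph = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}   # a path, diameter D = 3
knowledge = {v: {v} for v in graph}              # initially: own input only

rounds = 0
while any(knowledge[v] != set(graph) for v in graph):
    rounds += 1
    outgoing = {v: knowledge[v].copy() for v in graph}   # messages of this round
    for v in graph:
        for u in graph[v]:                               # receive from neighbours
            knowledge[v] |= outgoing[u]

print(rounds)   # 3 == diameter: information needs D rounds to cross the path
```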

Another commonly used measure is the total number of bits transmitted in the network (cf. communication complexity).[70] The features of this concept are typically captured with the CONGEST(B) model, which is similarly defined as the LOCAL model, but where single messages can only contain B bits.

Other problems


Traditional computational problems take the perspective that the user asks a question, a computer (or a distributed system) processes the question, then produces an answer and stops. However, there are also problems where the system is required not to stop, including the dining philosophers problem and other similar mutual exclusion problems. In these problems, the distributed system is supposed to continuously coordinate the use of shared resources so that no conflicts or deadlocks occur.
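
For instance, the dining philosophers problem can be kept deadlock-free by imposing a global order on resource acquisition. The sketch below shows that standard textbook remedy (resource ordering); it is one of several known solutions, not the only one:

```python
import threading

# Deadlock-free dining philosophers via resource ordering: every philosopher
# picks up the lower-numbered fork first, so no cycle of waiting can form.
N = 5
forks = [threading.Lock() for _ in range(N)]

def dine(i, meals=3):
    left, right = i, (i + 1) % N
    first, second = min(left, right), max(left, right)   # global lock order
    for _ in range(meals):
        with forks[first]:
            with forks[second]:
                pass   # "eating": both shared resources held; no deadlock possible

threads = [threading.Thread(target=dine, args=(i,)) for i in range(N)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print("all philosophers finished without deadlock")
```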

There are also fundamental challenges that are unique to distributed computing, for example those related to fault-tolerance. Examples of related problems include consensus problems,[71] Byzantine fault tolerance,[72] and self-stabilisation.[73]

Much research is also focused on understanding the asynchronous nature of distributed systems:

  • Synchronizers can be used to run synchronous algorithms in asynchronous systems.[74]
  • Logical clocks provide a causal happened-before ordering of events.[75]
  • Clock synchronization algorithms provide globally consistent physical time stamps.[76]

Note that in distributed systems, latency should be measured at a high percentile such as the 99th, because the median and the average can be misleading.[77]
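
A small numeric example shows why. In the illustrative sample below, the mean and median look healthy while the 99th-percentile latency exposes the slow tail (the percentile index uses a simple nearest-rank style estimate):

```python
# Why tail latency matters: the mean and median of this request sample look
# fine, but the 99th percentile exposes the slow requests some users hit.
latencies_ms = [10] * 98 + [900, 1200]          # 100 illustrative latencies

mean = sum(latencies_ms) / len(latencies_ms)
ordered = sorted(latencies_ms)
median = ordered[len(ordered) // 2]
p99 = ordered[int(0.99 * (len(ordered) - 1))]   # nearest-rank style estimate

print(mean, median, p99)   # 30.8 10 900 -- the averages hide the slow tail
```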

Coordinator election (or leader election) is the process of designating a single process as the organizer of some task distributed among several computers (nodes). Before the task is begun, all network nodes are either unaware which node will serve as the "coordinator" (or leader) of the task, or unable to communicate with the current coordinator. After a coordinator election algorithm has been run, however, each node throughout the network recognizes a particular, unique node as the task coordinator.[78]

The network nodes communicate among themselves in order to decide which of them will get into the "coordinator" state. For that, they need some method in order to break the symmetry among them. For example, if each node has unique and comparable identities, then the nodes can compare their identities, and decide that the node with the highest identity is the coordinator.[78]
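
A minimal symmetry-breaking sketch along these lines: every node repeatedly forwards the largest identifier it has seen, and once the loop converges (after at most diameter-many rounds) all nodes agree that the maximum identifier is the coordinator. This simulates synchronous rounds on one machine; the topology and identifiers are illustrative:

```python
# Max-ID flooding election: each node nominates itself, then adopts any higher
# ID heard from a neighbour, round by round, until no node changes its choice.
graph = {17: [23, 5], 23: [17, 42], 5: [17], 42: [23]}   # node ID -> neighbours
best = {v: v for v in graph}            # each node starts by nominating itself

changed = True
while changed:                          # synchronous rounds until convergence
    changed = False
    snapshot = dict(best)               # messages "sent" this round
    for v in graph:
        for u in graph[v]:
            if snapshot[u] > best[v]:   # adopt a higher ID heard from a neighbour
                best[v] = snapshot[u]
                changed = True

print(best)   # every node elects 42: {17: 42, 23: 42, 5: 42, 42: 42}
```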

The definition of this problem is often attributed to LeLann, who formalized it as a method to create a new token in a token ring network in which the token has been lost.[79]

Coordinator election algorithms are designed to be economical in terms of total bytes transmitted and time. The algorithm suggested by Gallager, Humblet, and Spira[80] for general undirected graphs has had a strong impact on the design of distributed algorithms in general, and won the Dijkstra Prize for an influential paper in distributed computing.

Many other algorithms were suggested for different kinds of network graphs, such as undirected rings, unidirectional rings, complete graphs, grids, directed Euler graphs, and others. A general method that decouples the issue of the graph family from the design of the coordinator election algorithm was suggested by Korach, Kutten, and Moran.[81]

In order to perform coordination, distributed systems employ the concept of coordinators. The coordinator election problem is to choose a process from among a group of processes on different processors in a distributed system to act as the central coordinator. Several central coordinator election algorithms exist.[82]

Properties of distributed systems


So far the focus has been on designing a distributed system that solves a given problem. A complementary research problem is studying the properties of a given distributed system.[83][84]

The halting problem is an analogous example from the field of centralised computation: we are given a computer program and the task is to decide whether it halts or runs forever. The halting problem is undecidable in the general case, and naturally understanding the behaviour of a computer network is at least as hard as understanding the behaviour of one computer.[85]

However, there are many interesting special cases that are decidable. In particular, it is possible to reason about the behaviour of a network of finite-state machines. One example is telling whether a given network of interacting (asynchronous and non-deterministic) finite-state machines can reach a deadlock. This problem is PSPACE-complete,[86] i.e., it is decidable, but not likely that there is an efficient (centralised, parallel or distributed) algorithm that solves the problem in the case of large networks.

Other topics


Linearizability

See also


Notes

  1. ^ a b Tanenbaum, Andrew S.; Steen, Maarten van (2002). Distributed systems: principles and paradigms. Upper Saddle River, NJ: Pearson Prentice Hall. ISBN 0-13-088893-1. Archived from the original on 2025-08-06. Retrieved 2025-08-06.
  2. ^ "Distributed Programs". Texts in Computer Science. London: Springer London. 2010. pp. 373–406. doi:10.1007/978-1-84882-745-5_11. ISBN 978-1-84882-744-8. ISSN 1868-0941. Systems consist of a number of physically distributed components that work independently using their private storage, but also communicate from time to time by explicit message passing. Such systems are called distributed systems.
  3. ^ Dusseau & Dusseau 2016, p. 1–2.
  4. ^ Ford, Neal (March 3, 2020). Fundamentals of Software Architecture: An Engineering Approach (1st ed.). O'Reilly Media. pp. 146–147. ISBN 978-1492043454.
  5. ^ Monolith to Microservices Evolutionary Patterns to Transform Your Monolith. O'Reilly Media. ISBN 9781492047810.
  6. ^ Building Serverless Applications on Knative. O'Reilly Media. ISBN 9781098142049.
  7. ^ "Distributed Programs". Texts in Computer Science. London: Springer London. 2010. pp. 373–406. doi:10.1007/978-1-84882-745-5_11. ISBN 978-1-84882-744-8. ISSN 1868-0941. Distributed programs are abstract descriptions of distributed systems. A distributed program consists of a collection of processes that work concurrently and communicate by explicit message passing. Each process can access a set of variables which are disjoint from the variables that can be changed by any other process.
  8. ^ Andrews (2000). Dolev (2000). Ghosh (2007), p. 10.
  9. ^ Magnoni, L. (2015). "Modern Messaging for Distributed Sytems (sic)". Journal of Physics: Conference Series. 608 (1): 012038. Bibcode:2015JPhCS.608a2038M. doi:10.1088/1742-6596/608/1/012038. ISSN 1742-6596.
  10. ^ Godfrey (2002).
  11. ^ a b Andrews (2000), p. 291–292. Dolev (2000), p. 5.
  12. ^ Lynch (1996), p. 1.
  13. ^ a b Ghosh (2007), p. 10.
  14. ^ Andrews (2000), pp. 8–9, 291. Dolev (2000), p. 5. Ghosh (2007), p. 3. Lynch (1996), p. xix, 1. Peleg (2000), p. xv.
  15. ^ Andrews (2000), p. 291. Ghosh (2007), p. 3. Peleg (2000), p. 4.
  16. ^ Ghosh (2007), p. 3–4. Peleg (2000), p. 1.
  17. ^ Ghosh (2007), p. 4. Peleg (2000), p. 2.
  18. ^ Ghosh (2007), p. 4, 8. Lynch (1996), p. 2–3. Peleg (2000), p. 4.
  19. ^ Lynch (1996), p. 2. Peleg (2000), p. 1.
  20. ^ Ghosh (2007), p. 7. Lynch (1996), p. xix, 2. Peleg (2000), p. 4.
  21. ^ Fundamentals of Software Architecture: An Engineering Approach. O'Reilly Media. 2020. ISBN 978-1492043454.
  22. ^ a b c d Kleppmann, Martin (2017). Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems. O'Reilly Media. ISBN 978-1449373320.
  23. ^ a b c d Building Event-Driven Microservices: Leveraging Organizational Data at Scale. ISBN 978-1492057895.
  24. ^ Ghosh (2007), p. 10. Keidar (2008).
  25. ^ Lynch (1996), p. xix, 1–2. Peleg (2000), p. 1.
  26. ^ Peleg (2000), p. 1.
  27. ^ Papadimitriou (1994), Chapter 15. Keidar (2008).
  28. ^ See references in Introduction.
  29. ^ Bentaleb, A.; Yifan, L.; Xin, J.; et al. (2016). "Parallel and Distributed Algorithms" (PDF). National University of Singapore. Archived (PDF) from the original on 2025-08-06. Retrieved 20 July 2018.
  30. ^ Andrews (2000), p. 348.
  31. ^ Andrews (2000), p. 32.
  32. ^ Peter (2004), The history of email Archived 2025-08-06 at the Wayback Machine.
  33. ^ Banks, M. (2012). On the Way to the Web: The Secret History of the Internet and its Founders. Apress. pp. 44–5. ISBN 9781430250746. Archived from the original on 2025-08-06. Retrieved 2025-08-06.
  34. ^ Tel, G. (2000). Introduction to Distributed Algorithms. Cambridge University Press. pp. 35–36. ISBN 9780521794831. Archived from the original on 2025-08-06. Retrieved 2025-08-06.
  35. ^ Ohlídal, M.; Jaro?, J.; Schwarz, J.; et al. (2006). "Evolutionary Design of OAB and AAB Communication Schedules for Interconnection Networks". In Rothlauf, F.; Branke, J.; Cagnoni, S. (eds.). Applications of Evolutionary Computing. Springer Science & Business Media. pp. 267–78. ISBN 9783540332374.
  36. ^ "Real Time And Distributed Computing Systems" (PDF). ISSN 2278-0661. Archived from the original (PDF) on 2025-08-06. Retrieved 2025-08-06. {{cite journal}}: Cite journal requires |journal= (help)
  37. ^ Vigna P, Casey MJ. The Age of Cryptocurrency: How Bitcoin and the Blockchain Are Challenging the Global Economic Order St. Martin's Press January 27, 2015 ISBN 9781250065636
  38. ^ Quang Hieu Vu; Mihai Lupu; Beng Chin Ooi (2010). Peer-to-peer computing : principles and applications. Heidelberg: Springer. p. 16. ISBN 9783642035135. OCLC 663093862.
  39. ^ Lind P, Alm M (2006), "A database-centric virtual chemistry system", J Chem Inf Model, 46 (3): 1034–9, doi:10.1021/ci050360b, PMID 16711722.
  40. ^ Chiu, G (1990). "A model for optimal database allocation in distributed computing systems". Proceedings. IEEE INFOCOM'90: Ninth Annual Joint Conference of the IEEE Computer and Communications Societies.
  41. ^ Newman, Sam (2025-08-06). Building Microservices. O'Reilly Media. ISBN 978-1491950357.
  42. ^ Richardson, Chris (2019). Microservices patterns: with examples in Java. Shelter Island, NY: Manning Publications. ISBN 978-1-61729-454-9.
  43. ^ Christudas, Binildas (2019). Practical Microservices Architectural Patterns: Event-Based Java Microservices with Spring Boot and Spring Cloud. Berkeley, CA: Apress L. P. ISBN 978-1-4842-4501-9.
  44. ^ Newman, Sam (2025-08-06). Building Microservices. O'Reilly Media. ISBN 978-1491950357.
  45. ^ Richardson, Chris (2019). Microservices patterns: with examples in Java. Shelter Island, NY: Manning Publications. ISBN 978-1-61729-454-9.
  46. ^ Christudas, Binildas (2019). Practical Microservices Architectural Patterns: Event-Based Java Microservices with Spring Boot and Spring Cloud. Berkeley, CA: Apress L. P. ISBN 978-1-4842-4501-9.
  47. ^ Newman, Sam (2025-08-06). Building Microservices. O'Reilly Media. ISBN 978-1491950357.
  48. ^ Richardson, Chris (2019). Microservices patterns: with examples in Java. Shelter Island, NY: Manning Publications. ISBN 978-1-61729-454-9.
  49. ^ Christudas, Binildas (2019). Practical Microservices Architectural Patterns: Event-Based Java Microservices with Spring Boot and Spring Cloud. Berkeley, CA: Apress L. P. ISBN 978-1-4842-4501-9.
  50. ^ Newman, Sam (2025-08-06). Building Microservices. O'Reilly Media. ISBN 978-1491950357.
  51. ^ Richardson, Chris (2019). Microservices patterns: with examples in Java. Shelter Island, NY: Manning Publications. ISBN 978-1-61729-454-9.
  52. ^ Christudas, Binildas (2019). Practical Microservices Architectural Patterns: Event-Based Java Microservices with Spring Boot and Spring Cloud. Berkeley, CA: Apress L. P. ISBN 978-1-4842-4501-9.
  53. ^ Elmasri & Navathe (2000), Section 24.1.2.
  54. ^ Andrews (2000), p. 10–11. Ghosh (2007), p. 4–6. Lynch (1996), p. xix, 1. Peleg (2000), p. xv. Elmasri & Navathe (2000), Section 24.
  55. ^ Haussmann, J. (2019). "Cost-efficient parallel processing of irregularly structured problems in cloud computing environments". Journal of Cluster Computing. 22 (3): 887–909. doi:10.1007/s10586-018-2879-3. S2CID 54447518.
  56. ^ Reactive Application Development. Manning. 2018. ISBN 9781638355816.
  57. ^ Toomarian, N.B.; Barhen, J.; Gulati, S. (1992). "Neural Networks for Real-Time Robotic Applications". In Fijany, A.; Bejczy, A. (eds.). Parallel Computation Systems For Robotics: Algorithms And Architectures. World Scientific. p. 214. ISBN 9789814506175. Archived from the original on 2025-08-06. Retrieved 2025-08-06.
  58. ^ Savage, J.E. (1998). Models of Computation: Exploring the Power of Computing. Addison Wesley. p. 209. ISBN 9780201895391.
  59. ^ Cormen, Leiserson & Rivest (1990), Section 30.
  60. ^ Herlihy & Shavit (2008), Chapters 2–6.
  61. ^ Lynch (1996)
  62. ^ Cormen, Leiserson & Rivest (1990), Sections 28 and 29.
  63. ^ a b c Tulsiramji Gaikwad-Patil College of Engineering & Technology, Nagpur, Department of Information Technology. Introduction to Distributed Systems.
  64. ^ Cole & Vishkin (1986). Cormen, Leiserson & Rivest (1990), Section 30.5.
  65. ^ Andrews (2000), p. ix.
  66. ^ Arora & Barak (2009), Section 6.7. Papadimitriou (1994), Section 15.3.
  67. ^ Papadimitriou (1994), Section 15.2.
  68. ^ Lynch (1996), p. 17–23.
  69. ^ Peleg (2000), Sections 2.3 and 7. Linial (1992). Naor & Stockmeyer (1995).
  70. ^ Schneider, J.; Wattenhofer, R. (2011). "Trading Bit, Message, and Time Complexity of Distributed Algorithms". In Peleg, D. (ed.). Distributed Computing. Springer Science & Business Media. pp. 51–65. ISBN 9783642240997. Archived from the original on 2025-08-06. Retrieved 2025-08-06.
  71. ^ Lynch (1996), Sections 5–7. Ghosh (2007), Chapter 13.
  72. ^ Lynch (1996), p. 99–102. Ghosh (2007), p. 192–193.
  73. ^ Dolev (2000). Ghosh (2007), Chapter 17.
  74. ^ Lynch (1996), Section 16. Peleg (2000), Section 6.
  75. ^ Lynch (1996), Section 18. Ghosh (2007), Sections 6.2–6.3.
  76. ^ Ghosh (2007), Section 6.4.
  77. ^ Kamburugamuve, Supun; Ekanayake, Saliya (2021). Foundations of Data Intensive Applications Large Scale Data Analytics Under the Hood. John Wiley & Sons. ISBN 9781119713012.
  78. ^ a b Haloi, S. (2015). Apache ZooKeeper Essentials. Packt Publishing Ltd. pp. 100–101. ISBN 9781784398323. Archived from the original on 2025-08-06. Retrieved 2025-08-06.
  79. ^ LeLann, G. (1977). "Distributed systems - toward a formal approach". Information Processing. 77: 155–160 – via Elsevier.
  80. ^ R. G. Gallager, P. A. Humblet, and P. M. Spira (January 1983). "A Distributed Algorithm for Minimum-Weight Spanning Trees" (PDF). ACM Transactions on Programming Languages and Systems. 5 (1): 66–77. doi:10.1145/357195.357200. S2CID 2758285. Archived (PDF) from the original on 2025-08-06.
  81. ^ Korach, Ephraim; Kutten, Shay; Moran, Shlomo (1990). "A Modular Technique for the Design of Efficient Distributed Leader Finding Algorithms" (PDF). ACM Transactions on Programming Languages and Systems. 12 (1): 84–101. CiteSeerX 10.1.1.139.7342. doi:10.1145/77606.77610. S2CID 9175968. Archived (PDF) from the original on 2025-08-06.
  82. ^ Hamilton, Howard. "Distributed Algorithms". Archived from the original on 2025-08-06. Retrieved 2025-08-06.
  83. ^ "Major unsolved problems in distributed systems?". cstheory.stackexchange.com. Archived from the original on 20 January 2023. Retrieved 16 March 2018.
  84. ^ "How big data and distributed systems solve traditional scalability problems". theserverside.com. Archived from the original on 17 March 2018. Retrieved 16 March 2018.
  85. ^ Svozil, K. (2011). "Indeterminism and Randomness Through Physics". In Hector, Z. (ed.). Randomness Through Computation: Some Answers, More Questions. World Scientific. pp. 112–3. ISBN 9789814462631. Archived from the original on 2025-08-06. Retrieved 2025-08-06.
  86. ^ Papadimitriou (1994), Section 19.3.

References

Books
Articles
Web sites

Further reading

Books
Articles
Conference Papers
  • Rodriguez, Carlos; Villagra, Marcos; Baran, Benjamin (2007). "Asynchronous team algorithms for Boolean Satisfiability". 2007 2nd Bio-Inspired Models of Network, Information and Computing Systems. pp. 66–69. doi:10.1109/BIMNICS.2007.4610083. S2CID 15185219.