You keep hearing about the term "Distributed Systems". But what is it exactly?

This 🧵 covers some key characteristics of a distributed system:

👇🏻
A distributed systems is basically a system with multiple components that communicate and co-ordinate with each other to perform a task.

The whole system appears to be a single entity to the end user.

Below are some of the key characteristics of a distributed system:

👇🏻
Scalability:

The ability to quickly and easily increase or decrease the size of a system or process to manage increased or decreased demand.

Any distributed system that can continuously evolve in order to support the growing amount of work is considered to be scalable.

👇🏻
Reasons to scale up a system:

▫️increased traffic to a service
▫️increase data volume
▫️increased number of transactions
etc

Types of scaling:

▫️horizontal
▫️vertical

👇🏻
Reliability:

A distributed system is considered reliable if it keeps delivering its services even when one or several of its software or hardware components fail.

Reliability is one of the main characteristics of any distributed system & is achieved through redundancy.

👇🏻
Redundancy basically means keeping backup resources like servers, data etc. When the main components go down, backups will serve the incoming traffic.

Redundancy comes with an additional cost. It's a tradeoff.

👇🏻
Availability:

Availability is the time a system remains operational to perform its required function in a specific period.

It is basically the percentage of the time for which the system remains operational.

👇🏻
You might have heard about 99.99% uptime. It means that the system / service can be down only for around 4 minutes in a month.

If a system is reliable, it is available. However, if it is available, it is not necessarily reliable.

👇🏻
Efficiency:

There are two standard ways to measure the efficiency of a system:

Latency: The total time taken to process a request and return a response.

Throughput: Number of requests processed in a given unit time.

👇🏻
Of course there are many other factors involved to say that a system is efficient based on the use case involved and how a system is built.

But above two are the major ones.

👇🏻
Concurrency:

Concurrency is when multiple mutually exclusive operations are executed at the same time on same/different components of the system without any inconsistencies.

Concurrency reduces the latency and increases the throughput of a system.

👇🏻
Serviceability:

This is one of the important factors to consider while designing a distributed system.

It basically means the ease with which new features can be added and failures can be diagnosed in the system.

👇🏻
If the time it takes to fix an issue increases, it's availability decreases.

There's no point building a complex system which is hard to maintain. The architecture should be flexible to add or remove new components frequently.

👇🏻
You can follow @sunilc_.
Tip: mention @twtextapp on a Twitter thread with the keyword “unroll” to get a link to it.

Latest Threads Unrolled: