You keep hearing about the term "Distributed Systems". But what is it exactly?
This
covers some key characteristics of a distributed system:
This


A distributed systems is basically a system with multiple components that communicate and co-ordinate with each other to perform a task.
The whole system appears to be a single entity to the end user.
Below are some of the key characteristics of a distributed system:
The whole system appears to be a single entity to the end user.
Below are some of the key characteristics of a distributed system:

Scalability:
The ability to quickly and easily increase or decrease the size of a system or process to manage increased or decreased demand.
Any distributed system that can continuously evolve in order to support the growing amount of work is considered to be scalable.
The ability to quickly and easily increase or decrease the size of a system or process to manage increased or decreased demand.
Any distributed system that can continuously evolve in order to support the growing amount of work is considered to be scalable.

Reasons to scale up a system:
increased traffic to a service
increase data volume
increased number of transactions
etc
Types of scaling:
horizontal
vertical



etc
Types of scaling:



Reliability:
A distributed system is considered reliable if it keeps delivering its services even when one or several of its software or hardware components fail.
Reliability is one of the main characteristics of any distributed system & is achieved through redundancy.
A distributed system is considered reliable if it keeps delivering its services even when one or several of its software or hardware components fail.
Reliability is one of the main characteristics of any distributed system & is achieved through redundancy.

Redundancy basically means keeping backup resources like servers, data etc. When the main components go down, backups will serve the incoming traffic.
Redundancy comes with an additional cost. It's a tradeoff.
Redundancy comes with an additional cost. It's a tradeoff.

Availability:
Availability is the time a system remains operational to perform its required function in a specific period.
It is basically the percentage of the time for which the system remains operational.
Availability is the time a system remains operational to perform its required function in a specific period.
It is basically the percentage of the time for which the system remains operational.

You might have heard about 99.99% uptime. It means that the system / service can be down only for around 4 minutes in a month.
If a system is reliable, it is available. However, if it is available, it is not necessarily reliable.
If a system is reliable, it is available. However, if it is available, it is not necessarily reliable.

Efficiency:
There are two standard ways to measure the efficiency of a system:
Latency: The total time taken to process a request and return a response.
Throughput: Number of requests processed in a given unit time.
There are two standard ways to measure the efficiency of a system:
Latency: The total time taken to process a request and return a response.
Throughput: Number of requests processed in a given unit time.

Of course there are many other factors involved to say that a system is efficient based on the use case involved and how a system is built.
But above two are the major ones.
But above two are the major ones.

Concurrency:
Concurrency is when multiple mutually exclusive operations are executed at the same time on same/different components of the system without any inconsistencies.
Concurrency reduces the latency and increases the throughput of a system.
Concurrency is when multiple mutually exclusive operations are executed at the same time on same/different components of the system without any inconsistencies.
Concurrency reduces the latency and increases the throughput of a system.

Serviceability:
This is one of the important factors to consider while designing a distributed system.
It basically means the ease with which new features can be added and failures can be diagnosed in the system.
This is one of the important factors to consider while designing a distributed system.
It basically means the ease with which new features can be added and failures can be diagnosed in the system.

If the time it takes to fix an issue increases, it's availability decreases.
There's no point building a complex system which is hard to maintain. The architecture should be flexible to add or remove new components frequently.
There's no point building a complex system which is hard to maintain. The architecture should be flexible to add or remove new components frequently.

If you're preparing for system design interview, this course
is highly recommended.
It covers all the topics and real world examples of designing distributed systems. https://www.educative.io/courses/grokking-the-system-design-interview?aff=xmn2

It covers all the topics and real world examples of designing distributed systems. https://www.educative.io/courses/grokking-the-system-design-interview?aff=xmn2