WikiGalaxy

Personalize

System Design Basics and Overview

Introduction to System Design

Why Learn System Design?

Fundamentals of Scalability and Performance

System Design Terminology for Beginners

Understanding Trade-offs in System Design

System Design Key Concepts and Terminology

Client-Server Architecture

Load Balancers and Reverse Proxies

Database Concepts in System Design

Horizontal vs Vertical Scaling

Consistency, Availability, and Partition Tolerance

Designing Scalable Systems

Principles of Scalable System Design

Designing for Read vs Write Heavy Workloads

Replication and Data Redundancy

Sharding Strategies and Their Use Cases

Designing Systems for Traffic Spikes

High-Level System Design Process

Understanding System Design Interview Process

Defining Requirements and Constraints

Designing High-Level Architecture

Estimating Scalability and Costs

Breaking Down the System Design into Components

System Design with Client-Server Architecture

Understanding Client-Server Architecture

Designing Scalable Client-Server Systems

Communication Protocols in Client-Server Models

Load Balancing and Traffic Distribution

Introduction to Load Balancing

Types of Load Balancers (Hardware and Software)

Traffic Distribution Techniques for Scalability

Database Design and Sharding

Fundamentals of Database Design

Database Sharding Strategies

ACID vs BASE in Database Design

Caching Mechanisms and Strategies

Overview of Caching Mechanisms

Types of Caches: In-Memory, Distributed, and Local

Cache Invalidation and Eviction Policies

Messaging Queues and Event Streaming

Introduction to Messaging Queues

Event Streaming and Event-Driven Architecture

Designing Scalable Event-Driven Systems

Distributed Systems and Consistency Models

Introduction to Distributed Systems

CAP Theorem in Distributed Systems

Consistency Models in Distributed Systems

System Design for Real-Time Applications

Designing Real-Time Systems

Real-Time Data Processing and Communication

Latency and Throughput Considerations in Real-Time Systems

APIs and Microservices Architecture

API Design Fundamentals

Building Scalable Microservices with APIs

Managing API Versioning and Backward Compatibility

Designing Resilient and Fault-Tolerant Systems

Resilience in System Design

Fault Tolerance Strategies in Distributed Systems

Designing for High Availability and Failover

Scalability and Performance Optimization

Scalability Considerations in System Design

Performance Optimization in Distributed Systems

Vertical vs Horizontal Scaling for Performance

CAP Theorem and System Trade-offs

Understanding the CAP Theorem

System Trade-offs and Trade-off Analysis

Designing Systems Based on CAP Theorem Trade-offs

Designing Systems for High Availability

Principles of High Availability Design

Fault Isolation and Redundancy in High Availability

Disaster Recovery and Data Replication for HA

Security Best Practices in System Design

Security Fundamentals in System Design

Data Encryption and Secure Data Transmission

Authentication and Authorization Strategies

Case Studies and Real-World System Design Examples

Real-World System Design: E-Commerce Platform

Real-World System Design: Social Media Application

Real-World System Design: Messaging Service

Database Concepts in System Design

Normalization:

Normalization is the process of organizing data to minimize redundancy. It involves dividing a database into two or more tables and defining relationships between the tables.

Denormalization:

Denormalization is the process of combining tables to improve read performance. This is often used in data warehousing where query speed is crucial.

ACID Properties:

ACID stands for Atomicity, Consistency, Isolation, Durability. These properties ensure reliable processing of database transactions.

CAP Theorem:

CAP Theorem states that a distributed database system can only provide two out of the three guarantees: Consistency, Availability, and Partition Tolerance.

Indexing:

Indexing improves the speed of data retrieval operations on a database table at the cost of additional writes and storage space.

Sharding:

Sharding involves partitioning a database into smaller, faster, more easily managed parts called shards.

Normalization

First Normal Form (1NF):

Ensures that the values in a table are atomic and each column contains unique data.

Second Normal Form (2NF):

Builds on 1NF by ensuring that all non-key attributes are fully functional dependent on the primary key.

Third Normal Form (3NF):

Ensures that all attributes are only dependent on the primary key, removing transitive dependencies.


      // Example of a table in 1NF
      Table: Students
      +---------+-----------+-------------------+
      | ID      | Name      | Courses           |
      +---------+-----------+-------------------+
      | 1       | Alice     | Math, Science     |
      | 2       | Bob       | English, History  |
      +---------+-----------+-------------------+

Explanation:

The table above violates 1NF because the 'Courses' column contains multiple values. To convert it into 1NF, we should split the values into separate rows.

Denormalization

Purpose:

Denormalization is used to improve the read performance of a database at the expense of write performance and storage.

Use Cases:

Commonly used in OLAP systems where complex queries are performed on large volumes of data.


      // Denormalized table example
      Table: Orders
      +---------+-----------+-------------------+
      | OrderID | Customer  | ProductDetails    |
      +---------+-----------+-------------------+
      | 101     | John Doe  | TV, 2, $400       |
      | 102     | Jane Doe  | Laptop, 1, $800   |
      +---------+-----------+-------------------+

Explanation:

In the denormalized table above, 'ProductDetails' combines multiple columns into one, making it easier to read but harder to update.

ACID Properties

Atomicity:

Ensures that each transaction is treated as a single unit, which either succeeds completely or fails completely.

Consistency:

Ensures that a transaction can only bring the database from one valid state to another, maintaining database invariants.

Isolation:

Ensures that concurrent execution of transactions leaves the database in the same state as if the transactions were executed sequentially.

Durability:

Ensures that once a transaction has been committed, it will remain so, even in the event of a power loss, crash, or error.


      // Pseudo-code illustrating ACID properties
      BEGIN TRANSACTION
        UPDATE Account SET Balance = Balance - 100 WHERE AccountID = 1;
        UPDATE Account SET Balance = Balance + 100 WHERE AccountID = 2;
      COMMIT;

Explanation:

The pseudo-code demonstrates a simple bank transfer operation where ACID properties ensure that the transaction is atomic, consistent, isolated, and durable.

CAP Theorem

Consistency:

Every read receives the most recent write or an error.

Availability:

Every request receives a response, without guarantee that it contains the most recent write.

Partition Tolerance:

The system continues to operate despite an arbitrary number of messages being dropped or delayed by the network between nodes.


      // Example of CAP theorem in distributed systems
      // Choose two: Consistency, Availability, Partition Tolerance

Explanation:

In distributed databases, it's impossible to achieve all three guarantees simultaneously. Systems must choose which two to prioritize based on their specific needs.

Indexing

Purpose:

Indexes are used to quickly locate data without having to search every row in a database table every time a database table is accessed.

Types:

Common types include B-tree indexes and hash indexes, each suitable for different types of queries.


      // SQL command to create an index
      CREATE INDEX idx_customer_name ON Customers (Name);

Explanation:

The SQL command above creates an index on the 'Name' column of the 'Customers' table, improving the speed of queries searching by customer name.

Sharding

Purpose:

Sharding helps in scaling a database by distributing the data across multiple machines, allowing for more efficient querying and storage.

Types:

Horizontal sharding splits data across rows, while vertical sharding splits data across columns.


      // Example of horizontal sharding
      // Shard 1: User data for users with ID 1-1000
      // Shard 2: User data for users with ID 1001-2000

Explanation:

The example demonstrates horizontal sharding where user data is distributed across multiple shards based on user IDs, allowing for better load balancing and performance.