CQRS Pattern and Event Sourcing System Design

#architecture #performance #systemdesign

Core Concepts and Overview

CQRS (Command Query Responsibility Segregation) separates the operations that modify data (commands) from those that read data (queries).
Traditional applications handle CRUD (Create, Read, Update, Delete) operations in a single database and layer, potentially causing bottlenecks during heavy read/write loads.
CQRS addresses this by splitting the system into two parts:
Command side: handles all data mutation (create, update, delete).
Query side: handles all read operations.
This separation helps optimize system performance, scalability, and maintainability, especially in high-complexity systems.

Traditional Application Architecture and Its Limitations

Users interact with a server layer exposing REST API endpoints (GET, POST, PATCH, DELETE).
The server processes requests via controllers and service layers, directly performing CRUD operations on a single database.
Scaling traditional apps involves vertical scaling (adding CPU/RAM) or horizontal scaling (adding more server instances).
Bottleneck: When reads and writes compete on the same database, locks during updates cause delays and slow queries, especially under high load (example: Amazon product price updates vs reads).
This leads to database contention and performance degradation.

CQRS Pattern Architecture

Component	Description
Presentation Layer	User Interface and REST API endpoints that act as the entry point for all requests
API Gateway	Routes read (query) requests to the query side and mutation (command) requests to the command side
Command Side	Handles commands (create, update, delete) and writes to a dedicated write database
Query Side	Handles queries (read operations) from a separate read database
Event System	Synchronizes changes from the write database to the read database using events and queues

Separate Databases for reads and writes: read database is optimized for queries (often denormalized), write database is normalized and optimized for transactions.
The write model processes commands validating and authorizing them before updating the write database.
The read model processes queries against the read database, which is updated asynchronously via events emitted after writes.
This results in eventual consistency between read and write databases, acceptable in many business scenarios but unsuitable for highly real-time systems (e.g., stock markets).

Event Sourcing Integration

CQRS can be combined with Event Sourcing, where every change is stored as an append-only event log rather than directly updating the state.
The system stores immutable logs of all commands/events, which can be replayed to rebuild the current state of the database (hydration).
This provides fault tolerance; if the read database becomes corrupt or stale, it can be regenerated from the event log.
Event logs can also trigger side effects such as sending promotional or notification emails.

Practical AWS-Based System Design Example

AWS Component	Role
API Gateway	Routes requests based on HTTP method to command or query services
Elastic Load Balancer (ELB)	Distributes requests among multiple horizontally scaled EC2 instances for command/query services
EC2 Instances (Command Handlers)	Execute commands, perform validation and authorization
Kafka (or AWS Kinesis)	Event/message broker for append-only event logs
SQS Queues	Handle asynchronous event processing and fan-out to services like email notifications
Lambda Functions	Process events to update read database and trigger other actions
DynamoDB (Read DB)	Stores denormalized data optimized for fast queries
ClickHouse or similar	Example write database storing append-only logs
CloudFront CDN	Caches GET requests for faster read performance with cache invalidation upon updates

The architecture enables horizontal scalability, fault tolerance, and efficient separation of concerns.
Read and write paths can be independently optimized with different database technologies (SQL for writes, NoSQL for reads).

Benefits and Trade-offs

Benefits:

Improved scalability by separating reads and writes.
Reduced contention and locking issues on databases.
Flexibility to use different databases optimized for different workloads.
Fault tolerance and recoverability via event sourcing.
Ability to implement complex business logic and authorization in command handlers.

Trade-offs:

When to Use CQRS

Suitable for complex, large-scale, distributed systems.
When read and write workloads have different performance, scaling, or consistency requirements.
When multiple microservices and databases are involved, requiring data segregation.
When eventual consistency is acceptable for the business domain.
Not recommended for small/simple applications or those needing immediate strong consistency.

Key Insights

CQRS is a powerful pattern for scaling complex applications by segregating commands and queries.
The use of different databases for read and write sides is central to the pattern.
Event sourcing complements CQRS by maintaining a reliable audit log and enabling system state reconstruction.
AWS ecosystem components like API Gateway, ELB, EC2, Lambda, DynamoDB, Kafka/Kinesis, and SQS can effectively implement CQRS with event sourcing.
Eventual consistency is a core characteristic and must be carefully evaluated against application needs.

DEV Community