A playground event-ticketing distributed system for browsing events, purchasing tickets, and processing payments, built to learn distributed systems concepts through hands-on experience.
It follows a build, break, then fix approach: each step shows how a requirement is built, what issues it solves, and how it works, starting with a monolith and progressively breaking it into distributed components.
- Node.js (v18+ recommended): `node --version` should show v18.x.x or higher
- Docker & Docker Compose: `docker --version` should show Docker version 20.x.x or higher; `docker-compose --version` should show docker-compose version 1.29.x or higher
- npm or yarn: `npm --version` should show 8.x.x or higher
- cURL or HTTPie (for API testing)
- Artillery (for load testing, installed via npm)
# Terminal 1
# clone the repo
git clone https://github.com/oyinetare/ticketflow.git
cd ticketflow
cd monolith
npm install
docker-compose up -d
npm run dev
# Terminal 2 - test basic functionality
# get all events
curl http://localhost:3000/api/v2/events
# purchase a ticket
curl -X POST http://localhost:3000/api/v2/tickets/purchase \
-H "Content-Type: application/json" \
-d '{"eventId": "3", "userId": "test-user"}'
# stop docker
docker-compose down
- Event management
  - View all available events (search events)
  - View event details
  - Create event
- Ticket purchasing
  - Purchase event tickets
  - Reserve tickets (prevent others from buying the same tickets)
  - Process payment
  - Confirm ticket purchase
  - Handle failed payments gracefully
- User features
  - View purchased tickets
  - Get ticket confirmation
  - View purchase history
- Inventory management
  - Track available tickets per event
  - Prevent overselling (no negative inventory)
  - Release tickets if payment fails
- Reliability: The system should prioritise availability, i.e. 99.9% uptime (about 8.7 hours of downtime per year), for searching and viewing events, but prioritise consistency for booking (no double booking, never sell the same ticket twice, charge exactly once per ticket, no lost tickets or payments). Eventual consistency (a few seconds of delay) is acceptable elsewhere.
- Fault tolerance: the system stays up if one service fails.
- Scalability: the system should handle high throughput for popular events (10 million users, one event), add more servers during peak times, scale the database (millions of tickets/events), and scale the queues (thousands of payments per minute).
- Performance: the system should have low-latency search (browse events in 200-500 ms) and complete purchasing in under 3 seconds (including payment).
- Read-heavy workload: support a roughly 100:1 read-to-write ratio and handle 1,000 concurrent users.
- High concurrency.
- Security: no storing of credit cards, rate limiting to prevent API abuse, and data privacy (secure user information).
- 1 — Monolith
- 2 — Scaling 1: Queues + Redis Locking + API improvements
- 3 — Scaling 2: Microservices
- 4 - Advanced patterns: Saga & Event Sourcing
- REST API for events (create, list, get)
- Ticket purchase endpoint (synchronous)
- Mock payment processing
- Load testing
Client → API → Services → SQLite
- TypeScript + Node.js + Express
- SQLite for the database (easy to start, single-file DB)
- Simple in-memory payment processing
- Event
- Ticket
- Payment
export interface Event {
id: string;
name: string;
venue: string;
date: Date;
totalTickets: number;
availableTickets: number;
price: number;
createdAt: Date;
}
export interface Ticket {
id: string;
eventId: string;
userId: string;
purchaseDate: Date;
price: number;
status: "pending" | "confirmed" | "cancelled";
}
export interface Payment {
id: string;
ticketId: string;
userId: string;
amount: number;
status: "pending" | "completed" | "failed";
processedAt?: Date;
createdAt: Date;
}
SQLite In-Memory Database
TABLE events (
id TEXT PRIMARY KEY,
name TEXT NOT NULL,
venue TEXT NOT NULL,
date TEXT NOT NULL,
totalTickets INTEGER NOT NULL,
availableTickets INTEGER NOT NULL,
price REAL NOT NULL,
createdAt TEXT NOT NULL
)
TABLE tickets (
id TEXT PRIMARY KEY,
eventId TEXT NOT NULL,
userId TEXT NOT NULL,
purchaseDate TEXT NOT NULL,
price REAL NOT NULL,
status TEXT NOT NULL,
FOREIGN KEY (eventId) REFERENCES events(id)
)
TABLE payments (
id TEXT PRIMARY KEY,
ticketId TEXT NOT NULL,
userId TEXT NOT NULL,
amount REAL NOT NULL,
status TEXT NOT NULL,
processedAt TEXT,
createdAt TEXT NOT NULL,
FOREIGN KEY (ticketId) REFERENCES tickets(id)
)
GET /api/events # get all events
GET /api/events/:id # get single event by id
POST /api/events # create event
POST /api/tickets/purchase # purchase ticket
GET /api/tickets/user/:userId # get a user's tickets
- Event Service
- Payment Service
- Ticket Service
- simple Express app (see the bootstrap sketch after this list)
- Middleware
- cors
- express.json(), so the app can read JSON data sent by the client (e.g. in POST or PUT requests) and expose it on req.body; without it, Express cannot parse JSON request bodies
- logging middleware
- error handler
- init db
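A minimal bootstrap sketch of that wiring, assuming route modules and an `initDatabase` helper along the lines described above (file and module names are illustrative, not the repo's exact layout):

```ts
// src/app.ts — bootstrap sketch (module names are assumptions)
import express from "express";
import cors from "cors";
import { initDatabase } from "./db";          // assumed helper that creates the SQLite tables
import eventRoutes from "./routes/events";    // assumed routers matching the endpoints above
import ticketRoutes from "./routes/tickets";

const app = express();

app.use(cors());            // allow cross-origin requests from the client
app.use(express.json());    // parse JSON bodies into req.body

// simple request logger
app.use((req, _res, next) => {
  console.log(`${req.method} ${req.path}`);
  next();
});

app.use("/api/events", eventRoutes);
app.use("/api/tickets", ticketRoutes);

// error handler registered last
app.use((err: Error, _req: express.Request, res: express.Response, _next: express.NextFunction) => {
  console.error(err);
  res.status(500).json({ error: "Internal server error" });
});

initDatabase().then(() => {
  app.listen(3000, () => console.log("Monolith listening on :3000"));
});
```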
- Middleware
- Event management: View all available events (search events) + View event details + Create event
- routes call asynchronous functions in the service layer, i.e. await getAllEvents, await getEventById, await createEvent
- services return Promise<> running either SELECT FROM <table> or INSERT INTO <table>
- Ticket purchasing: Purchase event tickets + Process payment + Confirm ticket (a condensed sketch of this flow follows the list)
- check event availability (eventService.getEventById) ISSUE: causes RACE CONDITION
- create ticket (ticketService.createTicket) with status pending
- process payment (paymentService.processPayment) ISSUE: SLOW & causes problems
- simulate api call to payment gateway (this will cause race conditions, multiple attempts to pay) & simulate occasional payment failures
- create payment with status completed
- Update ticket status to confirmed or cancelled (ticketService.updateTicketStatus)
- Update event tickets count (eventService.updateEventTickets)
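A condensed sketch of that naive flow, using the service method names from the outline above (error handling and exact signatures are assumptions). Note that no step shares a transaction with any other, which is exactly what the race-condition test later exploits:

```ts
// purchase flow (sketch) — naive and non-atomic; service signatures are assumed
import * as eventService from "./eventService";
import * as ticketService from "./ticketService";
import * as paymentService from "./paymentService";

export async function purchaseTicket(eventId: string, userId: string) {
  // 1. read availability — this value is stale by the time we act on it
  const event = await eventService.getEventById(eventId);
  if (!event) throw new Error("Event not found");                      // route maps this to 404
  if (event.availableTickets <= 0) throw new Error("No tickets left"); // route maps this to 400

  // 2. create the ticket with status "pending"
  const ticket = await ticketService.createTicket(eventId, userId, event.price);

  try {
    // 3. simulated 1-3s payment gateway call — the window where concurrent requests pile up
    await paymentService.processPayment(ticket.id, userId, event.price);

    // 4-5. confirm and decrement inventory — two separate writes, no shared transaction
    await ticketService.updateTicketStatus(ticket.id, "confirmed");
    await eventService.updateEventTickets(eventId, -1);
    return ticket;
  } catch (paymentError) {
    await ticketService.updateTicketStatus(ticket.id, "cancelled");
    throw paymentError;
  }
}
```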
# get all events
curl http://localhost:3000/api/v2/events
# purchase a ticket
curl -X POST http://localhost:3000/api/v2/tickets/purchase \
-H "Content-Type: application/json" \
-d '{"eventId": "3", "userId": "test-user"}'
- using nouns for resource names and plural nouns for collection URIs, i.e. /events instead of /create-events; the action on a URI is already implied by the HTTP GET, POST, PUT, PATCH, and DELETE methods
- relationships kept simple and flexible
- no API versioning for now
- Interface types don't mirror the internal structure of the database, which keeps the API an abstraction over the database and reduces the risk of widening the attack surface or leaking data through the API
- Implement asynchronous methods in services
- ACID Transactions are Critical
BEGIN TRANSACTION;
-- Check availability
SELECT availableTickets FROM events WHERE id = ? FOR UPDATE;
-- Create ticket
INSERT INTO tickets (eventId, userId, status) VALUES (?, ?, 'pending');
-- Update inventory
UPDATE events SET availableTickets = availableTickets - 1 WHERE id = ?;
-- Process payment
INSERT INTO payments (ticketId, amount, status) VALUES (?, ?, 'completed');
COMMIT;
- Strong Consistency Requirements
// With SQL, this is guaranteed to be accurate
const event = await db.query(
"SELECT COUNT(*) as soldTickets FROM tickets WHERE eventId = ? AND status = 'confirmed'",
[eventId]
);
- NoSQL often provides eventual consistency
- For ticket inventory, you need immediate consistency
- Can't risk overselling due to replication lag
- Complex Relational Queries
-- Find all events a user has attended with payment details
SELECT e.name, e.date, t.purchaseDate, p.amount, p.status
FROM tickets t
JOIN events e ON t.eventId = e.id
JOIN payments p ON p.ticketId = t.id
WHERE t.userId = ?
ORDER BY e.date DESC;
- Events → Tickets → Payments → Users all interconnected
- NoSQL would require multiple queries or denormalization
- Financial Data Integrity
-- Ensure payment reconciliation
SELECT
SUM(p.amount) as totalRevenue,
COUNT(DISTINCT t.id) as ticketsSold
FROM payments p
JOIN tickets t ON p.ticketId = t.id
WHERE p.status = 'completed' AND t.eventId = ?;
- Payment records must be 100% accurate
- SQL constraints prevent orphaned records
- Foreign keys ensure referential integrity
The cost of getting it wrong with a NoSQL database in this situation:
- Overselling = Angry customers, refunds, reputation damage
- Payment inconsistencies = Financial liability, audit failures
- Lost transactions = Revenue loss, customer frustration
- SQL's strong guarantees make these disasters much less likely.
For the core ticketing logic (inventory, payments, orders), SQL is the clear winner because:
- ACID transactions prevent race conditions
- Strong consistency ensures accurate inventory
- The relational model matches the domain perfectly
- Financial data requires bulletproof integrity

SQLite
- SQLite was not designed for client/server applications: if there are many client programs sending SQL to the same database over a network, use a client/server database engine instead of SQLite. SQLite will work over a network filesystem, but because of the latency associated with most network filesystems, performance will not be great. Also, file locking logic is buggy in many network filesystem implementations (on both Unix and Windows). If file locking does not work correctly, two or more clients might try to modify the same part of the same database at the same time, resulting in corruption. Because this problem results from bugs in the underlying filesystem implementation, there is nothing SQLite can do to prevent it.
- High concurrency: SQLite supports an unlimited number of simultaneous readers, but it will only allow one writer at any instant in time. For many situations this is not a problem: writers queue up, each application does its database work quickly and moves on, and no lock lasts for more than a few dozen milliseconds. But some applications require more concurrency, and those applications may need to seek a different solution.
- The race condition happens between checking for available tickets and actually decrementing them: slow payment processing creates a large time window in which concurrent requests all see the same count
test-race-condition.js (a runnable sketch follows below)
- get the initial ticket count for the event
- create an array of 10 concurrent purchase attempts; each attempt hits the purchase endpoint, which:
  - gets the event and checks availability
    - if the event doesn't exist: return 404
    - if no tickets are available: return 400
  - if the event exists and has available tickets, try:
    - create a ticket with status pending
    - process payment: simulate an API call to the payment gateway (this opens the window for race conditions and multiple payment attempts) and simulate occasional payment failures
    - create a payment with status completed
    - update the ticket status to confirmed
    - decrement the event's available tickets
  - catch paymentError:
    - update the ticket status to cancelled and rethrow paymentError
- wait for them all to complete
- get the success count
- get the final ticket count for the event
- there's a race condition if the success count isn't the same as (initial - final)
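A minimal runnable sketch of that test (assumes Node 18+ for the built-in fetch; adjust the `/api` vs `/api/v2` prefix and the event id to whatever you're running):

```js
// test-race-condition.js — fires 10 concurrent purchases at one event (sketch)
const BASE = "http://localhost:3000/api"; // or /api/v2, depending on the version in use
const EVENT_ID = "3";                     // illustrative event id

async function getAvailable() {
  const res = await fetch(`${BASE}/events/${EVENT_ID}`);
  const event = await res.json();
  return event.availableTickets;
}

async function main() {
  const initial = await getAvailable();

  // 10 concurrent purchase attempts for the same event
  const attempts = Array.from({ length: 10 }, (_, i) =>
    fetch(`${BASE}/tickets/purchase`, {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ eventId: EVENT_ID, userId: `user-${i}` }),
    })
  );

  const responses = await Promise.all(attempts);
  const successCount = responses.filter((r) => r.ok).length;

  const final = await getAvailable();
  const decremented = initial - final;

  console.log({ initial, final, successCount, decremented });
  if (successCount !== decremented) {
    console.log("RACE CONDITION: successful purchases !== tickets removed from inventory");
  }
}

main();
```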
---
The Critical Race Condition Path
Thread 1: Checks availableTickets = 5 ✓
Thread 2: Checks availableTickets = 5 ✓ // Same value!
Thread 3: Checks availableTickets = 5 ✓ // Still same!
...
Thread 1: Creates ticket (pending)
Thread 2: Creates ticket (pending)
Thread 1: Processes payment (1-3 seconds)
Thread 2: Processes payment (1-3 seconds)
Thread 1: Updates availableTickets to 4
Thread 2: Updates availableTickets to 4 // Should be 3!
Initial count: Get starting availableTickets
Concurrent attempts: 10 parallel purchases
Success tracking: Count successful purchases
Validation: successCount ≠ (initial - final) proves overselling
- Lost tickets when payments fail
- Multiple users can buy the same ticket, double booking
- No Transaction Boundaries as each operation is independent, allowing partial failures
- Phantom Tickets because tickets can be created even when inventory is exhausted
- Database locks cause timeouts
- Inconsistent inventory counts
- Using an in-memory SQLite DB, which doesn't scale well in distributed systems
- different from client/server SQL database engines such as MySQL, Oracle, PostgreSQL, or SQL Server since SQLite is trying to solve a different problem
- Client/server SQL database engines strive to implement a shared repository of enterprise data and emphasize scalability, concurrency, centralization, and control whereas SQLite strives to provide local data storage for individual applications and devices. SQLite emphasizes economy, efficiency, reliability, independence, and simplicity.
- SQLite does not compete with client/server databases. SQLite competes with fopen().
- SQLite database requires no administration, it works well in devices that must operate without expert human support
- Client/server database engines are designed to live inside a lovingly-attended datacenter at the core of the network. SQLite works there too, but SQLite also thrives at the edge of the network, fending for itself while providing fast and reliable data services to applications that would otherwise have dodgy connectivity
- SQLite is a good fit for use in "internet of things" devices.
- Generally speaking, any site that gets fewer than 100K hits/day should work fine with SQLite
- Reference: https://sqlite.org/whentouse.html
- Better error handling
- Implement API versioning: API improvement
- Redis for distributed locking
- Bull queue for async payment processing
- Idempotency keys for payments: API improvement
- Query result caching to speed up frequently repeated search queries and reduce load on the search infrastructure: API improvement
- Implement data pagination and filtering: API improvement (see the sketch after this list)
- HATEOAS links: API improvement
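A sketch of what the pagination and filtering improvement could look like on the events list endpoint (the `db.all` helper and the query parameter names are assumptions):

```ts
// routes/events.ts — pagination/filtering sketch
import { Router, Request, Response } from "express";
import { db } from "../db"; // assumed sqlite wrapper exposing all(sql, params)

const router = Router();

// GET /api/v2/events?page=1&limit=20&venue=Arena
router.get("/", async (req: Request, res: Response) => {
  const page = Math.max(parseInt(String(req.query.page ?? "1"), 10) || 1, 1);
  const limit = Math.min(parseInt(String(req.query.limit ?? "20"), 10) || 20, 100); // cap page size
  const offset = (page - 1) * limit;
  const venue = req.query.venue as string | undefined;

  const events = await db.all(
    `SELECT * FROM events ${venue ? "WHERE venue = ?" : ""} ORDER BY date LIMIT ? OFFSET ?`,
    venue ? [venue, limit, offset] : [limit, offset]
  );

  res.json({ page, limit, data: events });
});

export default router;
```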
Client → API → [Redis Lock] → Services → Queues → SQLite
- Race conditions - Multiple users checking/buying simultaneously
- Inconsistent inventory counts - Stale reads lead to wrong counts
- Slow payments - 1-3 second payment processing blocks everything
- Phantom Tickets - Tickets created even when inventory exhausted
- Lost tickets when payments fail - No proper rollback
- Double booking - Same ticket sold multiple times
- Duplicate charges
- Database locks/timeouts
Ideas
- How do we improve the booking experience by reserving tickets? (dealing with contention / the race condition)
  - pessimistic locking
  - status & expiration time with a cron job
  - implicit status via a status field and expiration time
  - distributed lock with TTL
  - Redis for distributed locking with TTL to prevent race conditions
- How do we prevent duplicate charges?
  - Idempotent payment handling: use idempotency keys for payments
  - Proper rollback on failures
- How can we speed up frequently repeated search queries and reduce load on the search infrastructure?
  - implement caching strategies using Redis or Memcached
  - implement query result caching and edge caching techniques
- Distributed locking (Redis with TTL) solves (see the lock sketch below):
  - Race conditions
  - Double booking
  - Inconsistent inventory counts
  - Phantom tickets
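A sketch of the Redis lock with TTL wrapped around the availability check and ticket creation (using ioredis; the key naming, TTL value, and `withEventLock` wrapper are illustrative):

```ts
// distributed lock with TTL (sketch)
import Redis from "ioredis";
import { randomUUID } from "crypto";

const redis = new Redis(); // assumes Redis on localhost:6379 (e.g. from docker-compose)

export async function withEventLock<T>(eventId: string, fn: () => Promise<T>): Promise<T> {
  const key = `lock:event:${eventId}`;
  const token = randomUUID();            // so we only ever release a lock we own
  const ttlMs = 5000;                    // lock auto-expires if the process crashes

  const acquired = await redis.set(key, token, "PX", ttlMs, "NX");
  if (acquired !== "OK") throw new Error("Event is busy, try again");

  try {
    return await fn();                   // check availability + create ticket inside the lock
  } finally {
    // release only if we still hold the lock (compare-and-delete via Lua)
    await redis.eval(
      `if redis.call("get", KEYS[1]) == ARGV[1] then return redis.call("del", KEYS[1]) else return 0 end`,
      1,
      key,
      token
    );
  }
}
```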
- Async payment queue (Bull/Redis) solves (see the queue sketch below):
  - Slow payments
  - Database locks/timeouts
  - Partially helps with scalability
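A sketch of the async payment queue (using Bull; the service helpers and retry settings are assumptions):

```ts
// payment queue (sketch)
import Queue from "bull";
import * as paymentService from "./paymentService"; // assumed monolith services
import * as ticketService from "./ticketService";

const paymentQueue = new Queue("payments", "redis://127.0.0.1:6379");

// producer: the purchase endpoint enqueues and returns immediately with a "pending" ticket
export async function enqueuePayment(ticketId: string, userId: string, amount: number) {
  await paymentQueue.add(
    { ticketId, userId, amount },
    { attempts: 3, backoff: { type: "exponential", delay: 2000 } } // retry failed gateway calls
  );
}

// consumer: a worker drains the queue and confirms tickets
paymentQueue.process(async (job) => {
  const { ticketId, userId, amount } = job.data;
  await paymentService.processPayment(ticketId, userId, amount); // slow 1-3s gateway call happens here
  await ticketService.updateTicketStatus(ticketId, "confirmed");
});

// fires per failed attempt; a real version would check job.attemptsMade against the retry limit
paymentQueue.on("failed", async (job) => {
  await ticketService.updateTicketStatus(job.data.ticketId, "cancelled"); // release the reservation
});
```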
- Idempotency for payments solves:
  - Duplicate charges
  - Partial failure recovery
- Redis cache (see the caching sketch below):
  - speeds up frequently repeated search queries and reduces load on the search infrastructure
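A cache-aside sketch for the event list (the TTL, cache key, and `eventService` import path are assumptions):

```ts
// query-result caching with Redis (sketch)
import Redis from "ioredis";
import * as eventService from "./eventService"; // assumed service from the monolith

const redis = new Redis();
const CACHE_TTL_SECONDS = 30; // short TTL keeps counts roughly fresh; eventual consistency is fine for browsing

export async function getEventsCached() {
  const cached = await redis.get("cache:events:all");
  if (cached) return JSON.parse(cached);              // cache hit: skip SQLite entirely

  const events = await eventService.getAllEvents();   // cache miss: read from the database
  await redis.set("cache:events:all", JSON.stringify(events), "EX", CACHE_TTL_SECONDS);
  return events;
}
```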
How idempotency works (see the middleware sketch below)
- Idempotent payment handling: use idempotency keys for payments
- idempotency middleware
  - if the request is not a POST, call next()
  - get the idempotency key, or generate one if the client didn't supply it (derived from user, endpoint, and request body)
  - check Redis for a cached response under that key; if found, return it
  - store the original res.json method, then override res.json so the response gets cached in Redis
  - attach the key to the request for use in handlers
- Proper rollback on failures: partial failure recovery
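A sketch of that middleware (using ioredis; the header name, key derivation, and TTL are assumptions, and a real version would likely cache only successful responses):

```ts
// idempotency middleware (sketch)
import { Request, Response, NextFunction } from "express";
import { createHash } from "crypto";
import Redis from "ioredis";

const redis = new Redis();
const TTL_SECONDS = 60 * 60; // keep cached responses for an hour

export async function idempotency(req: Request, res: Response, next: NextFunction) {
  if (req.method !== "POST") return next();

  // use a client-supplied key if present, otherwise derive one from user + endpoint + body
  const key =
    req.header("Idempotency-Key") ??
    createHash("sha256").update(`${req.body?.userId}:${req.path}:${JSON.stringify(req.body)}`).digest("hex");

  const cached = await redis.get(`idem:${key}`);
  if (cached) return res.json(JSON.parse(cached)); // replay the original response, no double charge

  // override res.json so the first response gets cached
  const originalJson = res.json.bind(res);
  res.json = ((body?: unknown) => {
    redis.set(`idem:${key}`, JSON.stringify(body), "EX", TTL_SECONDS).catch(() => {});
    return originalJson(body);
  }) as typeof res.json;

  (req as Request & { idempotencyKey?: string }).idempotencyKey = key; // available to handlers
  next();
}
```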
Problem appears: Single database bottleneck
- Single Database Bottleneck (SQLite)
  - 🔴 Still can't handle true horizontal scaling
  - 🔴 Write operations still bottlenecked
  - 🔴 No replication/failover
- Additional issues in Phase 2:
- Distributed system complexity
  - Lock failures/deadlocks
  - Redis single point of failure
  - Queue processing delays
- Consistency challenges
  - Cache invalidation complexity
  - Eventual consistency between Redis/SQLite
  - Lock orphaning if a process crashes
- Operational overhead
  - Need to monitor Redis, queues, cron jobs
  - Debugging distributed transactions
  - Managing TTLs and expirations
- Still limited by SQLite
  - No read replicas
  - No sharding capability
  - File-based = single server
- Real-time updates: how will the system ensure a good user experience during high-demand events with millions booking simultaneously? + reduce unnecessary API calls + fair access to tickets
  - SSE for real-time seat updates
  - Virtual waiting queue for extremely popular events
- How can we improve search to meet the low-latency requirements?
  - indexing & SQL query optimisation
  - full-text indexes in the database
  - use a full-text search engine like Elasticsearch
- Scaling reads: how is the view API going to scale to support tens of millions of concurrent requests during popular events?
  - caching, load balancing, horizontal scaling
- Ticket reservation system (status + expiration) solves (see the sketch below):
  - ✅ Lost tickets when payments fail
  - ✅ Phantom tickets
  - ✅ Better UX during high contention
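A sketch of the reservation + expiration approach (this assumes the tickets table gains a `reserved` status and an `expiresAt` column; the db and service helpers are the ones named earlier):

```ts
// ticket reservation with status + expiration (sketch)
import { db } from "./db";                    // assumed sqlite wrapper exposing run/all
import * as eventService from "./eventService";
import * as ticketService from "./ticketService";

const RESERVATION_MINUTES = 10;

export async function reserveTicket(ticketId: string) {
  // mark the ticket as reserved and give the buyer a fixed window to pay
  const expiresAt = new Date(Date.now() + RESERVATION_MINUTES * 60_000).toISOString();
  await db.run(`UPDATE tickets SET status = 'reserved', expiresAt = ? WHERE id = ?`, [expiresAt, ticketId]);
}

// cron-style sweep: release reservations whose payment never completed
export async function releaseExpiredReservations() {
  const expired: { id: string; eventId: string }[] = await db.all(
    `SELECT id, eventId FROM tickets WHERE status = 'reserved' AND expiresAt < ?`,
    [new Date().toISOString()]
  );
  for (const t of expired) {
    await ticketService.updateTicketStatus(t.id, "cancelled");
    await eventService.updateEventTickets(t.eventId, +1); // put the ticket back in the pool
  }
}

setInterval(releaseExpiredReservations, 60_000); // run every minute
```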
- No Transaction Boundaries - Operations aren't atomic
- Database locks cause timeouts - SQLite locks entire DB
- No scalability - SQLite single-file limitation
- move to PostgreSQL
- Separate services can scale independently
- Payment service gets more servers
- New capabilities: Multi-region, analytics, real-time updates
The lockfile is important because it:
- Ensures consistent installs across different machines
- Locks specific versions of all dependencies and sub-dependencies
- Speeds up CI/CD builds
- Helps prevent security vulnerabilities by pinning versions