Systems Design Plugin

Comprehensive systems design patterns for scalability, distributed systems, ML infrastructure, observability, and platform engineering. Language-agnostic conceptual patterns with practical guidance.

Installation

/plugin install systems-design@claude-code-plugins

Skills (35)

Interview Preparation

| Skill | Purpose |
|-------|---------|
| design-interview-methodology | 4-step framework for system design interviews |
| estimation-techniques | Back-of-envelope calculations (QPS, storage, bandwidth) |
| quality-attributes-taxonomy | The "-ilities" framework for NFRs |

API Design

| Skill | Purpose |
|-------|---------|
| api-design-fundamentals | REST/GraphQL/gRPC selection, resource modeling, API patterns |
| api-security | OAuth 2.0, OIDC, JWT, API keys, rate limiting, security vulnerabilities |
| api-versioning | Versioning strategies, breaking changes, deprecation |
| rate-limiting-patterns | Token bucket, sliding window, distributed rate limiting |
| idempotency-patterns | Idempotency keys, at-most-once semantics, duplicate prevention |

Data & ML

| Skill | Purpose |
|-------|---------|
| data-architecture | Data lakes, lakehouses, warehouses, data mesh |
| data-modeling | Dimensional modeling, star schema, data vault, ER design |
| etl-elt-patterns | Data pipelines, ETL vs ELT, transformation patterns |
| stream-processing | Kafka, Flink, event-driven architectures |
| ml-system-design | End-to-end ML pipelines, feature stores, model serving |
| ml-inference-optimization | Model compression, distillation, edge deployment |
| llm-serving-patterns | vLLM, TGI, quantization, batching, streaming |
| rag-architecture | Chunking, embeddings, retrieval, context assembly |
| vector-databases | Vector storage, ANN algorithms, semantic search |

Distributed Systems & Resilience

| Skill | Purpose |
|-------|---------|
| resilience-patterns | Circuit breakers, retries, bulkheads, failure handling |
| distributed-tracing | OpenTelemetry, span context, trace correlation |
| chaos-engineering-fundamentals | Fault injection, experiment design, resilience testing |
| gameday-planning | GameDay exercises, failure scenarios, chaos drills |
| incident-response | Incident management, runbooks, on-call practices |

Observability

| Skill | Purpose |
|-------|---------|
| observability-patterns | Integration of logs, metrics, and traces |
| slo-sli-error-budget | SLO definition, SLI selection, error budget policies |
| instrumentation-planning | Instrumentation strategy, naming, cardinality management |

Infrastructure & Platform

| Skill | Purpose |
|-------|---------|
| cdn-architecture | CDN design, cache hierarchies, origin shielding |
| edge-computing | Edge functions, compute placement, Cloudflare Workers, Lambda@Edge |
| multi-region-deployment | Global distribution, active-active, failover patterns |
| internal-developer-platform | IDP design, Backstage, portal design, platform teams |
| golden-paths | Standardized workflows, paved roads, guardrails |
| self-service-infrastructure | IaC templates, automated provisioning, Terraform/Pulumi |
| latency-optimization | Latency budgets, geographic routing, protocol optimization |

Security

| Skill | Purpose |
|-------|---------|
| mtls-service-mesh | Service-to-service security, Istio, Linkerd, certificate management |
| zero-trust-architecture | Microsegmentation, identity-based access, ZTNA |
| secrets-management | Vault, AWS Secrets Manager, secret rotation |

Agents (13)

| Agent | Purpose |
|-------|---------|
| api-reviewer | Review API designs for best practices and consistency |
| capacity-planner | Back-of-envelope calculations and infrastructure sizing |
| chaos-engineer | Design chaos experiments and validate resilience patterns |
| data-architect | Design data pipelines, ETL/ELT, streaming architectures |
| edge-architect | CDN strategies, edge deployment, global latency optimization |
| llm-optimization-advisor | LLM serving latency and cost optimization |
| ml-interviewer | ML system design interview practice |
| ml-systems-designer | End-to-end ML systems, feature stores, training pipelines |
| observability-consultant | SLO definition, monitoring strategies, three pillars |
| platform-engineer | Internal Developer Platforms, self-service, golden paths |
| rag-architect | RAG system design, embeddings, retrieval optimization |
| security-reviewer | Architecture security review, zero trust assessment |
| senior-staff-interviewer | Staff+ level interview practice with rigorous pushback |

User-Invocable Skills (11)

| Skill | Purpose |
|-------|---------|
| /systems-design:api-review | Review API design for best practices |
| /systems-design:chaos-plan | Design chaos experiments and GameDay plans |
| /systems-design:data-flow | Design data flow and pipeline architecture |
| /systems-design:edge-strategy | Design edge and CDN architecture |
| /systems-design:explain | Explain a systems design concept |
| /systems-design:latency-budget | Calculate and allocate latency budgets |
| /systems-design:ml-pipeline | Design ML pipeline architecture |
| /systems-design:mock-interview | Interactive system design mock interview |
| /systems-design:rag-design | Design RAG system architecture |
| /systems-design:security-review | Review architecture for security gaps |
| /systems-design:slo-workshop | Interactive SLO definition workshop |

Features

Interview Preparation

Complete toolkit for system design interviews:

  • 4-step framework: Requirements, high-level design, deep dive, wrap-up
  • Estimation techniques: QPS, storage, bandwidth calculations
  • Mock interviews: Practice with ML, data, and staff+ interviewers
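
As a rough illustration of the kind of back-of-envelope estimate listed above, the sketch below derives QPS, storage, and bandwidth from a few workload inputs. It is not taken from the plugin: the `Workload` fields, the `estimate` function, and the default peak factor of 2 are assumptions made for this example.

```python
from dataclasses import dataclass

SECONDS_PER_DAY = 86_400


@dataclass
class Workload:
    """Hypothetical workload inputs for a back-of-envelope estimate."""
    daily_active_users: int
    requests_per_user_per_day: float
    avg_payload_bytes: int
    write_fraction: float      # share of requests that persist data (0..1)
    peak_factor: float = 2.0   # assumed ratio of peak to average traffic


def estimate(w: Workload, retention_days: int = 365) -> dict:
    """Return rough QPS, storage, and bandwidth figures for a workload."""
    requests_per_day = w.daily_active_users * w.requests_per_user_per_day
    avg_qps = requests_per_day / SECONDS_PER_DAY
    peak_qps = avg_qps * w.peak_factor
    # Only writes consume storage; keep them for the retention period.
    writes_per_day = requests_per_day * w.write_fraction
    storage_bytes = writes_per_day * w.avg_payload_bytes * retention_days
    # Bandwidth at peak, in bits per second.
    peak_bandwidth_bps = peak_qps * w.avg_payload_bytes * 8
    return {
        "avg_qps": avg_qps,
        "peak_qps": peak_qps,
        "storage_bytes": storage_bytes,
        "peak_bandwidth_bps": peak_bandwidth_bps,
    }


if __name__ == "__main__":
    workload = Workload(
        daily_active_users=10_000,
        requests_per_user_per_day=864,
        avg_payload_bytes=1_000,
        write_fraction=0.25,
    )
    for name, value in estimate(workload).items():
        print(f"{name}: {value:,.0f}")
```

In an interview setting the inputs are usually rounded aggressively (for example, treating a day as 100,000 seconds); the exact constant is used here only to keep the arithmetic easy to check.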

ML & AI Systems

Design production ML systems:

  • ML pipelines: Feature stores, training, serving infrastructure
  • LLM optimization: vLLM, quantization, batching strategies
  • RAG systems: Chunking, embeddings, retrieval, vector databases

Observability & Reliability

Build reliable, observable systems:

  • SLO workshops: Define SLIs, set targets, establish error budgets
  • Chaos engineering: Design experiments, plan GameDays
  • Incident response: Runbooks, on-call practices, postmortems
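
To make the error budget idea above concrete, here is a minimal sketch of the arithmetic behind an availability SLO. The function names and the 30-day window are assumptions for illustration, not part of the plugin.

```python
def allowed_downtime_minutes(slo_target: float, window_days: int = 30) -> float:
    """Minutes of full downtime an availability SLO permits over the window."""
    if not 0 < slo_target < 1:
        raise ValueError("slo_target must be strictly between 0 and 1")
    window_minutes = window_days * 24 * 60
    return (1 - slo_target) * window_minutes


def budget_remaining(slo_target: float, total_requests: int,
                     failed_requests: int) -> float:
    """Fraction of a request-based error budget still unspent.

    1.0 means nothing has been spent, 0.0 means the budget is exhausted,
    and a negative value means the SLO has been violated.
    """
    if not 0 < slo_target < 1:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    allowed_failures = (1 - slo_target) * total_requests
    return 1 - failed_requests / allowed_failures


if __name__ == "__main__":
    # A 99.9% target over 30 days allows roughly 43.2 minutes of downtime.
    print(f"{allowed_downtime_minutes(0.999):.1f} minutes")
    # 250 failures out of 1,000,000 requests spends about a quarter of the budget.
    print(f"{budget_remaining(0.999, 1_000_000, 250):.2%} remaining")
```

An error budget policy typically ties actions to thresholds on the remaining fraction, such as pausing risky releases once it approaches zero; the thresholds themselves are a team decision rather than something this calculation determines.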

Platform Engineering

Design developer platforms:

  • Internal Developer Platforms: Backstage, portal design
  • Golden paths: Standardized workflows, guardrails
  • Self-service infrastructure: IaC templates, automated provisioning

License

MIT