Comprehensive systems design patterns for scalability, distributed systems, ML infrastructure, observability, and platform engineering. Language-agnostic conceptual patterns with practical guidance.
/plugin install systems-design@claude-code-plugins
Skill
Purpose
design-interview-methodology
4-step framework for system design interviews
estimation-techniques
Back-of-envelope calculations (QPS, storage, bandwidth)
quality-attributes-taxonomy
The "-ilities" framework for NFRs
Skill
Purpose
api-design-fundamentals
REST/GraphQL/gRPC selection, resource modeling, API patterns
api-security
OAuth 2.0, OIDC, JWT, API keys, rate limiting, security vulnerabilities
api-versioning
Versioning strategies, breaking changes, deprecation
rate-limiting-patterns
Token bucket, sliding window, distributed rate limiting
idempotency-patterns
Idempotency keys, at-most-once semantics, duplicate prevention
Skill
Purpose
data-architecture
Data lakes, lakehouses, warehouses, data mesh
data-modeling
Dimensional modeling, star schema, data vault, ER design
etl-elt-patterns
Data pipelines, ETL vs ELT, transformation patterns
stream-processing
Kafka, Flink, event-driven architectures
ml-system-design
End-to-end ML pipelines, feature stores, model serving
ml-inference-optimization
Model compression, distillation, edge deployment
llm-serving-patterns
vLLM, TGI, quantization, batching, streaming
rag-architecture
Chunking, embeddings, retrieval, context assembly
vector-databases
Vector storage, ANN algorithms, semantic search
Distributed Systems & Resilience
Skill
Purpose
resilience-patterns
Circuit breakers, retries, bulkheads, failure handling
distributed-tracing
OpenTelemetry, span context, trace correlation
chaos-engineering-fundamentals
Fault injection, experiment design, resilience testing
gameday-planning
GameDay exercises, failure scenarios, chaos drills
incident-response
Incident management, runbooks, on-call practices
Skill
Purpose
observability-patterns
Logs, metrics, traces integration
slo-sli-error-budget
SLO definition, SLI selection, error budget policies
instrumentation-planning
Instrumentation strategy, naming, cardinality management
Infrastructure & Platform
Skill
Purpose
cdn-architecture
CDN design, cache hierarchies, origin shielding
edge-computing
Edge functions, compute placement, Cloudflare Workers, Lambda@Edge
multi-region-deployment
Global distribution, active-active, failover patterns
internal-developer-platform
IDP design, Backstage, portal design, platform teams
golden-paths
Standardized workflows, paved roads, guardrails
self-service-infrastructure
IaC templates, automated provisioning, Terraform/Pulumi
latency-optimization
Latency budgets, geographic routing, protocol optimization
Skill
Purpose
mtls-service-mesh
Service-to-service security, Istio, Linkerd, certificate management
zero-trust-architecture
Microsegmentation, identity-based access, ZTNA
secrets-management
Vault, AWS Secrets Manager, secret rotation
Agent
Purpose
api-reviewer
Review API designs for best practices and consistency
capacity-planner
Back-of-envelope calculations and infrastructure sizing
chaos-engineer
Design chaos experiments and validate resilience patterns
data-architect
Design data pipelines, ETL/ELT, streaming architectures
edge-architect
CDN strategies, edge deployment, global latency optimization
llm-optimization-advisor
LLM serving latency and cost optimization
ml-interviewer
ML system design interview practice
ml-systems-designer
End-to-end ML systems, feature stores, training pipelines
observability-consultant
SLO definition, monitoring strategies, three pillars
platform-engineer
Internal Developer Platforms, self-service, golden paths
rag-architect
RAG system design, embeddings, retrieval optimization
security-reviewer
Architecture security review, zero trust assessment
senior-staff-interviewer
Staff+ level interview practice with rigorous pushback
User-Invocable Skills (11)
Skill
Purpose
/systems-design:api-review
Review API design for best practices
/systems-design:chaos-plan
Design chaos experiments and GameDay plans
/systems-design:data-flow
Design data flow and pipeline architecture
/systems-design:edge-strategy
Design edge and CDN architecture
/systems-design:explain
Explain a systems design concept
/systems-design:latency-budget
Calculate and allocate latency budgets
/systems-design:ml-pipeline
Design ML pipeline architecture
/systems-design:mock-interview
Interactive system design mock interview
/systems-design:rag-design
Design RAG system architecture
/systems-design:security-review
Review architecture for security gaps
/systems-design:slo-workshop
Interactive SLO definition workshop
Complete toolkit for system design interviews:
4-step framework : Requirements, high-level design, deep dive, wrap-up
Estimation techniques : QPS, storage, bandwidth calculations
Mock interviews : Practice with ML, data, and staff+ interviewers
Design production ML systems:
ML pipelines : Feature stores, training, serving infrastructure
LLM optimization : vLLM, quantization, batching strategies
RAG systems : Chunking, embeddings, retrieval, vector databases
Observability & Reliability
Build reliable, observable systems:
SLO workshops : Define SLIs, set targets, establish error budgets
Chaos engineering : Design experiments, plan GameDays
Incident response : Runbooks, on-call practices, postmortems
Design developer platforms:
Internal Developer Platforms : Backstage, portal design
Golden paths : Standardized workflows, guardrails
Self-service infrastructure : IaC templates, automated provisioning
MIT