Platform Engineering Workbook

This workbook provides platform engineering-specific exercises, templates, and assessment tools for the Judo Engineering Self-Paced Training Program. Each exercise uses real infrastructure scenarios that platform engineers face daily.

Level 1: Force vs Leverage Recognition

Exercise 1.1: Daily Decision Audit

Time: 5 minutes daily Instructions: At the end of each day, identify one infrastructure decision where you defaulted to a force approach.

Real Example - Kubernetes Resource Management:

Date: 2025-01-27
Problem: Pods are getting OOMKilled due to insufficient memory limits
Force Approach: Increase memory limits for all pods across all namespaces, then monitor for resource waste
Leverage Alternative: Analyze actual memory usage patterns, implement horizontal pod autoscaling, and set realistic limits based on data
Key Learning: Instead of fighting against resource constraints, work with the system's natural scaling patterns

Template:

Date: ___________
Problem: _________________________________
Force Approach: __________________________
Leverage Alternative: ____________________
Key Learning: ____________________________

Exercise 1.2: Leverage Question Practice

Time: 2 minutes before starting any infrastructure work Instructions: Before beginning any new work, ask these questions:

What’s the smallest change that creates the biggest impact?
How can I work with existing systems instead of against them?
What constraints can I use as creative catalysts?

Real Example - Service Mesh Implementation:

Before: "I need to implement a service mesh for microservices communication"
Leverage Questions:
- What's the smallest change? → Start with just ingress/egress traffic, not all inter-service communication
- Work with existing systems? → Use existing load balancer configuration as a foundation
- Use constraints as catalysts? → Limited budget forces me to choose the most critical services first

Exercise 1.3: Force vs Leverage Journal

Time: 10 minutes weekly Instructions: Document one example each week of both force and leverage approaches in your infrastructure work.

Real Example - Database Migration:

Week of: 2025-01-20

Force Example:
Problem: Need to migrate PostgreSQL database to a new version with zero downtime
Approach: Built complex custom migration scripts with multiple rollback scenarios, extensive testing
Outcome: Took 4 weeks to build and test, worked but was overly complex and fragile

Leverage Example:
Problem: Same database migration requirement
Approach: Used PostgreSQL's logical replication to create a read replica, then promoted it
Outcome: Completed in 3 days using the database's native capabilities

Key Insight: Instead of fighting against database limitations, I worked with its built-in replication features

Level 2: Constraint-Based Problem Solving

Exercise 2.1: Artificial Constraint Challenge

Time: 30-60 minutes per challenge Instructions: Take a real infrastructure problem and solve it with artificial constraints.

Real Example - Monitoring Implementation:

Problem: Implement comprehensive monitoring for a Kubernetes cluster
Original Approach: Deploy Prometheus, Grafana, Jaeger, and custom dashboards for all services
Artificial Constraints: Can only use existing monitoring tools, no new deployments, must work with current resource limits
Constrained Solution: 
- Extend existing Prometheus configuration to scrape new metrics
- Create Grafana dashboards using existing data sources
- Use Kubernetes native metrics (kube-state-metrics) instead of custom exporters
- Implement alerting rules that work with existing notification channels
Key Insights: The constraints forced me to work with existing systems, resulting in a more maintainable solution

Exercise 2.2: Constraint Reframing Exercise

Time: 5 minutes when encountering infrastructure constraints Instructions: When you hit a limitation, practice reframing it.

Real Example - Legacy Infrastructure Constraints:

Constraint: "This legacy system doesn't support modern container orchestration"
Reframing:
- What constraint? → Limited to traditional VM-based deployment
- How could this help? → Forces me to focus on application-level monitoring and logging
- What opportunities? → Creates a clean separation between infrastructure and application concerns

Exercise 2.3: Minimum Viable Solution

Time: 15-30 minutes per problem Instructions: Solve infrastructure problems with the absolute minimum resources needed.

Real Example - CI/CD Pipeline:

Problem: Need to implement CI/CD for a new microservice
Minimum Solution: 
- Use existing GitHub Actions workflow as template
- Deploy to existing Kubernetes cluster using existing Helm charts
- Use existing container registry and monitoring
Value Created: Automated deployment with minimal new infrastructure
Where to Add Complexity: 
- Custom deployment strategies (blue-green, canary)
- Advanced testing (integration, performance)
- Custom monitoring and alerting
Why: Basic CI/CD provides immediate value, advanced features can be added incrementally

Level 3: Momentum-Building Strategies

Exercise 3.1: Existing System Audit

Time: 20 minutes weekly Instructions: Identify what’s already working in your infrastructure and how you can build on it.

Real Example - Kubernetes Cluster Enhancement:

Existing System: Basic Kubernetes cluster with simple deployments
What's Working: 
- Stable cluster with good uptime
- Basic monitoring with Prometheus
- Simple deployment process
How We Can Build On It: 
- Add horizontal pod autoscaling (leverage existing metrics)
- Implement network policies (build on existing CNI)
- Add service mesh (extend existing service discovery)
Next Steps: 
- Week 1: Implement HPA using existing metrics
- Week 2: Add network policies for security
- Week 3: Evaluate service mesh options

Exercise 3.2: Momentum Mapping Exercise

Time: 15 minutes when starting new infrastructure projects Instructions: Map existing energy in your organization and identify how to channel it.

Real Example - Infrastructure as Code Migration:

Project Goal: Migrate from manual server provisioning to Terraform
Existing Energy Sources: 
- DevOps team already using Terraform for some services
- Management wants cost optimization (Terraform can help with resource management)
- Developers frustrated with manual deployment processes
How to Channel Energy: 
- Start with services the DevOps team already manages
- Frame as cost optimization initiative (management buy-in)
- Show developers how it reduces deployment friction
Potential Resistance: 
- "We don't have time to learn new tools"
- "Current process works fine"
Mitigation Strategy: 
- Start with non-critical services to prove value
- Provide training sessions during lunch hours
- Show concrete time savings from automation

Exercise 3.3: Incremental Improvement Practice

Time: 10 minutes daily Instructions: Identify one small improvement you can make today that will compound over time.

Real Example - Infrastructure Documentation:

Today's Small Improvement: Document one Terraform module with clear input/output descriptions
Expected Compound Effect: 
- Future team members understand infrastructure faster
- Reduces time spent debugging configuration issues
- Creates culture of self-documenting infrastructure
- Makes infrastructure changes safer and more predictable
How to Measure Progress: 
- Track number of documented modules
- Measure time to understand infrastructure during onboarding
- Count questions asked about infrastructure configuration

Level 4: Advanced Integration

Exercise 4.1: Judo Engineering Design Review

Time: 10 minutes before starting any new infrastructure work Instructions: Evaluate all new work through the four principles.

Real Example - Microservices Infrastructure Design:

Project: Design infrastructure for new microservices architecture
Leverage Approach: 
- Use existing Kubernetes cluster instead of building new infrastructure
- Extend current monitoring setup rather than implementing new tools
- Leverage existing CI/CD pipeline patterns
Constraint Strategy: 
- Limited to existing cloud provider and services
- Must work with current security and compliance requirements
- Can't modify existing services (only new ones)
Momentum Building: 
- Build on team's existing Kubernetes knowledge
- Use established deployment patterns
- Follow current monitoring and alerting standards
Precision Focus: 
- Focus on critical infrastructure components (networking, security, monitoring)
- Skip nice-to-have features that don't impact reliability
- Prioritize observability over complex orchestration

Exercise 4.2: Leverage Opportunity Hunt

Time: 5 minutes daily Instructions: Actively look for opportunities to apply judo engineering principles in your daily infrastructure work.

Real Example - Infrastructure Automation:

Opportunity: Manual infrastructure provisioning is error-prone and time-consuming
Judo Principle: Leverage over Force - Use existing tools and processes more effectively
Action Plan: 
- Create Terraform modules for common infrastructure patterns
- Use existing CI/CD pipeline to deploy infrastructure changes
- Implement infrastructure testing using existing testing frameworks
Expected Impact: 
- Faster infrastructure provisioning (less manual work)
- Better reliability (automated testing catches issues)
- More consistent infrastructure (standardized modules)

Exercise 4.3: Team Judo Moments

Time: 15 minutes weekly team meeting Instructions: Share examples of applying judo engineering principles in infrastructure work.

Real Example - Infrastructure Migration:

Team Member: Alex (Platform Engineer)
Judo Moment: Instead of rebuilding our entire monitoring stack, I extended the existing Prometheus setup to handle the new microservices by creating reusable Helm chart templates
Principle Applied: Momentum over Muscle - Built on existing systems instead of starting from scratch
Key Learning: The existing monitoring was actually well-designed; I just needed to understand how to extend it rather than replace it
Team Discussion: 
- "This approach saved us 3 weeks of work"
- "We should document this pattern for future microservices"
- "Let's create a template library for common monitoring patterns"

Platform Engineering Self-Assessment Tools

Infrastructure Leverage Ratio

Instructions: Track your infrastructure decisions for one week and calculate the ratio of leverage to force approaches.

Template:

Day 1: Force: ___ Leverage: ___
Day 2: Force: ___ Leverage: ___
Day 3: Force: ___ Leverage: ___
Day 4: Force: ___ Leverage: ___
Day 5: Force: ___ Leverage: ___

Total Force: ___ Total Leverage: ___
Ratio: ___% Leverage

Goal: Increase leverage ratio by 10% each week

Infrastructure Constraint Creativity Index

Instructions: Rate your creativity in finding solutions within infrastructure constraints (1-10 scale).

Template:

Week 1: ___/10
Week 2: ___/10
Week 3: ___/10
Week 4: ___/10

Average: ___/10
Trend: ________________

Infrastructure Momentum Building Score

Instructions: Rate how effectively you’re building on existing infrastructure and systems (1-10 scale).

Template:

Week 1: ___/10
Week 2: ___/10
Week 3: ___/10
Week 4: ___/10

Average: ___/10
Trend: ________________

Infrastructure Precision Focus Meter

Instructions: Track how much time you spend on high-impact vs low-impact infrastructure work.

Template:

High-Impact Work: ___ hours
Low-Impact Work: ___ hours
Ratio: ___% High-Impact

Goal: Increase high-impact ratio by 5% each week

Platform Engineering Implementation Checklist

Week 1-3: Foundation

Team commitment session completed
Infrastructure Judo Champion assigned
Daily Decision Audit started
Leverage Question Practice implemented
Force vs Leverage Journal established

Week 4-6: Constraints

Artificial Constraint Challenge completed
Constraint Reframing Exercise practiced
Minimum Viable Solution approach adopted
Infrastructure Constraint Creativity Index baseline established

Week 7-9: Momentum

Existing System Audit completed
Momentum Mapping Exercise practiced
Incremental Improvement Practice started
Infrastructure Momentum Building Score baseline established

Week 10-12: Integration

Judo Engineering Design Review implemented
Leverage Opportunity Hunt active
Team Judo Moments sharing established
All assessment tools showing improvement

Success Celebration Template

Platform Engineering Judo Achievement Certificate

This certifies that [Team Name] has successfully completed the Platform Engineering Judo Engineering Self-Paced Training Program, demonstrating mastery of:

Leverage over Force thinking in infrastructure
Constraint-based problem solving in platform engineering
Momentum-building strategies for infrastructure
Precision focus on high-impact platform work

Key Achievements:

Infrastructure Leverage Ratio: ___%
Infrastructure Constraint Creativity Index: ___/10
Infrastructure Momentum Building Score: ___/10
Infrastructure Precision Focus: ___% high-impact

Date Completed: ___________ Team Champion: ___________

Platform engineering is about working with existing infrastructure and systems. Focus on extending what works rather than replacing it.

Infrastructure constraints often exist for good reasons. Work with them to find creative solutions rather than fighting against them.

Small, strategic infrastructure changes often create disproportionate impact. Focus on the most critical components first.