Platform Engineering Workbook
This workbook provides platform engineering-specific exercises, templates, and assessment tools for the Judo Engineering Self-Paced Training Program. Each exercise uses real infrastructure scenarios that platform engineers face daily.
Level 1: Force vs Leverage Recognition
Exercise 1.1: Daily Decision Audit
Time: 5 minutes daily Instructions: At the end of each day, identify one infrastructure decision where you defaulted to a force approach.
Real Example - Kubernetes Resource Management:
Date: 2025-01-27
Problem: Pods are getting OOMKilled due to insufficient memory limits
Force Approach: Increase memory limits for all pods across all namespaces, then monitor for resource waste
Leverage Alternative: Analyze actual memory usage patterns, implement horizontal pod autoscaling, and set realistic limits based on data
Key Learning: Instead of fighting against resource constraints, work with the system's natural scaling patternsTemplate:
Date: ___________
Problem: _________________________________
Force Approach: __________________________
Leverage Alternative: ____________________
Key Learning: ____________________________Exercise 1.2: Leverage Question Practice
Time: 2 minutes before starting any infrastructure work Instructions: Before beginning any new work, ask these questions:
- What’s the smallest change that creates the biggest impact?
- How can I work with existing systems instead of against them?
- What constraints can I use as creative catalysts?
Real Example - Service Mesh Implementation:
Before: "I need to implement a service mesh for microservices communication"
Leverage Questions:
- What's the smallest change? → Start with just ingress/egress traffic, not all inter-service communication
- Work with existing systems? → Use existing load balancer configuration as a foundation
- Use constraints as catalysts? → Limited budget forces me to choose the most critical services firstExercise 1.3: Force vs Leverage Journal
Time: 10 minutes weekly Instructions: Document one example each week of both force and leverage approaches in your infrastructure work.
Real Example - Database Migration:
Week of: 2025-01-20
Force Example:
Problem: Need to migrate PostgreSQL database to a new version with zero downtime
Approach: Built complex custom migration scripts with multiple rollback scenarios, extensive testing
Outcome: Took 4 weeks to build and test, worked but was overly complex and fragile
Leverage Example:
Problem: Same database migration requirement
Approach: Used PostgreSQL's logical replication to create a read replica, then promoted it
Outcome: Completed in 3 days using the database's native capabilities
Key Insight: Instead of fighting against database limitations, I worked with its built-in replication featuresLevel 2: Constraint-Based Problem Solving
Exercise 2.1: Artificial Constraint Challenge
Time: 30-60 minutes per challenge Instructions: Take a real infrastructure problem and solve it with artificial constraints.
Real Example - Monitoring Implementation:
Problem: Implement comprehensive monitoring for a Kubernetes cluster
Original Approach: Deploy Prometheus, Grafana, Jaeger, and custom dashboards for all services
Artificial Constraints: Can only use existing monitoring tools, no new deployments, must work with current resource limits
Constrained Solution:
- Extend existing Prometheus configuration to scrape new metrics
- Create Grafana dashboards using existing data sources
- Use Kubernetes native metrics (kube-state-metrics) instead of custom exporters
- Implement alerting rules that work with existing notification channels
Key Insights: The constraints forced me to work with existing systems, resulting in a more maintainable solutionExercise 2.2: Constraint Reframing Exercise
Time: 5 minutes when encountering infrastructure constraints Instructions: When you hit a limitation, practice reframing it.
Real Example - Legacy Infrastructure Constraints:
Constraint: "This legacy system doesn't support modern container orchestration"
Reframing:
- What constraint? → Limited to traditional VM-based deployment
- How could this help? → Forces me to focus on application-level monitoring and logging
- What opportunities? → Creates a clean separation between infrastructure and application concernsExercise 2.3: Minimum Viable Solution
Time: 15-30 minutes per problem Instructions: Solve infrastructure problems with the absolute minimum resources needed.
Real Example - CI/CD Pipeline:
Problem: Need to implement CI/CD for a new microservice
Minimum Solution:
- Use existing GitHub Actions workflow as template
- Deploy to existing Kubernetes cluster using existing Helm charts
- Use existing container registry and monitoring
Value Created: Automated deployment with minimal new infrastructure
Where to Add Complexity:
- Custom deployment strategies (blue-green, canary)
- Advanced testing (integration, performance)
- Custom monitoring and alerting
Why: Basic CI/CD provides immediate value, advanced features can be added incrementallyLevel 3: Momentum-Building Strategies
Exercise 3.1: Existing System Audit
Time: 20 minutes weekly Instructions: Identify what’s already working in your infrastructure and how you can build on it.
Real Example - Kubernetes Cluster Enhancement:
Existing System: Basic Kubernetes cluster with simple deployments
What's Working:
- Stable cluster with good uptime
- Basic monitoring with Prometheus
- Simple deployment process
How We Can Build On It:
- Add horizontal pod autoscaling (leverage existing metrics)
- Implement network policies (build on existing CNI)
- Add service mesh (extend existing service discovery)
Next Steps:
- Week 1: Implement HPA using existing metrics
- Week 2: Add network policies for security
- Week 3: Evaluate service mesh optionsExercise 3.2: Momentum Mapping Exercise
Time: 15 minutes when starting new infrastructure projects Instructions: Map existing energy in your organization and identify how to channel it.
Real Example - Infrastructure as Code Migration:
Project Goal: Migrate from manual server provisioning to Terraform
Existing Energy Sources:
- DevOps team already using Terraform for some services
- Management wants cost optimization (Terraform can help with resource management)
- Developers frustrated with manual deployment processes
How to Channel Energy:
- Start with services the DevOps team already manages
- Frame as cost optimization initiative (management buy-in)
- Show developers how it reduces deployment friction
Potential Resistance:
- "We don't have time to learn new tools"
- "Current process works fine"
Mitigation Strategy:
- Start with non-critical services to prove value
- Provide training sessions during lunch hours
- Show concrete time savings from automationExercise 3.3: Incremental Improvement Practice
Time: 10 minutes daily Instructions: Identify one small improvement you can make today that will compound over time.
Real Example - Infrastructure Documentation:
Today's Small Improvement: Document one Terraform module with clear input/output descriptions
Expected Compound Effect:
- Future team members understand infrastructure faster
- Reduces time spent debugging configuration issues
- Creates culture of self-documenting infrastructure
- Makes infrastructure changes safer and more predictable
How to Measure Progress:
- Track number of documented modules
- Measure time to understand infrastructure during onboarding
- Count questions asked about infrastructure configurationLevel 4: Advanced Integration
Exercise 4.1: Judo Engineering Design Review
Time: 10 minutes before starting any new infrastructure work Instructions: Evaluate all new work through the four principles.
Real Example - Microservices Infrastructure Design:
Project: Design infrastructure for new microservices architecture
Leverage Approach:
- Use existing Kubernetes cluster instead of building new infrastructure
- Extend current monitoring setup rather than implementing new tools
- Leverage existing CI/CD pipeline patterns
Constraint Strategy:
- Limited to existing cloud provider and services
- Must work with current security and compliance requirements
- Can't modify existing services (only new ones)
Momentum Building:
- Build on team's existing Kubernetes knowledge
- Use established deployment patterns
- Follow current monitoring and alerting standards
Precision Focus:
- Focus on critical infrastructure components (networking, security, monitoring)
- Skip nice-to-have features that don't impact reliability
- Prioritize observability over complex orchestrationExercise 4.2: Leverage Opportunity Hunt
Time: 5 minutes daily Instructions: Actively look for opportunities to apply judo engineering principles in your daily infrastructure work.
Real Example - Infrastructure Automation:
Opportunity: Manual infrastructure provisioning is error-prone and time-consuming
Judo Principle: Leverage over Force - Use existing tools and processes more effectively
Action Plan:
- Create Terraform modules for common infrastructure patterns
- Use existing CI/CD pipeline to deploy infrastructure changes
- Implement infrastructure testing using existing testing frameworks
Expected Impact:
- Faster infrastructure provisioning (less manual work)
- Better reliability (automated testing catches issues)
- More consistent infrastructure (standardized modules)Exercise 4.3: Team Judo Moments
Time: 15 minutes weekly team meeting Instructions: Share examples of applying judo engineering principles in infrastructure work.
Real Example - Infrastructure Migration:
Team Member: Alex (Platform Engineer)
Judo Moment: Instead of rebuilding our entire monitoring stack, I extended the existing Prometheus setup to handle the new microservices by creating reusable Helm chart templates
Principle Applied: Momentum over Muscle - Built on existing systems instead of starting from scratch
Key Learning: The existing monitoring was actually well-designed; I just needed to understand how to extend it rather than replace it
Team Discussion:
- "This approach saved us 3 weeks of work"
- "We should document this pattern for future microservices"
- "Let's create a template library for common monitoring patterns"Platform Engineering Self-Assessment Tools
Infrastructure Leverage Ratio
Instructions: Track your infrastructure decisions for one week and calculate the ratio of leverage to force approaches.
Template:
Day 1: Force: ___ Leverage: ___
Day 2: Force: ___ Leverage: ___
Day 3: Force: ___ Leverage: ___
Day 4: Force: ___ Leverage: ___
Day 5: Force: ___ Leverage: ___
Total Force: ___ Total Leverage: ___
Ratio: ___% Leverage
Goal: Increase leverage ratio by 10% each weekInfrastructure Constraint Creativity Index
Instructions: Rate your creativity in finding solutions within infrastructure constraints (1-10 scale).
Template:
Week 1: ___/10
Week 2: ___/10
Week 3: ___/10
Week 4: ___/10
Average: ___/10
Trend: ________________Infrastructure Momentum Building Score
Instructions: Rate how effectively you’re building on existing infrastructure and systems (1-10 scale).
Template:
Week 1: ___/10
Week 2: ___/10
Week 3: ___/10
Week 4: ___/10
Average: ___/10
Trend: ________________Infrastructure Precision Focus Meter
Instructions: Track how much time you spend on high-impact vs low-impact infrastructure work.
Template:
High-Impact Work: ___ hours
Low-Impact Work: ___ hours
Ratio: ___% High-Impact
Goal: Increase high-impact ratio by 5% each weekPlatform Engineering Implementation Checklist
Week 1-3: Foundation
- Team commitment session completed
- Infrastructure Judo Champion assigned
- Daily Decision Audit started
- Leverage Question Practice implemented
- Force vs Leverage Journal established
Week 4-6: Constraints
- Artificial Constraint Challenge completed
- Constraint Reframing Exercise practiced
- Minimum Viable Solution approach adopted
- Infrastructure Constraint Creativity Index baseline established
Week 7-9: Momentum
- Existing System Audit completed
- Momentum Mapping Exercise practiced
- Incremental Improvement Practice started
- Infrastructure Momentum Building Score baseline established
Week 10-12: Integration
- Judo Engineering Design Review implemented
- Leverage Opportunity Hunt active
- Team Judo Moments sharing established
- All assessment tools showing improvement
Success Celebration Template
Platform Engineering Judo Achievement Certificate
This certifies that [Team Name] has successfully completed the Platform Engineering Judo Engineering Self-Paced Training Program, demonstrating mastery of:
- Leverage over Force thinking in infrastructure
- Constraint-based problem solving in platform engineering
- Momentum-building strategies for infrastructure
- Precision focus on high-impact platform work
Key Achievements:
- Infrastructure Leverage Ratio: ___%
- Infrastructure Constraint Creativity Index: ___/10
- Infrastructure Momentum Building Score: ___/10
- Infrastructure Precision Focus: ___% high-impact
Date Completed: ___________ Team Champion: ___________