Change Management with Blast Radius

Last Updated: January 7, 2026 Estimated Reading Time: 7 minutes

Overview

The Blast Radius feature helps you visualize the cascading impact of CI failures before implementing changes. This guide shows you how to integrate blast radius analysis into your change management process to reduce outages and make better-informed decisions.

What is Blast Radius?

Blast radius simulates what happens when a Configuration Item (CI) fails or becomes unavailable. It shows:

Direct Impact: CIs affected immediately (for example, applications that run on a server)
Cascading Impact: Downstream CIs affected through dependencies
Business Impact: Services and users impacted by the change
Risk Assessment: Severity level based on criticality and scope

Think of blast radius as "what breaks if this CI goes down?" It is essential for change planning, risk assessment, and incident response.

When to Use Blast Radius

Change Management Scenarios

Server Maintenance: Assess impact before patching or rebooting
Infrastructure Upgrades: Understand dependencies before hardware changes
Application Deployments: Identify affected services and users
Network Changes: Visualize connectivity impacts
Decommissioning: Verify no critical dependencies before retiring CIs

Risk Assessment Questions

Answer these questions with blast radius:

What services will be unavailable during maintenance?
Which users/teams will be affected?
Are there alternative CIs that can take over?
What is the worst-case scenario if the change fails?
Should this change require a CAB (Change Advisory Board) review?

How to Use Blast Radius Analysis

Step 1: Access Blast Radius

From CI Health Workspace:

Go to CI Health Workspace
Find the CI you want to analyze
Click on the CI card
Select "Blast Radius" tab in the details panel

From Dependency Map:

Go to Dependency Map
Locate the target CI on the graph
Right-click the CI node
Select "Show Blast Radius"

Direct Access:

Go to Features → Blast Radius
Search for the CI by name or ID
Click "Analyze"

Step 2: Configure Simulation Parameters

Impact Depth:

1 Level: Direct dependencies only (immediate impact)
2 Levels: Direct + one degree of separation (recommended)
3 Levels: Full cascading impact (comprehensive view)
Unlimited: All connected CIs (useful for critical systems)

Failure Scenario:

Planned Outage: Graceful shutdown (allows failover)
Unplanned Failure: Sudden loss (worst-case scenario)
Degraded Performance: Partial availability (50% capacity)

Time Window:

Immediate: Impact at time of change
Scheduled: Impact during maintenance window
Extended: Impact if recovery takes longer than expected

Start with 2 levels and "Planned Outage" for most change requests. Use "Unplanned Failure" + Unlimited depth for disaster recovery planning.

Step 3: Interpret Results

Visual Indicators:

Red nodes: Critically impacted CIs
Orange nodes: High impact CIs
Yellow nodes: Moderate impact CIs
Gray nodes: Low impact or redundant CIs

Impact Summary Panel:

Total CIs Affected: 47
├── Critical: 5 (production services)
├── High: 12 (applications)
├── Medium: 18 (supporting infrastructure)
└── Low: 12 (dev/test systems)

Business Services Impacted: 3
├── Online Banking (Critical)
├── Mobile App (High)
└── Internal Portal (Medium)

Estimated Users Affected: 15,000
Risk Level: HIGH

Dependency Chain View:

Shows path from target CI to affected services
Identifies single points of failure
Highlights redundancy gaps

Step 4: Assess Risk Level

Risk Matrix:

| Affected CIs | Critical Services | Risk Level | Recommendation | |--------------|-------------------|------------|----------------| | 0-10 | 0 | LOW | Standard change process | | 11-50 | 0-1 | MEDIUM | Manager approval required | | 51-100 | 2-5 | HIGH | CAB review required | | 100+ | 5+ | CRITICAL | Executive approval + rollback plan |

Additional Risk Factors:

Timing: Changes during business hours increase risk
Redundancy: No failover = higher risk
Testing: Untested changes = higher risk
Rollback: No rollback plan = higher risk

Step 5: Document Findings

Include in Change Request:

Blast Radius Screenshot: Visual representation of impact
Impact Summary: Number of CIs and services affected
Business Impact: Services unavailable and estimated users
Risk Assessment: Risk level with justification
Mitigation Plan: How you will reduce the impact
Rollback Plan: How to revert if issues arise

Example Documentation:

Change Request: Patch DB Server PROD-SQL-01
Blast Radius Analysis (2 levels, planned outage):
- Total Impact: 23 CIs
- Critical Services: 2 (Online Banking, Payment Processing)
- Estimated Users: 8,500
- Risk Level: HIGH

Mitigation:
- Schedule during maintenance window (Sat 2-6 AM)
- Enable database replication failover
- Pre-stage rollback scripts
- Notify affected teams 48 hours in advance

Rollback Plan:
- Revert to previous server snapshot (15 min)
- Restore database from backup if needed (2 hours)

Integration with Change Management Process

Pre-Change Assessment

Step 1: Initial Screening

Run blast radius analysis for proposed change
Determine risk level
Route to appropriate approval chain

Step 2: CAB Review (for high/critical risk)

Present blast radius visualization to CAB
Discuss mitigation strategies
Approve, defer, or reject based on risk

Step 3: Stakeholder Notification

Identify affected teams from blast radius
Send impact notifications with:
- Affected services
- Maintenance window
- Expected outage duration
- Contact for questions

During Change Implementation

Step 1: Pre-Implementation Validation

Re-run blast radius to catch any CMDB updates
Verify mitigation measures are in place
Confirm rollback plan is ready

Step 2: Monitoring

Watch for unexpected impacts beyond blast radius
Monitor health scores of affected CIs
Be prepared to execute rollback

Step 3: Post-Change Verification

Verify all CIs in blast radius are operational
Check for cascading issues not predicted
Update CMDB if new dependencies discovered

Post-Change Review

Accuracy Assessment:

Compare actual impact vs. predicted blast radius
Identify false positives (CIs not actually affected)
Identify false negatives (CIs affected but not predicted)

CMDB Improvement:

Add missing relationships discovered during change
Update CI criticality if impact was underestimated
Document lessons learned

If actual impact exceeds predicted blast radius by 20%+, your CMDB relationships are incomplete. Schedule a CMDB audit.

Advanced Blast Radius Techniques

Scenario Planning

Test Multiple Scenarios:

Best Case: Planned outage with successful failover
Expected Case: Planned outage with partial failover
Worst Case: Unplanned failure during peak hours

Compare results to understand risk range.

Comparative Analysis

Before Infrastructure Changes:

Run blast radius on current infrastructure
Model proposed new infrastructure in CSDM Workbench
Run blast radius on new design
Compare impact reduction

Example:

Current: Single web server
- Blast Radius: 12 critical services affected
- Risk: CRITICAL

Proposed: Load-balanced web server cluster
- Blast Radius: 0 critical services (automatic failover)
- Risk: LOW

Dependency Mapping

Identify Single Points of Failure:

Run blast radius on all critical CIs
Sort by "Total Impact" descending
CIs with highest impact are single points of failure
Prioritize redundancy investments

Incident Response Planning

Pre-Build Blast Radius Reports:

Generate blast radius for all critical infrastructure
Save reports for offline access
Include in incident runbooks
Update quarterly

During Major Incidents:

Quickly reference pre-built blast radius
Understand full scope of outage
Prioritize restoration order (critical services first)

Best Practices

Recommended Actions

Run Blast Radius Before Every Change: Make it a required step
Document Risk Level: Include in change request
Update CMDB First: Accurate data = accurate predictions
Compare Predicted vs. Actual: Improve CMDB accuracy over time
Share with Stakeholders: Use visuals to communicate risk

Actions to Avoid

Do Not Skip for "Small" Changes: Unexpected impacts happen.
Do Not Ignore Warning Signs: High impact needs more planning.
Do Not Proceed Without Mitigation: Reduce the risk before you make the change.
Do Not Forget Rollback Plans: Always have an exit strategy.
Do Not Trust Outdated Data: Sync the CMDB before analysis.

How to Measure Blast Radius Effectiveness

Key Metrics

Change Success Rate: % of changes without unplanned outages
Prediction Accuracy: Actual vs. predicted impact correlation
MTTR Improvement: Faster recovery with better planning
CAB Efficiency: Faster reviews with clear risk data
Incident Reduction: Fewer outages from better planning

Success Indicators

Change success rate > 95%
Blast radius prediction accuracy > 80%
High-risk changes always have mitigation plans
CAB approval time reduced by 30%+
Unplanned outages reduced by 50%+

Dependency Map - How to visualize dependencies
Health Dashboard - How to monitor CI health
CSDM Workbench - How to model infrastructure
Best Practices - CMDB hygiene

Need Help?

Contact [email protected] or use the AI Assistant to analyze blast radius for your specific changes.