Navigating Change & Uncertainty in Network Administration: A Guide to IPAM, DDI, and the Art of Transformation

There is a tension every network administrator knows well: when the network works, it’s invisible. When it fails, it’s everybody’s emergency.

Networks are never static. Devices appear without tickets, subnets get carved informally, DNS records accumulate, and IPAM databases drift from reality the moment they’re created. The conditions that make change necessary also make change dangerous.

This guide covers four phases of change management: auditing the real environment; building a structured change process; executing under real-world constraints; and using device sequestration as a controlled change management tool, with particular focus on IPAM and DDI.

Auditing What You Have

Start With an Inventory

Every organization believes its documentation is better than it is. The most dangerous version is not obviously missing documentation. It is wrong documentation that you trust. Treat your IPAM database as an hypothesis, not the truth itself. Then test it against reality.

The Audit Toolkit

Network Discovery: Nmap remains the minimum standard first-step in active host discovery, but good discovery involves layers of detail and the ability to tie those details together. A capable IPAM (or even better, DDI) stores metadata accurately, where smaller issues, such as devices having multiple IP addresses, don’t break scanning systems.

However, while Nmap is effective for legacy subnets, active scanning is not workable for IPv6. In dual-stack or native IPv6 environments, you must pull discovery data from router NDP cache tables, IPv6 multicast listener discovery (MLD), and rely heavily on the IPAM as the definitive source of truth rather than relying on sweeps. In sum, a layered approach creates the best representation of reality.

IPAM Platforms: These platforms enable running reconciliations on a schedule and acting on the results.

DDI Auditing: Cross-reference DNS A and PTR records, DHCP lease and reservation data, and active host discovery. Key items to find:

DNS A and AAAA records with no corresponding DHCP reservation and no live host responding
PTR records that do not match forward A records (common after incomplete migrations)
DHCP reservations for MAC addresses with no lease in over 90 days
DHCP scopes above 80% utilization that are not flagged for expansion

SNMP and NetFlow: SNMP neighbor tables on core switches are usually the most complete map of what’s connected. NetFlow identifies communication patterns your documented topology doesn’t know about or report on. Syslog from DHCP and DNS servers capture devices that only appear overnight and are missed by daytime scans and should also be tied to an IPAM as the service that kicks everything off.

Interpreting the Gaps

Orphaned IPs and zombie DNS records are indicators of process failure. The missing decommissioning checklist is the disease, the records are symptoms. Overlapping subnets may represent potential routing problems, but not always – in some cases, the subnets will overlap because teams can’t renumber. DHCP lease exhaustion patterns are predictive so track trends proactively rather than responding to emergency ‘no available address’ events.

Watch for ‘shadow IPAM’: when the official IPAM is slow or bureaucratic to update. This can happen when teams maintain their own spreadsheets or disparate IPAMs. These shadow records may be more current for assets they cover but are invisible to everyone else. Obviously, this creates blind spots.

Be aware that as networks modernize, traditional DHCP use metrics lose visibility. You should also audit DHCPv6 Prefix Delegation (DHCPv6-PD) exhaustion and ensure your IPAM can track devices self-assigning addresses via SLAAC (Stateless Address Autoconfiguration). A healthy IPv4 scope might hide a completely exhausted IPv6 prefix pool.

Documenting What You Find

Structure your gap register around four confidence categories:

Authoritative: IPAM, DNS, DHCP, and discovery all agree; ownership is known
Observed: discovered on the network but not fully documented
Suspected: in IPAM but not responding; may be decommissioned or intermittently active. Additionally, in DDI if the object appears in IPAM and DHCP but not DNS.
Unknown: present on the network with no documentation at all.

Build an ownership field into your IPAM for every subnet and every static assignment. No owner means no support. Let the team know no support means you have standing to sequester or reclaim the resource if needed.

Defining the Timeline and Process

Sizing the Scope

Categorize every change by scope of change:

Cosmetic; renaming IPAM objects, updating documentation. This allows an expedited review path.
Structural: re-IPing a subnet, migrating DHCP scopes, reorganizing DNS zones. This requires careful planning and staged execution.
Systemic: replacing the IPAM platform, restructuring DNS namespace, implementing DNSSEC. Serious change management that requires architectural planning, extended parallel-run work, and executive agreement.

For each planned change, define the blast radius explicitly before writing the change ticket. Note the specific systems that are affected, the teams to notify, and the dependencies among applications, the networks and the system generally that could be disrupted.

Building a Timeline

Build realistic timelines by working backward from what the change actually requires, not forward from when someone wants or needs it done. Phase changes by risk level, with low-blast-radius changes first. This planning will help form an understanding of the level of difficulty involved in making the change happen.

However, given this approach to early planning, the business application of the changes also have to be considered. Planners should consult with business process owners, sharing changes, timeframes and risk assessments. This may alter timeframes and subdivide tasks to reduce risk or fit into tighter maintenance windows. Bringing in business owners early is better than spending a great deal of time on detailed planning and preparation only to have those plans changed due to perceived risk or business needs.

DNS TTL management is an under-used timeline tool. Before any DNS change, reduce affected record TTLs to 300 seconds and let that propagate for from 24 to 48 hours. When you make the actual change, resolvers pick up the new record within five minutes rather than standing by for a 3,600-second TTL. Also build change blackouts into your timeline explicitly. Consider end-of-quarter, major launches, holiday periods, audit windows. Wherever possible, phase changes needs documented rollback path.

Designing the Process

Change request templates should capture the specific change, the business driver, the blast radius assessment, a rollback procedure, your validation steps, the maintenance window, and all the stakeholders notified. Be sure to define ‘done’ before you start. A change is complete when the intended state has been verified from the perspective of the consuming service.

Stakeholder communication is what technical people most often overlook or minimize. Application owners need to know their service will be briefly unreachable. The help desk needs to know users may call. Security needs to know firewall rules referencing old IPs will need updating.

The runbook is not optional. It shouldn’t be written during the change window. It documents every step, with exact commands, expected result of each step, and decision points where you pause to assess before continuing. Writing the runbook is itself should be a planned exercise. Being methodical in creating it frequently exposes gaps in the change plan.

Executing Under Uncertainty

IPAM and DDI Changes: The Order of Operations

Re-IP Projects: The correct sequence is DNS updates first, then DHCP reservations, then static assignments, then firewall and ACL rules. For static assignments, work outside-in: change leaf devices before core devices. Changing a core router’s address should be done as a monitored transition, not a cutover. In practice, the old and new ranges can run in parallel across separate interfaces while both are announced. During that period, teams should track any traffic still sending to or requesting data from the decommissioned IPs. Once that tally remains at zero for a defined window, the legacy dependencies are resolved and the core addresses can be safely changed

Zone Delegation Changes: Old and new authoritative servers must both serve correct answers during the whole propagation window (24 to 48 hours). Do not decommission the old server until complete propagation is confirmed. Use DNSViz to verify delegation status from multiple vantage points.

DHCP Scope Migrations: Use the overlap window technique: run old and new scopes in parallel with overlapping lease times, modifying the lease times to standard once the migration is complete. Existing leases naturally renew against the new scope as they expire, avoiding a simultaneous mass renewal. This avoids brief but nasty widespread connectivity loss.

Split-DNS Testing: A successful test from your workstation does not confirm that external clients will see the change correctly. Test from both internal and external contexts and don’t think you’ve been successful until both return the expected answer.

API-Driven Changes: Whenever possible, execute these sequences using Infrastructure as Code (IaC) tools. Pushing zone delegation changes or bulk Re-IPs through your DDI platform’s API ensures the execution is perfectly repeatable and allows for rapid, automated rollback if your validation trips an escalation threshold.

Maintaining Visibility and Human Factors

Prior to implementation, it’s recommended to run a new round of network discovery to certify that no network changes occurred within that period. Those that may not have been picked up in the initial discovery will require retroactive support and will disrupt the process. Monitor systems cache data before major change windows. Configure a real-time supplemental view. Including continuous pings to critical hosts, a streaming NetFlow view, or a DHCP server log tail. Packet capture at network chokepoints is invaluable during high-risk changes. Define escalation tripwires before the change window opens. Know specific, observable conditions that trigger a pause-and-assess decision.

Keep a running log of what was executed and when (not just ‘ran DNS update script’ but ‘ran DNS update script at 22:47, confirmed new records visible from internal resolver at 22:51’). Resist the ‘it’s working, ship it’ temptation before completing the soak period. Many change-related incidents show up hours after the change when caches expire and offline clients come back online.

Sequestration

What Sequestration Means

Sequestration means placing a device in a controlled, restricted network environment where it can be observed and managed. There, its ability to affect other systems is controlled. It is different from decommissioning (retirement) and security quarantine (threat response). Operational sequestration is a proactive tool for devices that are unknown, non-compliant, pending changes, or awaiting decisions about their future.

Examples include unknown devices that turn up during discovery; end-of-life systems that cannot be retired yet; devices pending re-IP as part of a migration; test systems that should not communicate with production; devices that failed post-change validation.

Sequestration VLAN Architecture

The minimum design to move forward includes:

A distinct subnet with its own DHCP scope. This is needed so sequestered devices receive clearly identifiable addresses.
A default gateway with ACLs that limit outbound connectivity only to management systems
A DNS sinkhole or captive portal so devices attempting to reach unauthorized resources get a controlled response
Full packet capture or NetFlow collection on sequestration VLAN egress

With 802.1X NAC, devices failing authentication can be automatically placed in the sequestration VLAN via RADIUS-driven VLAN assignment. DHCP can fingerprint devices into sequestration automatically using DHCP option 55 or vendor class option 60. Sequestered devices must receive fixed address assignments (not dynamic leases) so a rebooted device returns to the same controlled environment.

And, security policies should encompass all routing planes to prevent data leakage. Make sure your sequestration architecture is strictly dual-stack aware. A device placed in a restricted VLAN with perfectly configured legacy ACLs and sinkholes can still exfiltrate data or communicate laterally if the corresponding IPv6 routing and security policies are not identically enforced on the gateway.

IPAM Hygiene and Return Process

IPAM records for sequestered devices must include the fact of sequestration, the date of it, the reason it, an owner or responsible team if possible, and an planned review date. Without these, sequestration becomes a place where devices go to be forgotten.

A complete return-from-sequestration process includes:

A review of the original sequestration reason and confirmation it has been resolved
Validation that IPAM, DNS, and DHCP records are correct and complete
Ownership confirmation. Someone has to explicitly claim the device and accept responsibility for it. Of course, in large environments the responsible parties are likely to be numerous. You need to define the hardware, software and application stack responsibilities as they are typically going to be different people/groups.
Post-return monitoring period with alert suppression removed
IPAM status update clearing the sequestration flag and documenting the resolution

Closing the Loop

Post-Change Reconciliation

Within 48 hours of a change process completion, run a full reconciliation of IPAM and DDI for affected subnets and zones.

Confirm every change is accurately reflected
Confirm DNS and DHCP match IPAM state
Confirm temporary artifacts (parallel DNS entries, interim DHCP reservations) have been cleaned up.

Then archive all records for decommissioned resources with a decommission date rather than deleting them. This historical record may turn out to be valuable, so much so that ProVision has a holding tank for this use case: admins need a way to store the IP data in a safe way so it won’t be accidentally re-used until the block is clear and ready to be used on the network.

Metrics for IPAM and DDI Health

IPAM accuracy rate: The percentage of discovered live hosts with an accurate IPAM record. Healthy: above 95%; below 80% indicates serious process gaps
DNS record staleness ratio: The percentage of A records with no live host at the recorded IP. Above 10–15% usually signals a decommissioning process failure
DHCP scope utilization trends: Track over time, not just current utilization; scopes trending toward exhaustion are predictable failures
Mean time to detect undocumented devices: How quickly does your discovery process surface a device that connected without IPAM registration?
Change-to-incident ratio: What percentage of change windows result in an unplanned incident?

Building Resilience Over Time

Automation is the long-term answer to documentation drift. When IPAM updates are triggered automatically by provisioning workflows, when DNS records are managed through infrastructure-as-code, when DHCP scope changes generate automatic IPAM records, the gap between intended state and documented state narrows dramatically. Every manual documentation step that gets automated is a future audit discrepancy that never happens.

Conclusion

Network environments are never static. The practical approach is managed entropy. Which means accepting that the environment will always be changing, and investing in the tools, processes, and disciplines that make change survivable. Discovery and reconciliation make the environment legible. Structured change management makes transitions predictable. Sequestration provides a controlled holding environment for inevitable ambiguities. Post-change reconciliation ensures each change cycle leaves documentation more accurate than you found it.

← Previous Post