NEW - You can now accelerate your migration and modernization with AWS Transform. Read Getting Started in the AWS Transform User Guide.
Troubleshooting FSx for ONTAP issues
This section covers common issues when using FSx for ONTAP as the target storage type with MGN.
Topics
Troubleshooting FSx for ONTAP iSCSI connectivity
When MGN migrates a server using FSx for ONTAP, the target instance must establish iSCSI
sessions to the ONTAP SVM. A postboot script (mgn_iscsi_postboot) runs automatically
to configure iSCSI connectivity with multipath redundancy when available. If no iSCSI sessions
are established within the validation window (15 minutes for Linux, 25 minutes for Windows),
the job fails.
Postboot log location:
Linux:
/var/log/mgn_iscsi_postboot.logWindows:
C:\Windows\Temp\mgn_iscsi_postboot.log
Connect
to the target instance via
SSM
Session Manager or SSH to read the log. The target
instance ID is shown in the MGN console Launch status section. Look for ERROR or
WARNING entries.
Common failures:
| Symptom | Cause | Resolution |
|---|---|---|
Package installation errors or No supported package manager found |
Target instance cannot reach OS package repositories | Ensure outbound internet access (NAT gateway or internet gateway) for package downloads. See Step 6. |
| Package installation fails due to missing or inaccessible repositories (SLES, RHEL, CentOS) | Instance-bound or subscription-based repository credentials are not valid on the migrated target instance | Pre-install the required iSCSI and multipath packages on the source server before migration. See the packages table in Step 6. |
| Connection refused or timeout on port 3260 | Security groups or NACLs not configured for iSCSI traffic | Allow TCP 3260 between the target instance and FSx for ONTAP security groups. See Step 1. |
| No route to host or network unreachable | No network path between target instance and FSx for ONTAP subnets | Verify routing between the target subnet and FSx for ONTAP subnets (VPC route tables, VPC peering, or transit gateway). |
| Multipath-IO could not be installed | MPIO packages unavailable or Windows Desktop SKU without the feature | The migration will succeed with single-path connectivity. For full HA redundancy,
ensure the instance can install MPIO (Linux: multipath-tools or
device-mapper-multipath packages; Windows: Server SKU with
MultiPath-IO feature available). |
| Phase 1 completed but Phase 2 never starts (Windows only) | Instance failed to reboot after MPIO installation or postboot service did not re-trigger | Verify the instance is running in the EC2 console. Check Windows Event Viewer for boot errors. Retry the launch. |
| Log file does not exist | Postboot script did not run (conversion failure before postboot stage) | Check the MGN console for earlier errors in the launch status. |
Note
If the validation reports "iSCSI connectivity established with 1 of 2 expected sessions - operating without multipath", the migration succeeds but without full HA redundancy. Verify that both FSx for ONTAP subnet endpoints are routable from the target instance.
For manual iSCSI verification, see Mounting iSCSI LUNs on FSx for ONTAP.
After fixing the issue, launch a new test or cutover from the MGN console. The postboot script will run again automatically.
FSx for ONTAP storage operation timed out
If a migration operation fails with a storage operation timeout, this indicates that MGN could not complete a storage request to the FSx for ONTAP file system within the expected time. This can be caused by insufficient capacity, degraded performance, or a network connectivity issue between MGN and the file system.
Possible causes and resolutions:
| Cause | How to verify | Resolution |
|---|---|---|
| File system is out of storage capacity | In the FSx console |
Increase the file system's storage capacity. For more information, see Managing storage capacity and provisioned IOPS. |
| Throughput capacity is insufficient for the workload | In the FSx console, check the Throughput CloudWatch metrics for the file system. Look for sustained throughput near the provisioned limit. | Increase the file system's throughput capacity. You can modify throughput at any time. For more information, see Managing throughput capacity. |
| Network connectivity issue between MGN and the FSx for ONTAP REST API | Verify that the security group attached to the FSx for ONTAP file system allows inbound HTTPS (TCP 443) from the FSx for ONTAP preferred and standby subnet CIDRs. These rules are required for MGN to access the ONTAP REST API. See 1.2 FSx for ONTAP security group. | Add inbound HTTPS (TCP 443) rules to the FSx for ONTAP security group with the preferred and standby subnet CIDRs as the source. For details on identifying these CIDRs, see Step 1: Configure security groups. |
After resolving the issue, retry the migration operation from the MGN console.
Replication volume not deleted after Finalize cutover/Disconnect from service (FlexClone split blocked by backup)
Symptoms:
-
After finalize cutover, the replication volume remains in the FSx for ONTAP file system and is not cleaned up by MGN.
-
The MGN console may show the source server in "Cutover" state but the old replication volume persists.
Cause:
After finalize cutover, MGN creates a FlexClone from the replication volume and initiates a split to make it independent. If FSx for ONTAP automatic backups (or a manual backup) were taken on the target volume (FlexClone) after finalize cutover, the backup creates a locked snapshot that blocks the FlexClone split operation. MGN cannot complete the split until the locked snapshot is removed.
Resolution:
-
Check if automatic backups are enabled – In the FSx for ONTAP console, navigate to your file system and check the backup settings. If automatic backups are enabled, disable them to prevent new backups from being created on the volumes.
-
Delete backups from target volumes – In the FSx for ONTAP console, navigate to Backups and delete all backups associated with volumes matching the
target_pattern. There may be more than one target volume per source server. Select each backup and choose Actions → Delete backup. This releases the locked snapshots that block the split.source_server_id_timestampNote
Deleting the backup does not affect the target volume data. If you need to retain backup data, you can restore it to a separate volume before deleting.
-
Wait 24 hours for cleanup – MGN will automatically complete the FlexClone split and delete the replication volume within 24 hours. No further manual action is needed.
-
Re-enable automatic backups – After the replication volume has been cleaned up, re-enable automatic backups on the FSx for ONTAP file system to resume regular backup protection.
For more information, see Managing FSx for ONTAP volumes and FSx for ONTAP backups.
Orphaned FSx for ONTAP target volumes (FlexClone) after launch cleanup
Symptom:
FSx for ONTAP volumes with the naming pattern
target_
remain on the file system after a "Revert to Ready for testing", "Revert to Ready for cutover",
or "Terminate launched instances" action.source_server_id_timestamp
Cause:
MGN cannot delete FlexClone volumes that have FSx for ONTAP backups on them.
How to confirm:
-
Navigate to the MGN console → Launch history.
-
Find the job corresponding to the termination action for the specific source server.
-
Check the job event logs for the following error:
"Failed to delete FSx FlexClone volume due to active SnapMirror relationship. Disable automatic backups on the file system and manually delete the volume."
Resolution:
Delete the orphaned volume manually via the FSx for ONTAP console:
-
Open the FSx for ONTAP console → Volumes.
-
Locate the volume matching the
target_pattern.source_server_id_timestamp -
Delete the volume.