Background
A CVM network interruption is a common fault that can be caused by hardware faults, improper network configuration, network equipment faults, network congestion, and other factors. It cuts off network communication for the CVM instance and disrupts normal business operation. For businesses that depend on high availability and low latency, a network interruption can cause serious losses.
To improve the reliability and stability of CVM networking, run network interruption experiments. These experiments verify whether the system keeps operating normally when the network is interrupted and expose fault-induced issues in advance, so that the system architecture can be optimized and contingency plans can be prepared.
Experiment Implementation
Step 1: Experiment Preparation
Prepare several CVM instances that are available for the experiment.
Step 2: Experiment Orchestration
1. Check the network status before fault injection. Ping the target machine to check network connectivity. If the target address does not respond or the packet loss rate is high, the network may already be interrupted.
3. Click Skip to create a blank experiment. Fill in the experiment information, and select a CVM instance as the experiment object.
4. Click Add Now, select Network Resources, click Network interruption, and click Next.
5. Configure the fault action parameters and click Confirm. This action requires no parameters of its own, and the general parameters do not need to be configured.
6. After configuring the action parameters, click Next. Configure guardrail policies and monitoring metrics based on your actual situation, and click Submit to complete experiment creation.
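The connectivity check in step 1 can also be scripted. The following is a minimal Python sketch, assuming a Linux machine with the iputils `ping` command, whose summary line contains `N% packet loss`. The function names `parse_packet_loss` and `check_connectivity` are illustrative and are not part of any CVM or console tooling.

```python
import re
import subprocess
from typing import Optional

# Assumption: Linux iputils ping, whose summary line looks like
# "4 packets transmitted, 4 received, 0% packet loss, time 3005ms"
_LOSS_RE = re.compile(r"(\d+(?:\.\d+)?)% packet loss")


def parse_packet_loss(ping_output: str) -> Optional[float]:
    """Return the packet loss percentage found in ping output, or None if absent."""
    match = _LOSS_RE.search(ping_output)
    return float(match.group(1)) if match else None


def check_connectivity(host: str, count: int = 4, max_loss: float = 0.0) -> bool:
    """Ping `host` `count` times; True if packet loss is at most `max_loss` percent."""
    try:
        result = subprocess.run(
            ["ping", "-c", str(count), host],
            capture_output=True,
            text=True,
            timeout=count * 5 + 5,  # generous upper bound so the call cannot hang
        )
    except subprocess.TimeoutExpired:
        return False
    loss = parse_packet_loss(result.stdout)
    # No summary line (for example, an unresolvable host) counts as a failed check.
    return loss is not None and loss <= max_loss
```

For example, `check_connectivity("10.0.0.5")` (a placeholder address) should return `True` before fault injection if the target machine is healthy. A `False` result at this stage suggests the network is already impaired and the experiment baseline is not valid.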
Step 3: Experiment Execution
1. Go to experiment details, and click Go to the action group for execution.
2. Click Execute to start the experiment.
3. Click the action card to view the details of the action execution result.
4. Check the network connection status of the target machine after fault injection. The target machine can no longer be pinged.
5. Execute the recovery action, and check the execution logs of the recovery action.
6. Check the result of fault recovery. Log in to a machine and ping the target machine. The ping replies show that the fault has been cleared and network connectivity has recovered.
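The two manual ping checks in this step (unreachable after fault injection, reachable again after recovery) can be approximated with a small polling helper. This is a sketch under stated assumptions, not the console's own verification logic: it assumes Linux iputils `ping` (where `-W` is a reply timeout in seconds), and `is_reachable` and `wait_for_state` are hypothetical names introduced for illustration.

```python
import subprocess
import time
from typing import Callable


def is_reachable(host: str) -> bool:
    """Send one ICMP echo request; True if a reply arrived (ping exit code 0)."""
    result = subprocess.run(
        ["ping", "-c", "1", "-W", "2", host],  # assumption: iputils flags
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def wait_for_state(
    host: str,
    expect_reachable: bool,
    timeout_s: float = 60.0,
    interval_s: float = 2.0,
    probe: Callable[[str], bool] = is_reachable,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll `host` until its reachability matches `expect_reachable`.

    Returns True as soon as the expected state is observed, or False once
    roughly `timeout_s` seconds' worth of attempts have been used up.
    """
    attempts = max(1, int(timeout_s // interval_s))
    for attempt in range(attempts):
        if probe(host) == expect_reachable:
            return True
        if attempt < attempts - 1:
            sleep(interval_s)  # wait before the next probe
    return False
```

After clicking Execute, `wait_for_state(target, expect_reachable=False)` should return `True` once the fault takes effect. After the recovery action, `wait_for_state(target, expect_reachable=True)` should return `True` once connectivity is restored. The `probe` and `sleep` parameters are injectable so the polling logic can be exercised without a real network.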