Experiment on Container Resource Pod Operation Faults

Last updated: 2024-09-26 15:47:38

    Background

    Pods are the smallest deployable units in a Kubernetes cluster that run applications. In practical use cases, Pods may encounter issues such as container crashes, resource shortages, Pod termination, and Pod failures, which can prevent applications from running properly.
    To enhance the reliability and stability of container services, it is necessary to conduct Pod fault experiments. These experiments help verify whether the system can continue to operate normally in the event of a Pod fault, and they allow for early identification of potential issues in such fault scenarios, enabling quick and effective resolution.

    Experiment Execution

    Step 1: Experiment Preparation

    Purchase container instances and deploy test services. If a container instance is already available for the experiment, proceed directly to creating the experiment.
    Go to the Agent Management page and install the agents.
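
    A deleted Pod is recreated only if a controller (for example, a Deployment's ReplicaSet or a StatefulSet) owns it, so it can be worth checking the test service before running the experiment. The following is a minimal sketch, not part of the product: it assumes the Pod list was exported with `kubectl get pods -o json`, and the function name and sample data are hypothetical.

```python
def pods_without_controller(pod_list):
    """Return the names of Pods that no controller owns.

    `pod_list` is a dict in the shape of `kubectl get pods -o json` output.
    Pods listed here would not be recreated after a Pod deletion fault.
    """
    unmanaged = []
    for pod in pod_list.get("items", []):
        meta = pod.get("metadata", {})
        owners = meta.get("ownerReferences") or []
        # A managing controller sets `controller: true` on its owner reference.
        if not any(ref.get("controller") for ref in owners):
            unmanaged.append(meta.get("name"))
    return unmanaged


# Hypothetical sample data for illustration only.
sample_pods = {
    "items": [
        {
            "metadata": {
                "name": "demo-7c9d-abc",
                "ownerReferences": [
                    {"kind": "ReplicaSet", "name": "demo-7c9d", "controller": True}
                ],
            }
        },
        {"metadata": {"name": "bare-pod"}},
    ]
}

print(pods_without_controller(sample_pods))
```

    Any Pod name this prints is a poor target for the deletion fault, because nothing will bring it back afterwards.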

    Step 2: Create an Experiment

    1. Log in to Tencent Smart Advisor > Chaotic Fault Generator, go to the Experiment Management page, and click Create a New Experiment.
    2. Click Skip and create a blank experiment.
    3. Select Container as the instance type, select Standard Cluster Pod as the instance object, and then click Add Instance.
    4. Add a fault action. Click Add Now and select Pod Operation.
    5. Select the fault action Pod deletion.
    6. Set the action parameters and click Confirm.
    7. After configuring the action parameters, click Next. Configure Guardrail Policy and Monitoring Metrics according to your actual situation, and then click Submit to complete experiment creation.

    Step 3: Execute the Experiment

    1. View the Pod's information before executing the fault, focusing on the Pod's creation time and running time.
    2. Go to the experiment details page and click Go to the action group for execution.
    3. Click Execute to start the experiment.
    4. View the details of the action execution results.
    5. View the execution logs to confirm that the action was executed successfully.
    6. Verify the execution effect. View the Pod's information after the fault execution. The Pod's creation time is later and its running time is shorter than before the fault, which indicates that the previous Pod was deleted and a new Pod was created to replace it.
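
    The before-and-after comparison in Step 3 can also be scripted. The following is a minimal sketch under stated assumptions: the two snapshots were saved with `kubectl get pods -o json` before and after the fault, the function names and sample data are hypothetical, and the "after" snapshot was taken once the old Pod had finished terminating. Pods are compared by `metadata.uid` rather than by name, because a recreated StatefulSet Pod keeps its name but receives a new UID.

```python
from datetime import datetime, timezone


def parse_ts(ts):
    """Parse a Kubernetes timestamp such as '2024-09-26T07:40:00Z' (UTC)."""
    return datetime.strptime(ts, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)


def index_by_uid(pod_list):
    """Map each Pod's UID to its metadata."""
    return {p["metadata"]["uid"]: p["metadata"] for p in pod_list.get("items", [])}


def diff_snapshots(before, after):
    """Return (deleted_names, created_names) between two Pod snapshots."""
    old, new = index_by_uid(before), index_by_uid(after)
    deleted = sorted(old[uid]["name"] for uid in old.keys() - new.keys())
    created = sorted(new[uid]["name"] for uid in new.keys() - old.keys())
    return deleted, created


def running_seconds(pod_meta, now):
    """Running time of a Pod: elapsed seconds since its creation time."""
    return (now - parse_ts(pod_meta["creationTimestamp"])).total_seconds()


# Hypothetical snapshots for illustration only.
before = {"items": [{"metadata": {
    "name": "demo-7c9d-abc", "uid": "uid-1",
    "creationTimestamp": "2024-09-26T07:00:00Z"}}]}
after = {"items": [{"metadata": {
    "name": "demo-7c9d-xyz", "uid": "uid-2",
    "creationTimestamp": "2024-09-26T07:40:00Z"}}]}

deleted, created = diff_snapshots(before, after)
now = parse_ts("2024-09-26T07:41:00Z")
new_pod = index_by_uid(after)["uid-2"]
print("deleted:", deleted)
print("created:", created)
print("new Pod running time (s):", running_seconds(new_pod, now))
```

    A non-empty `deleted` list together with a `created` Pod whose running time is short corresponds to the outcome described in the last step.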