As your IDC resources may be limited, if you need to handle business traffic surges, the computing resources in your IDC may be insufficient to meet the requirements. In this case, you can use public cloud resources to handle temporary traffic. Based on custom scheduling policies and by leveraging TKE Serverless Container Service, TKE Resilience Chart adds supernodes to elastically migrate workloads in your IDC cluster to the cloud, so your cluster can get greater elastic scalability and enjoy the following benefits:
TKE Resilience Chart mainly consists of a supernode manager, scheduler, and toleration controller as detailed below:
Alias | Component Name | Description |
---|---|---|
eklet | Supernodes manager | It manages the lifecycle of PodSandboxes and provides APIs related to native kubelet and nodes. |
tke-scheduler | Scheduler | It migrates workloads to the cloud elastically according to scheduling policies and is only installed in non-TKE Kubernetes Distro K8s clusters. TKE Kubernetes Distro is a K8s distribution released by TKE to help you create exactly the same K8s cluster in TKE. Currently, it has been open sourced at GitHub. For more information, please see TKE Kubernetes Distro. |
admission-controller | Toleration controller | It adds a toleration to a Pod in pending status to make it able to be scheduled to a supernode. |
AUTO_SCALE_EKS=true|false
is available in global and local dimensions respectively to control whether workloads in pending
status should be elastically scheduled to EKS as detailed below:AUTO_SCALE_EKS
in kubectl get cm -n kube-system eks-config
is enabled by default.spec.template.metadata.annotations ['AUTO_SCALE_EKS']
Global Switch | Local Switch | Behavior |
---|---|---|
AUTO_SCALE_EKS=true | AUTO_SCALE_EKS=false | Successfully scheduled |
AUTO_SCALE_EKS=true | Undefined | Successfully scheduled |
AUTO_SCALE_EKS=true | AUTO_SCALE_EKS=true | Successfully scheduled |
AUTO_SCALE_EKS=false | AUTO_SCALE_EKS=false | Failed to be scheduled |
AUTO_SCALE_EKS=false | Undefined | Failed to be scheduled |
AUTO_SCALE_EKS=false | AUTO_SCALE_EKS=true | Successfully scheduled |
Undefined | AUTO_SCALE_EKS=false | Successfully scheduled |
Undefined | Undefined | Successfully scheduled |
Undefined | AUTO_SCALE_EKS=true | Successfully scheduled |
tke-scheduler
in workloads. In TKE Kubernetes Distro, you don't need to specify the scheduler.LOCAL_REPLICAS: N
.pending
status will be scaled out to EKS.pending
status will be scaled out to TKE Serverless cluster.DaemonsetForbidden
will be displayed.kube-system
and tke-eni-ip-webhook
namespaces cannot be scheduled to supernodes.securityContext.sysctls ["net.ipv4.ip_local_port_range"]
value includes 61000–65534 cannot be scheduled.Pod.Annotations [tke.cloud.tencent.com/vpc-ip-claim-delete-policy]
cannot be scheduled.container (initContainer).ports [].containerPort (hostPort)
value includes 61000–65534 cannot be scheduled.container (initContainer)
where the probe points to 61000–65534 cannot be scheduled.eks.tke.cloud.tencent.com/resolv-conf
annotation to a supernode, /etc/resolv.conf
in the generated CVM instance will be updated to the custom content.
Note:The original DNS configuration on the supernodes will be overwritten, and your custom configuration will prevail.
eks.tke.cloud.tencent.com/resolv-conf: |
nameserver 4.4.4.4
nameserver 8.8.8.8
tke-resilience helm chart
git clone https://github.com/tkestack/charts.git
Edit charts/incubator/tke-resilience/values.yaml
and configure the following information:
cloud:
appID: "{Tencent Cloud account APPID}"
ownerUIN: "{Tencent Cloud account ID}"
secretID: "{Tencent Cloud account secretID}"
secretKey: "{Tencent Cloud account secretKey}"
vpcID: "{ID of the VPC where the EKS Pod resides}"
regionShort: "{Abbreviation of the region where the EKS Pod resides}"
regionLong: "{Full name of the region where the EKS Pod resides}"
subnets:
- id: "{ID of the subnet where the EKS Pod resides}"
zone: "{AZ where the EKS Pod resides}"
eklet:
PodUsedApiserver: "{API server address of the current cluster}"
Note:For more information on the regions and AZs where TKE Serverless container service is available, please see Regions and AZs.
You can use the local Helm client to connect to the cluster.
Run the following command to use a Helm chart to install TKE Resilience Chart in a third-party cluster:
helm install tke-resilience --namespace kube-system ./tke-resilience --debug
Run the following command to check whether the required components in the Helm application are installed. This document uses a TKE Kubernetes Distro cluster with no tke-scheduler installed as an example.
# kubectl get Pod -n kube-system | grep resilience
eklet-tke-resilience-5f9dcd99df-rgsmc 1/1 Running 0 43h
eks-admission-tke-resilience-5bb588dc44-9hvhs 1/1 Running 0 44h
You can see that one supernode has been deployed in the cluster.
# kubectl get node
NAME STATUS ROLES AGE VERSION
10.0.1.xx Ready <none> 2d4h v1.20.4-tke.1
10.0.1.xx Ready master 2d4h v1.20.4-tke.1
eklet-subnet-xxxxxxxx Ready <none> 43h v2.4.6
Create a demo application nginx-deployment
, which has four replicas (three in TKE Serverless cluster and one in the local cluster). Below is the sample YAML configuration:
apiVersion: apps/v1
kind: Deployment
metadata:
name: nginx-deployment
labels:
app: nginx
spec:
replicas: 4
strategy:
type: RollingUpdate
selector:
matchLabels:
app: nginx
template:
metadata:
annotations:
AUTO_SCALE_EKS: "true"
LOCAL_REPLICAS: "1" # Set the number of running replicas in the local cluster to 1
labels:
app: nginx
spec:
#schedulerName: tke-scheduler If it is a third-party cluster, you need to run the scheduler as `tke-scheduler`
containers:
- name: nginx
image: nginx
imagePullPolicy: IfNotPresent
Check whether the replica status and distribution meet the expectations.
# kubectl get Pod -owide
NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES
nginx-deployment-77b9b9bc97-cq9ds 1/1 Running 0 27s 10.232.1.88 10.0.1.xxx <none> <none>
nginx-deployment-77b9b9bc97-s9vzc 1/1 Running 0 27s 10.0.1.118 eklet-subnet-xxxxxxxx <none> <none>
nginx-deployment-77b9b9bc97-sd4z5 1/1 Running 0 27s 10.0.1.7 eklet-subnet-xxxxxxxx <none> <none>
nginx-deployment-77b9b9bc97-z86tx 1/1 Running 0 27s 10.0.1.133 eklet-subnet-xxxxxxxx <none> <none>
Check the scale-in feature. As a TKE Kubernetes Distro cluster is used, TKE Serverless cluster instances will be scaled in preferentially. Here, the number of application replicas is adjusted from 4 to 3.
# kubectl scale deployment nginx-deployment --replicas=3
As shown below, replicas in Tencent Cloud are scaled in first, which meets the expectation:
# kubectl get Pod -owide
NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES
nginx-deployment-77b9b9bc97-cq9ds 1/1 Running 0 7m38s 10.232.1.88 10.0.1.xxx <none> <none>
nginx-deployment-77b9b9bc97-s9vzc 1/1 Running 0 7m38s 10.0.1.118 eklet-subnet-xxxxxxxx <none> <none>
nginx-deployment-77b9b9bc97-sd4z5 1/1 Running 0 7m38s 10.0.1.7 eklet-subnet-xxxxxxxx <none> <none>
Apakah halaman ini membantu?