AWS群集自动缩放器部署不断抛出错误无法重新生成ASG缓存



我正在尝试在AWS EKS上部署ClusterAutoscaler,它有多个节点组。节点位于公用子网中,并从内部连接到internet。但是ClusterAutoscaler部署(Multi-ASG(不断失败,pod不断重新启动,并出现以下错误。

I0323 14:09:02.441010       1 auto_scaling_groups.go:138] Registering ASG eks-2ab883a6-97e1-c240-5d22-5a87384ef2fe
I0323 14:09:02.441022       1 auto_scaling_groups.go:354] Regenerating instance to ASG map for ASGs: [eks-1ab883a6-97e6-5d39-89b2-ceaa807bd403 eks-2ab883a6-97e1-c240-5d22-5a87384ef2fe]
I0323 14:09:02.441602       1 reflector.go:123] Starting reflector *v1.StorageClass (0s) from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.441630       1 reflector.go:161] Listing and watching *v1.StorageClass from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.441956       1 reflector.go:123] Starting reflector *v1.Service (0s) from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.441973       1 reflector.go:161] Listing and watching *v1.Service from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.442239       1 reflector.go:123] Starting reflector *v1.ReplicaSet (0s) from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.442266       1 reflector.go:161] Listing and watching *v1.ReplicaSet from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.442478       1 reflector.go:123] Starting reflector *v1.Pod (0s) from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.442499       1 reflector.go:161] Listing and watching *v1.Pod from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.442710       1 reflector.go:123] Starting reflector *v1.Node (0s) from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.442725       1 reflector.go:161] Listing and watching *v1.Node from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.443103       1 reflector.go:123] Starting reflector *v1.PersistentVolume (0s) from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.443119       1 reflector.go:161] Listing and watching *v1.PersistentVolume from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.443379       1 reflector.go:123] Starting reflector *v1.PersistentVolumeClaim (0s) from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.443394       1 reflector.go:161] Listing and watching *v1.PersistentVolumeClaim from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.443652       1 reflector.go:123] Starting reflector *v1.ReplicationController (0s) from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.443669       1 reflector.go:161] Listing and watching *v1.ReplicationController from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.443881       1 reflector.go:123] Starting reflector *v1.StatefulSet (0s) from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.443901       1 reflector.go:161] Listing and watching *v1.StatefulSet from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.540405       1 reflector.go:123] Starting reflector *v1beta1.PodDisruptionBudget (0s) from k8s.io/client-go/informers/factory.go:132
I0323 14:09:02.540446       1 reflector.go:161] Listing and watching *v1beta1.PodDisruptionBudget from k8s.io/client-go/informers/factory.go:132
E0323 14:11:03.122943       1 aws_manager.go:259] Failed to regenerate ASG cache: RequestError: send request failed
caused by: Post https://autoscaling.us-east-2.amazonaws.com/: dial tcp: i/o timeout
F0323 14:11:03.122980       1 aws_cloud_provider.go:330] Failed to create AWS Manager: RequestError: send request failed
caused by: Post https://autoscaling.us-east-2.amazonaws.com/: dial tcp: i/o timeout

所有的ASG都有自动发现所需的标签,我也尝试过集群自动缩放器自动发现部署,这也显示了类似的问题。

如果您使用EKS地形模块,您只需要通过添加以下内容来启用irsa;

enable_irsa=true

最新更新