Alert about spawning DHCP server into the cluster via helm (dgxie service) #41

Closed
hightoxicity opened this issue Dec 6, 2018 · 8 comments

@hightoxicity
Contributor

hightoxicity commented Dec 6, 2018

While using helm to spawn the DHCP server (dgxie service), for an unknown reason we lost the docker daemon on the master; after that the kubelet could no longer keep critical services like the apiserver running or restart them... A reboot allowed docker + kubelet to recover.
But some worker nodes lost their IPs (they were not able to renew their leases during the incident), and we ended up with an unhealthy ceph cluster. Looking at the dgxie pod state after the reboot, the pod was stuck in the ContainerCreating state due to the partial ceph failure (the volume claim was stuck).
Finally everything recovered after replacing the volume claims with empty volumes at dgxie helm service creation: the dgxie service could then start, the nodes recovered their IPs, the ceph cluster went healthy again, and volume claims could be satisfied.

So two things here:

  • Spawning the dgxie service into the k8s cluster should come with an HA mechanism (DHCP is a critical service)
  • We should think about what the true dgxie storage requirements are and ensure dgxie storage resiliency
@hightoxicity
Contributor Author

I did some initial work to be able to run several dgxie instances on several master nodes:

hightoxicity@9bdf03b

In fact, all the dgxie instances can serve the static IP leases (no problem there). But to avoid collisions on the dynamic ranges, I added something to split the IP range between the replicas of a StatefulSet (a kind of consistent hashing).
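
To illustrate the idea (this is only a rough sketch, not the actual dgxie code; the hostname parsing and the IPv4-only handling are assumptions), each replica could derive its own slice of the dynamic range from its StatefulSet ordinal:

```go
// rangeshard.go: minimal sketch of splitting a dynamic DHCP range between
// StatefulSet replicas. Assumptions: each replica reads its ordinal from the
// pod hostname suffix (e.g. "dgxie-2" -> 2), and the range is IPv4.
package main

import (
	"encoding/binary"
	"fmt"
	"net"
	"os"
	"strconv"
	"strings"
)

// ordinalFromHostname extracts the StatefulSet ordinal from a pod name
// such as "dgxie-1".
func ordinalFromHostname(hostname string) (int, error) {
	idx := strings.LastIndex(hostname, "-")
	if idx < 0 {
		return 0, fmt.Errorf("no ordinal in hostname %q", hostname)
	}
	return strconv.Atoi(hostname[idx+1:])
}

// shardRange splits [start, end] into `replicas` contiguous slices and
// returns the slice owned by `ordinal`.
func shardRange(start, end net.IP, replicas, ordinal int) (net.IP, net.IP) {
	s := binary.BigEndian.Uint32(start.To4())
	e := binary.BigEndian.Uint32(end.To4())
	per := (e - s + 1) / uint32(replicas)
	lo := s + uint32(ordinal)*per
	hi := lo + per - 1
	if ordinal == replicas-1 { // last replica takes any remainder
		hi = e
	}
	loIP, hiIP := make(net.IP, 4), make(net.IP, 4)
	binary.BigEndian.PutUint32(loIP, lo)
	binary.BigEndian.PutUint32(hiIP, hi)
	return loIP, hiIP
}

func main() {
	hostname, _ := os.Hostname() // e.g. "dgxie-1"
	ord, err := ordinalFromHostname(hostname)
	if err != nil {
		ord = 0
	}
	lo, hi := shardRange(net.ParseIP("192.168.1.100"), net.ParseIP("192.168.1.199"), 2, ord)
	fmt.Printf("replica %d serves dynamic range %s - %s\n", ord, lo, hi)
}
```

With 2 replicas and a range of 192.168.1.100-192.168.1.199, replica 0 would serve .100-.149 and replica 1 would serve .150-.199, so the two instances never hand out the same dynamic address.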

To have distinct volumes for persisting leases and to avoid volume claim collisions, I switched from a ceph claim to a local mount point.

This last choice also means a critical component no longer depends on ceph cluster health.

Please tell me what you think about it!

Thx

@hightoxicity
Contributor Author

It also allows scaling the dgxie service out to x nodes (spreading some load across more master nodes)...

@hightoxicity
Contributor Author

We are currently not able to set a fixed key for signing URLs at the pixiecore level (https://github.com/google/netboot/blob/cc33920b4f3296801a64d731d269978116f40d92/pixiecore/booters.go#L137).

@hightoxicity
Contributor Author

I closed a previous PR #43 since URL nonces cannot be verified across different pixiecore instances.
To make dgxie HA possible, we need a way to provide the pixiecore signing key from "outside" (an env var fed by a kube secret, for example).

I am trying to submit my work on the subject here:

danderson/netboot#84
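
For reference, the shape of the change I am proposing upstream looks roughly like this (a sketch only; the env var name PIXIECORE_SIGNING_KEY and the 32-byte key size are my assumptions for illustration, not a confirmed pixiecore API):

```go
// signingkey.go: sketch of the idea behind the upstream change: let all
// replicas share one URL-signing key instead of each generating a random one.
package main

import (
	"crypto/rand"
	"encoding/hex"
	"fmt"
	"os"
)

// signingKey returns a 32-byte key: the hex-decoded value of the env var if
// it is set (so every replica signs and verifies the same URLs), otherwise a
// freshly generated random key (the current single-instance behaviour).
func signingKey() ([32]byte, error) {
	var key [32]byte
	if v := os.Getenv("PIXIECORE_SIGNING_KEY"); v != "" {
		raw, err := hex.DecodeString(v)
		if err != nil || len(raw) != len(key) {
			return key, fmt.Errorf("PIXIECORE_SIGNING_KEY must be 64 hex chars")
		}
		copy(key[:], raw)
		return key, nil
	}
	if _, err := rand.Read(key[:]); err != nil {
		return key, err
	}
	return key, nil
}

func main() {
	key, err := signingKey()
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	fmt.Printf("using signing key %x...\n", key[:4])
}
```

In the dgxie chart the env var would then be populated from a Kubernetes Secret (e.g. via secretKeyRef in the pod spec), so every replica signs and verifies the same URLs.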

@hightoxicity hightoxicity mentioned this issue Dec 12, 2018
@dholt
Contributor

dholt commented Dec 12, 2018

This is really interesting, nice work!

I don't understand how multiple dgxie instances would help with HA: if you split the DHCP pool and lose a replica, the nodes getting leases from that instance would still no longer be able to get IPs. So at best it seems like you'd (hopefully) only lose half of the cluster at a time vs the whole thing. Am I missing something?

@hightoxicity
Contributor Author

Hi, it is only the dynamic range that is split; all instances are able to provide a static lease for the cluster nodes because they all feed themselves from the pxe-machines configmap (if I understood correctly how the dgxie service works). This is where the solution provides true HA and avoids losing any k8s worker node. The downside is that the dynamic range is not very well managed by the solution (the ability to renew a lease belongs to only one shard, and we do not control which server answers a DHCP request first, so a new lease may simply be issued from the fragment owned by the server that answered), but it should work as long as dynamic IPs are the exception rather than the rule.

@dholt
Contributor

dholt commented Dec 13, 2018

Gotcha. Seems reasonable, I'll try it out as soon as I can.

@supertetelman
Collaborator

Marking this as stale.
