
Stop deleting underlying services when federation service is deleted #37353

Merged
merged 1 commit on Nov 30, 2016

Conversation

nikhiljindal
Contributor

@nikhiljindal nikhiljindal commented Nov 23, 2016

Fixes #36799

This fixes the federation service controller so that it no longer deletes services from the underlying clusters when the federated service is deleted.
No federation controller should do this unless explicitly asked by the user via DeleteOptions. The service controller is the only federation controller that does it.
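To make the change concrete, here is a minimal, self-contained sketch of the intended behavior (hypothetical types and names, not the actual controller code): when a federated service is deleted, the controller only drops its own bookkeeping and does not touch the per-cluster services.

package main

import "fmt"

// clusterClient stands in for a per-cluster clientset (hypothetical type).
type clusterClient struct{ name string }

func (c *clusterClient) deleteService(namespace, name string) {
	fmt.Printf("deleting %s/%s in cluster %s\n", namespace, name, c.name)
}

// serviceController stands in for the federation service controller.
type serviceController struct {
	clusters []*clusterClient
	cache    map[string]bool // federated services the controller knows about
}

// handleFederatedServiceDeletion mirrors the fix: only the controller's own
// cache entry is removed. The old loop that deleted the service from every
// underlying cluster is gone; cascading deletion becomes opt-in via
// DeleteOptions rather than the default.
func (s *serviceController) handleFederatedServiceDeletion(key string) {
	delete(s.cache, key)
	// Intentionally no loop over s.clusters calling deleteService here.
}

func main() {
	s := &serviceController{
		clusters: []*clusterClient{{name: "us-east"}, {name: "eu-west"}},
		cache:    map[string]bool{"default/my-svc": true},
	}
	s.handleFederatedServiceDeletion("default/my-svc")
	fmt.Println("federated service forgotten; underlying services untouched")
}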

cc @kubernetes/sig-cluster-federation @madhusudancs

federation service controller: stop deleting services from underlying clusters when federated service is deleted.


@k8s-ci-robot k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Nov 23, 2016
@nikhiljindal nikhiljindal added the area/cluster-federation and kind/bug (Categorizes issue or PR as related to a bug.) labels Nov 23, 2016
@nikhiljindal nikhiljindal added the release-note Denotes a PR that will be considered when it comes time to generate release notes. label Nov 23, 2016
@nikhiljindal
Contributor Author

cc @saad-ali as FYI for 1.5.

This is a bug fix for 1.5. It is required in 1.5 since we are introducing cascading deletion based on DeleteOptions.OrphanDependents for federation resources. The federation service controller was deleting underlying services even without that option, which is unexpected.
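For reference, here is a minimal sketch of how a caller would choose between orphaning and cascading via DeleteOptions.OrphanDependents; the import paths and clientset wiring below are assumptions based on the 1.5-era API, not something this PR adds.

package fedutil

import (
	"k8s.io/kubernetes/pkg/api/v1"
	// Hypothetical package path for the federation clientset; the exact
	// location in the 1.5 tree may differ.
	fedclientset "k8s.io/kubernetes/federation/client/clientset_generated/federation_clientset"
)

// DeleteFederatedService deletes a federated Service, either orphaning the
// per-cluster Services (the behavior this PR makes the default) or cascading
// the deletion into the underlying clusters when explicitly requested.
func DeleteFederatedService(client fedclientset.Interface, namespace, name string, cascade bool) error {
	orphan := !cascade
	return client.Core().Services(namespace).Delete(name, &v1.DeleteOptions{
		// OrphanDependents=true leaves the services in the underlying
		// clusters untouched; false asks for cascading deletion.
		OrphanDependents: &orphan,
	})
}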

@nikhiljindal nikhiljindal added this to the v1.5 milestone Nov 23, 2016
@k8s-github-robot k8s-github-robot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Nov 23, 2016
@nikhiljindal
Contributor Author

Brought up a cluster and verified that services are not deleted in underlying clusters and DNS records are not touched.

Contributor

@madhusudancs madhusudancs left a comment

Could you please also audit the federated service e2e tests to ensure that we don't accidentally leak services in the underlying clusters after this change?

Other than that, a few minor nits. Please feel free to apply the LGTM label after addressing them.

// or we do nothing for service deletion
// TODO: Should uncomment this?
// We should delete the load balancer when service is deleted
// Look at the same method in kube service controller.
//err := s.dns.balancer.EnsureLoadBalancerDeleted(service)
Contributor

Remove this as well. We are not going to do this.

Contributor Author

Umm, why not? The kube service controller calls this.
Leaving this as-is. We can remove it in a separate PR if we want to.

Contributor

@nikhiljindal by "this" I meant the TODO comment and commented code. What was your "this"?

What load balancer do you think we are going to delete here? Also, the commented code is wrong anyway: s.dns is an interface that doesn't contain a field called balancer. My gut feeling is that this was a copy-paste from the cluster-level service controller and doesn't apply here.

Please remove it. The comment change makes things worse, and the right fix is just removing it.

Contributor Author

Ah, you are right. Removed it. Thanks for clarifying.

err := s.deleteClusterService(clusterName, cachedService, cluster.clientset)
if err != nil {
hasErr = true
} else if err := s.ensureDnsRecords(clusterName, cachedService); err != nil {
Contributor

We should ensure that this behavior is carried over to the cascading deletion. Could you please add a TODO somewhere, or maybe open an issue? Either one is fine.

Contributor Author

I have a PR for it: #36390.
Also tracked in #33612.

Contributor

I don't see the effect of s.ensureDnsRecords() in PR #36390. Maybe I am missing something?

for clusterName, clusterClientset := range clusters {
_, err := clusterClientset.Core().Services(service.Namespace).Get(service.Name)
if err != nil {
framework.Failf("Unexpected error in fetching service %s in cluster %s, %s", service.Name, clusterName, err)
Contributor

I am not entirely sure about failing here. Should it log an error and continue instead?

Also, I would just put this in a defer block.

Contributor Author

Aah, the "Cleanup" comment was wrong. This is the test: we want to verify that services are not deleted from the underlying clusters. Removed the comment.

AfterEach deletes these services.
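The pattern being settled on here, sketched with hypothetical helper types rather than the real e2e framework clientsets: the test body asserts that every underlying cluster still has the service, and an AfterEach-style cleanup deletes the per-cluster services the test deliberately left behind.

package fedtest

// serviceClient abstracts the per-cluster clientset for this sketch.
type serviceClient interface {
	GetService(namespace, name string) error
	DeleteService(namespace, name string) error
}

// verifyUnderlyingServicesSurvive fails the test (via failf) if the service is
// missing from any underlying cluster after the federated service is deleted.
func verifyUnderlyingServicesSurvive(clusters map[string]serviceClient, namespace, name string, failf func(format string, args ...interface{})) {
	for clusterName, c := range clusters {
		if err := c.GetService(namespace, name); err != nil {
			failf("unexpected error fetching service %s in cluster %s: %v", name, clusterName, err)
		}
	}
}

// cleanupUnderlyingServices is the AfterEach-style cleanup: it deletes the
// per-cluster services that the test intentionally left in place, logging
// rather than failing if a deletion does not succeed.
func cleanupUnderlyingServices(clusters map[string]serviceClient, namespace, name string, logf func(format string, args ...interface{})) {
	for clusterName, c := range clusters {
		if err := c.DeleteService(namespace, name); err != nil {
			logf("failed to delete service %s in cluster %s: %v", name, clusterName, err)
		}
	}
}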

@nikhiljindal
Contributor Author

Thanks @madhusudancs
Updated as per comments.
Also verified that services are not being leaked.
DNS records are being leaked, though. Tracked in #29335.

Adding LGTM label

@nikhiljindal nikhiljindal added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Nov 23, 2016
@madhusudancs
Contributor

@nikhiljindal please look at the comment.

@nikhiljindal nikhiljindal removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Nov 23, 2016
@nikhiljindal
Contributor Author

Thanks @madhusudancs
Updated as per comment.

@nikhiljindal nikhiljindal added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Nov 23, 2016
@dims
Member

dims commented Nov 28, 2016

@k8s-bot test this

@nikhiljindal
Contributor Author

@k8s-bot cvm gce e2e test this issue: #IGNORE (stuck on Waiting for status to be reported)

@nikhiljindal
Contributor Author

@k8s-bot cvm gce e2e test this issue: #IGNORE (failed with "Error starting build")

@nikhiljindal
Contributor Author

Removing "requires release czar attention" as per an offline discussion with Saad last week.

@k8s-ci-robot
Contributor

Jenkins GCE e2e failed for commit 34eae22. Full PR test history.

The magic incantation to run this job again is @k8s-bot cvm gce e2e test this. Please help us cut down flakes by linking to an open flake issue when you hit one in your PR.

@nikhiljindal
Contributor Author

k8s-bot cvm gce e2e test this (build failed)

@nikhiljindal
Contributor Author

@k8s-bot cvm gce e2e test this (build failed)

@nikhiljindal
Contributor Author

@k8s-bot cvm gce e2e test this

@dims
Member

dims commented Nov 29, 2016

@k8s-bot gci gke e2e test this

@saad-ali
Member

cc @saad-ali as FYI for 1.5.

This is a bug fix for 1.5. It is required in 1.5 since we are introducing cascading deletion based on DeleteOptions.OrphanDependents for federation resources. The federation service controller was deleting underlying services even without that option, which is unexpected.

Ack. Thanks

@nikhiljindal
Contributor Author

Unit tests seem to have been stuck for more than 11 hours.
@k8s-bot unit test this

@k8s-github-robot

@k8s-bot test this [submit-queue is verifying that this PR is safe to merge]

@k8s-github-robot

Automatic merge from submit-queue

@k8s-github-robot k8s-github-robot merged commit d51f07b into kubernetes:master Nov 30, 2016
@saad-ali saad-ali added the cherry-pick-approved Indicates a cherry-pick PR into a release branch has been approved by the release branch manager. label Dec 1, 2016
saad-ali added a commit that referenced this pull request Dec 1, 2016
@k8s-cherrypick-bot

Commit found in the "release-1.5" branch appears to be this PR. Removing the "cherrypick-candidate" label. If this is an error, find help to get your PR picked.
