Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create a deployment to run the HP/Katib controller for the GitHub issue example #161

Merged
merged 1 commit into from
Jul 11, 2018

Conversation

jlewi
Copy link
Contributor

@jlewi jlewi commented Jun 30, 2018

  • Some of the code is copied over from https://github.com/kubeflow/katib/tree/master/examples/GKEDemo

    • I think it makes sense to centralize all the code in a single place.
  • Update the controller program (git-issue-summarize-demo.go) so that can
    specify the Docker image containing the training code.

  • Create a ksonnet deployment for running the controller on the cluster.

  • The HP tuning job isn't functional here's an incomplete list of issues

    • The training jobs launched fail because they don't have GCP credentials
      so they can't download the data.

    • We don't actually extract and report metrics back to Katib.

Related to: kubeflow/katib#116


This change is Reviewable

…ue example.

* Some of the code is copied over from https://github.com/kubeflow/katib/tree/master/examples/GKEDemo

  * I think it makes sense to centralize all the code in a single place.

* Update the controller program (git-issue-summarize-demo.go) so that can
  specify the Docker image containing the training code.

* Create a ksonnet deployment for running the controller on the cluster.

* The HP tuning job isn't functional here's an incomplete list of issues

  * The training jobs launched fail because they don't have GCP credentials
    so they can't download the data.

  * We don't actually extract and report metrics back to Katib.

Related to: kubeflow/katib#116
@jlewi
Copy link
Contributor Author

jlewi commented Jun 30, 2018

/assign @YujiOshima @gaocegege

jlewi added a commit to jlewi/katib that referenced this pull request Jun 30, 2018
* The GKEDemo is using the GitHub summarization example; I think we should
  put all of the code for that demo kubeflow/examples
  (see kubeflow/examples#161)

* The main code is the Katib HP controller git-issue-summarize-demo.go

* We don't need the manifests for deploying katib because we can deploy
  Katib using the Kubeflow ksonnet package.

* The code in docker-image duplicates the code in kubeflow/examples so we
  shouldn't need it.

Related to: kubeflow#116
k8s-ci-robot pushed a commit to kubeflow/katib that referenced this pull request Jul 1, 2018
* The GKEDemo is using the GitHub summarization example; I think we should
  put all of the code for that demo kubeflow/examples
  (see kubeflow/examples#161)

* The main code is the Katib HP controller git-issue-summarize-demo.go

* We don't need the manifests for deploying katib because we can deploy
  Katib using the Kubeflow ksonnet package.

* The code in docker-image duplicates the code in kubeflow/examples so we
  shouldn't need it.

Related to: #116
@jlewi
Copy link
Contributor Author

jlewi commented Jul 10, 2018

/assign @ankushagarwal

@ankushagarwal
Copy link

/lgtm
/approve

@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ankushagarwal

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot merged commit eaf0298 into kubeflow:master Jul 11, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants