Install Kubeflow

How to deploy Kubeflow on a Nutanix Kubernetes Engine (NKE) cluster


Installing Kubeflow

Follow these steps to deploy Kubeflow 1.7.0 on your NKE cluster.

  1. Download the Terraform script that deploys Kubeflow on NKE by cloning the GitHub repository shown.

    git clone -b release-v1.7 && cd kubeflow-manifests
    cd automation/terraform/install_kubeflow
  2. Create an env.tfvars file in the same folder with the following cluster variables. Override other variables from file if required.

    prism_central_username = "enter username"
    prism_central_password = "enter password"
    prism_central_endpoint = "enter endpoint_ip_or_host_fqdn"
    karbon_cluster_name    = "enter NKE_cluster_name"
    kubeconfig_filename    = "enter NKE_cluster_name-kubectl.cfg"
    kubeflow_version       = "1.7.0"
  3. Run the following Terraform commands to deploy Kubeflow in the cluster.

    terraform init
    terraform plan --var-file=env.tfvars
    terraform apply --var-file=env.tfvars
  4. Make sure all the pods are running before continuing to the next step.

    $ kubectl -n kubeflow get pods
    NAME                                                         READY   STATUS    RESTARTS   AGE
    admission-webhook-deployment-65dcd649d8-468g9                1/1     Running   0          3m39s
    cache-deployer-deployment-6b78494889-6lfg9                   2/2     Running   1          3m1s
    cache-server-bff956474-lm952                                 2/2     Running   0          3m
    centraldashboard-6b5fb79878-h9dqn                            1/1     Running   0          3m40s
    jupyter-web-app-deployment-75559c6c87-mt4q2                  1/1     Running   0          3m1s
    katib-controller-79f44b76bb-t7rzl                            1/1     Running   0          3m
    katib-db-manager-6d9857f658-p4786                            1/1     Running   0          2m59s
    katib-mysql-586f79b694-2qcl5                                 1/1     Running   0          2m59s
    katib-ui-5fdb7869cf-jmssr                                    1/1     Running   0          3m
    kfserving-controller-manager-0                               2/2     Running   0          3m15s
    kubeflow-pipelines-profile-controller-6cfd6bf9bd-cptgg       1/1     Running   0          2m59s
    metacontroller-0                                             1/1     Running   0          3m15s
    metadata-envoy-deployment-6756c995c9-gqkbd                   1/1     Running   0          3m
    metadata-grpc-deployment-7cb87744c7-4crm9                    2/2     Running   3          3m40s
    metadata-writer-6bf5cfd7d8-fgq9f                             2/2     Running   0          3m40s
    minio-5b65df66c9-9z7mg                                       2/2     Running   0          2m59s
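
Instead of polling the pod list by hand, the readiness check in step 4 could be scripted. The following is a minimal sketch, not part of the official procedure: wait_for_pods is a hypothetical helper name, and the default namespace and timeout are assumptions. It relies on kubectl wait, which blocks until every matching pod reports the Ready condition or the timeout expires. Pods that have already run to completion never become Ready, so the command would time out if the namespace contains finished Jobs.

```shell
# Hypothetical helper: block until every pod in a namespace reports Ready.
# The namespace and timeout defaults are assumptions; adjust them as needed.
wait_for_pods() {
  local namespace="${1:-kubeflow}"
  local timeout="${2:-600s}"
  kubectl wait --for=condition=Ready pods --all \
    -n "$namespace" --timeout="$timeout"
}

# Example usage (uncomment to run against your cluster):
# wait_for_pods kubeflow 600s
```

A non-zero exit status indicates that at least one pod did not become Ready in time, so the helper can gate later steps in a script.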

Add a new Kubeflow user

New users are created using the Profile resource, and a new namespace is created with the same name as the Profile. To create a new user with a given email in a namespace named project1, apply the following Profile:

cat <<EOF | kubectl apply -f -
apiVersion: kubeflow.org/v1beta1
kind: Profile
metadata:
    name: project1   # replace with the name of profile you want, this will be the user's namespace name
spec:
    owner:
        kind: User
        name:   # replace with the user email
EOF
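
After the Profile has been applied, it may be worth confirming that both the Profile and its namespace exist before continuing. The sketch below is one way to check, assuming the Profile name is passed as the first argument; profile_exists is a hypothetical helper name, not something shipped with Kubeflow.

```shell
# Hypothetical helper: succeed only if the Profile and the namespace
# of the same name both exist.
profile_exists() {
  local profile="$1"
  kubectl get profile "$profile" >/dev/null 2>&1 &&
    kubectl get namespace "$profile" >/dev/null 2>&1
}

# Example usage (uncomment to run against your cluster):
# profile_exists project1 && echo "project1 is ready"
```

The namespace is created by the profile controller shortly after the Profile is accepted, so a first check made immediately after applying may fail and succeed on retry.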

If you are using basic authentication, add the user credentials to Dex, which is the default OpenID Connect provider in Kubeflow. Generate the password hash using bcrypt and add it to the following ConfigMap:

kubectl edit cm dex -o yaml -n auth

Add the following under the staticPasswords section:

- email:
  hash: <hash>
  username: user2

Restart the dex deployment so that it picks up the change:

kubectl -n auth rollout restart deployment dex
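
The bcrypt hash for the staticPasswords entry can be generated locally. The sketch below assumes the htpasswd utility is installed (it ships with the Apache HTTP server tools); bcrypt_hash is a hypothetical helper name and the cost factor of 10 is an assumption. Note that the -b flag passes the password on the command line, where it may be visible in shell history and process listings.

```shell
# Hypothetical helper: print a bcrypt hash of the given password.
# htpasswd prints "<user>:<hash>"; with an empty user name, removing the
# colon and the newlines leaves only the hash.
bcrypt_hash() {
  local password="$1"
  htpasswd -bnBC 10 "" "$password" | tr -d ':\n'
}

# Example usage (uncomment to run; requires htpasswd):
# bcrypt_hash 'choose-a-password'
```

Paste the printed value into the hash field of the new staticPasswords entry.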

Setup a LoadBalancer (Optional)

If you already have a load balancer set up for your NKE cluster, or you do not want to expose the Kubeflow dashboard through an external load balancer IP, you can skip this step. Otherwise, install the MetalLB load balancer manifests on your NKE cluster.

$ kubectl apply -f
$ kubectl apply -f

After the manifests have been applied, configure MetalLB with the IP range that it can use to assign external IPs to services of type LoadBalancer. You can find the range from the subnet in Prism Central’s networking and security settings.

Create a ConfigMap with the following information in a file named metallb-configmap.yaml, substitute the addresses field with your IP address range, and apply it to the cluster.

apiVersion: v1
kind: ConfigMap
metadata:
  namespace: metallb-system
  name: config
data:
  config: |
    address-pools:
      - name: default
        protocol: layer2
        addresses:
        - <IP_ADDRESS_RANGE: x.x.x.x-x.x.x.x>

$ kubectl apply -f metallb-configmap.yaml
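
The ConfigMap file can also be generated from the address range instead of being edited by hand. The following sketch assumes a MetalLB release that still reads its configuration from a ConfigMap (releases from 0.13 onward use custom resources instead); write_metallb_config is a hypothetical helper name and the address range in the usage line is a made-up example.

```shell
# Hypothetical helper: write a MetalLB layer 2 ConfigMap for one address range.
# $1 = address range (x.x.x.x-x.x.x.x), $2 = output file (optional).
write_metallb_config() {
  local range="$1"
  local outfile="${2:-metallb-configmap.yaml}"
  cat > "$outfile" <<EOF
apiVersion: v1
kind: ConfigMap
metadata:
  namespace: metallb-system
  name: config
data:
  config: |
    address-pools:
      - name: default
        protocol: layer2
        addresses:
        - ${range}
EOF
}

# Example usage (the range is a placeholder, not a recommendation):
# write_metallb_config "10.0.0.10-10.0.0.20"
# kubectl apply -f metallb-configmap.yaml
```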

Access Kubeflow Central Dashboard

There are multiple ways to access your Kubeflow Central Dashboard:

  • Port Forward: The default way to access the Kubeflow Central Dashboard is port forwarding. You can forward the Istio ingress gateway to local port 8080.

    kubectl --kubeconfig=<NKE_k8s_cluster_kubeconfig_path> port-forward svc/istio-ingressgateway -n istio-system 8080:80

    You can now access the Kubeflow Central Dashboard at http://localhost:8080. At the Dex login page, enter the user credentials that you created earlier.

  • NodePort: To access the dashboard through a NodePort, you need to configure HTTPS. Create a certificate using cert-manager for the worker node IP in your cluster, and add HTTPS to the Kubeflow gateway as described in Istio Secure Gateways. Then access the dashboard over HTTPS using the worker node IP and the node port.

  • LoadBalancer: If you have a LoadBalancer set up (see the optional “Setup a LoadBalancer” section above), you can access the dashboard using the external IP by making the following changes.

    • Update the Istio Gateway to expose port 443 with HTTPS and make port 80 redirect to 443:
      kubectl -n kubeflow edit gateways.networking.istio.io kubeflow-gateway
      The updated gateway spec should look like:
      apiVersion: networking.istio.io/v1alpha3
      kind: Gateway
      metadata:
        name: kubeflow-gateway
        namespace: kubeflow
      spec:
        selector:
          istio: ingressgateway
        servers:
        - hosts:
          - '*'
          port:
            name: http
            number: 80
            protocol: HTTP
          # Upgrade HTTP to HTTPS
          tls:
            httpsRedirect: true
        - hosts:
          - '*'
          port:
            name: https
            number: 443
            protocol: HTTPS
          tls:
            mode: SIMPLE
            privateKey: /etc/istio/ingressgateway-certs/tls.key
            serverCertificate: /etc/istio/ingressgateway-certs/tls.crt
    • Change the type of the istio-ingressgateway service to LoadBalancer
      kubectl -n istio-system  patch service istio-ingressgateway -p '{"spec": {"type": "LoadBalancer"}}'
      Get the IP address for the LoadBalancer
      kubectl -n istio-system get svc istio-ingressgateway -o jsonpath='{.status.loadBalancer.ingress[0]}'
      Create a certificate.yaml with the YAML below to create a self-signed certificate
      apiVersion: cert-manager.io/v1
      kind: Certificate
      metadata:
        name: istio-ingressgateway-certs
        namespace: istio-system
      spec:
        commonName: istio-ingressgateway.istio-system.svc
        isCA: true
        issuerRef:
          kind: ClusterIssuer
          name: kubeflow-self-signing-issuer
        secretName: istio-ingressgateway-certs
      Apply certificate.yaml to the istio-system namespace
      kubectl -n istio-system apply -f certificate.yaml
    • You can now access the Kubeflow dashboard by navigating to the istio-ingressgateway external IP, e.g. x.x.x.x
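
For the LoadBalancer option, the dashboard address can be derived from the service status. The sketch below is illustrative only: dashboard_url is a hypothetical helper name, and it assumes the load balancer reports an IP address (as MetalLB does) rather than a hostname. It prints an HTTPS URL because the gateway change above redirects port 80 to 443.

```shell
# Hypothetical helper: print the HTTPS URL of the Kubeflow dashboard from the
# external IP assigned to the istio-ingressgateway service.
dashboard_url() {
  local ip
  ip="$(kubectl -n istio-system get svc istio-ingressgateway \
    -o jsonpath='{.status.loadBalancer.ingress[0].ip}')"
  if [ -z "$ip" ]; then
    echo "no external IP assigned yet" >&2
    return 1
  fi
  echo "https://${ip}"
}

# Example usage (uncomment to run against your cluster):
# dashboard_url
```

Because the certificate is self-signed, a browser is expected to show a warning on the first visit.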


