Kubeflow

Kubeflow
Original author(s)	Google
Initial release	March 28, 2018; 3 years ago
Stable release	1.3 / April 23, 2021; 9 months ago
Repository	github.com/kubeflow/kubeflow
Platform	Kubernetes
License	Apache License 2.0
Website	www.kubeflow.org

Kubeflow is a free and open-source machine learning platform designed to enable using machine learning pipelines to orchestrate complicated workflows running on Kubernetes (e.g. doing data processing then using TensorFlow or PyTorch to train a model, and deploying to or Seldon). Kubeflow was based on Google's internal method to deploy TensorFlow models called TensorFlow Extended.^[2]

Kubeflow Overview[]

Kubeflow is a free and open-sourced project designed to make running Machine Learning workflows on Kubernetes clusters simpler and more coordinated. This is a Cloud-Native framework for employing Machine Learning in containerized environments in Kubernetes. Kubeflow's integration with and extension of Kubernetes has become seamless and Kubeflow has been designed to run everywhere Kubernetes runs:^[3] on-prem, GCP, AWS, Azure, etc.

Kubeflow began as an internal Google project^[4] as a simpler & easier way to run TensorFlow jobs on Kubernetes, based specifically on the TensorFlow Extended pipeline. Google open-source engineers David Aronchick, Jeremy Lewi and Vishnu Kannan co-founded the Kubeflow project and after its initial release at Kubecon ^[5] companies such as Google, , Cisco, IBM, Red Hat, CoreOS and began publicly contributing to the GitHub issue board.^[6]

What is Kubeflow?[]

At its core, Kubeflow offers an end-to-end orchestration toolkit to build on Kubernetes as a way to deploy, scale and manage complex systems.^[7] Features such as running JupyterHub servers allowing multiple users to contribute to a project simultaneously has become an invaluable asset of Kubeflow. Detailed management of a project and in depth monitoring/analyzing of said project are paramount attributes in Kubeflow.

Data scientists and engineers are now able to develop a complete pipeline composed of segmented steps. These segmented steps in Kubeflow are loosely coupled components of an ML pipeline, a feature not core to other frameworks, allowing pipelines to become easily reusable and modifiable for other jobs. This added flexibility has the potential to save an incalculable amount of labor necessary to develop a new data pipeline for each specific use case. Through this process, Kubeflow aims to simplify Kubernetes deployments while also accounting for future needs of portability and scalability.

Release History[]

Kubeflow 1.0 was announced on February 26, 2020 via a blog post,^[8] The 1.0 release is available through the public GitHub repository.^[9] Specifically, Kubeflow 1.0 focused on stabilizing the following core Kubeflow components: Kubeflow's UI - the central dashboard, Jupyter notebook controller and web app, Tensorflow Operator (TFJob) and PyTorch Operator for distributed training, kfctl for deployment and upgrades, Profile Controller and UI for multiuser management.

Kubeflow 1.1 was released on June 30, 2020,^[10] and is available through the public GitHub repository.^[11] The focus for the release was simplification of notebook automation with Fairing and Kale, MXNet and XGBoost distributed training operators, and multi-user pipelines.

Kubeflow 1.2 was released on November 18, 2020,^[12] and is available through the public GitHub repository.^[13]

Kubeflow 1.3 was released on April 23, 2021,^[14] and is available through the public GitHub repository.^[15]

References[]

^ "The Kubeflow 1.3 software release streamlines ML workflows and simplifies ML platform operations". Kubeflow. Retrieved 2021-06-03.
^ "Kubeflow". Kubeflow. Retrieved 2019-06-18.
^ "Introducing Kubeflow - A Composable, Portable, Scalable ML Stack Built for Kubernetes". kubernetes.io. 21 December 2017. Retrieved 2020-01-09.
^ "Kubeflow". Kubeflow. Retrieved 2020-01-09.
^ Hot Dogs or Not - At Scale with Kubernetes [I] - Vish Kannan & David Aronchick, Google, retrieved 2019-12-20
^ "Kubeflow Issues". Retrieved 2020-01-28.
^ "END-TO-END MACHINE LEARNING STACK". May 17, 2018.
^ Lamkin, Thea (March 3, 2020). "Kubeflow 1.0: Cloud Native ML for Everyone". Medium.
^ "Release v1.0.0 · kubeflow/kubeflow". GitHub.
^ "Kubeflow 1.1 improves ML Workflow Productivity, Isolation & Security, and GitOps". Kubeflow. July 31, 2020.
^ "Release v1.1.0 · kubeflow/kubeflow". GitHub.
^ "Kubeflow 1.2 release announcement". Kubeflow. 2020-11-18. Retrieved 2020-12-13.
^ "Release v1.2.0 · kubeflow/kubeflow". GitHub. Retrieved 2020-12-13.
^ "The Kubeflow 1.3 software release streamlines ML workflows and simplifies ML platform operations". Kubeflow. Retrieved 2021-06-03.
^ "Release v1.3.0 · kubeflow/manifests". GitHub. Retrieved 2021-06-03.

External links[]

kubeflow on GitHub

[1] "The Kubeflow 1.3 software release streamlines ML workflows and simplifies ML platform operations". Kubeflow. Retrieved 2021-06-03.

[2] "Kubeflow". Kubeflow. Retrieved 2019-06-18.

[3] "Introducing Kubeflow - A Composable, Portable, Scalable ML Stack Built for Kubernetes". kubernetes.io. 21 December 2017. Retrieved 2020-01-09.

[4] "Kubeflow". Kubeflow. Retrieved 2020-01-09.

[5] Hot Dogs or Not - At Scale with Kubernetes [I] - Vish Kannan & David Aronchick, Google, retrieved 2019-12-20

[6] "Kubeflow Issues". Retrieved 2020-01-28.

[7] "END-TO-END MACHINE LEARNING STACK". May 17, 2018.

[8] Lamkin, Thea (March 3, 2020). "Kubeflow 1.0: Cloud Native ML for Everyone". Medium.

[9] "Release v1.0.0 · kubeflow/kubeflow". GitHub.

[10] "Kubeflow 1.1 improves ML Workflow Productivity, Isolation & Security, and GitOps". Kubeflow. July 31, 2020.

[11] "Release v1.1.0 · kubeflow/kubeflow". GitHub.

[12] "Kubeflow 1.2 release announcement". Kubeflow. 2020-11-18. Retrieved 2020-12-13.

[13] "Release v1.2.0 · kubeflow/kubeflow". GitHub. Retrieved 2020-12-13.

[14] "The Kubeflow 1.3 software release streamlines ML workflows and simplifies ML platform operations". Kubeflow. Retrieved 2021-06-03.

[15] "Release v1.3.0 · kubeflow/manifests". GitHub. Retrieved 2021-06-03.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]