Files
awesome-kubernetes/docs/mlops.md
Inaki Fernandez e618b8bccc update
2024-01-29 10:18:54 +01:00

36 KiB
Raw Blame History

Machine Learning Ops (MLOps) and Data Science

  1. Introduction. MLOps
  2. MLOps Roadmap
  3. Blogs
  4. ML Infra
  5. Object Detection Libraries
  6. MLFlow
  7. Kubeflow
  8. Flyte
  9. AWS ML
  10. Azure ML
  11. KServe Cloud Native Model Server
  12. Data Science
  13. Machine Learning workloads in kubernetes using Nix and NVIDIA
  14. Other Tools
  15. Debugging ML Jobs
  16. Samples
  17. ML Courses
  18. ML Competitions and Challenges
  19. Polls
  20. Tweets

Introduction. MLOps

MLOps Roadmap

Blogs

ML Infra

Object Detection Libraries

MLFlow

Kubeflow

Flyte

  • https://flyte.org
  • Union Cloud ML and Data Orchestration powered by Flyte
  • mlops.community: MLOps with Flyte: The Convergence of Workflows Between Machine Learning and Engineering
  • ==Machine Learning in Production. What does an end-to-end ML workflow look like in production? (transcript)== 🌟🌟🌟 - Play Recording
    • Kelsey Hightower joined the @flyteorg team to discuss what ML looks like in the real world, from ingesting data to consuming ML models via an API.
    • @kelseyhightower You can't go swimming in a #data_lake if you actually can't swim, right? You're going to drown. 🏊‍♂️
    • @ketanumare Machine Learning products deteriorate in time. If you have the best model today it's not guaranteed to be the best model tomorrow.
    • @thegautam It's hard to verify models before you put them in production. We need our systems to be fully reproducible, which is why an #orchestration_tool is important, running multiple models in parallel.
    • @ketanumare We at @union_ai unify the extremely fragmented world of ML and give the choice to users when to use proprietary technology versus when to use open source. (1/2)
    • @ketanumare #Flyte makes it seamless to work on #kubernetes with spark jobs, and that's a big use case, but you can also use @databricks. Similarly, we are working on Ray and you can also use @anyscalecompute. (2/2)
    • @Ketanumare Most machine learning engineers are not distributed systems engineers. This becomes a challenge when youre deploying models to production. Infrastructure abstraction is key to unlock your teams potential.
    • @ketanumare on #Machine_Learning workflows: Creating Machine Learning workflows is a team sport. 🤝
    • @arnoxmp: A Machine Learning model is often a blackbox. If you encounter new data, do a test run first.
    • @fabio_graetz In classical software engineering the only thing that changes is the code, in a ML system the data can change. You need to version and test data changes.
    • @Forcebananza This is actually one of the reasons I really like using #Flyte. You can map a cell in a notebook to its own task, and they're really easy to compose and reuse and copy and paste around. (1/2)
    • @Forcebananza Jupyter notebooks are great for iterating, but moving more towards a standard software engineering workflow and making that easy enough for data scientists is really really important.(2/2)
    • @jganoff Taking snapshots of petabytes of data is expensive, there are tools that version a dataset without having to copy it. Having metadata separate from the data itself allows you to treat a version of a dataset as if it were code.
    • @SMT_Solvers In F500s it is mostly document OCR. Usually batch jobs - an API wouldnt work - you need the binaries on the server even if it is a sidecar Docker container. One org (not mine) blows $$ doing network transfer from AWS to GCP when GCP could license their OCR in a container.
    • @Forcebananza Flyte creates a way for all these teams to work together partially because writing workflows, writing reusable components… is actually simple enough for data scientists and data engineers to work with.
    • @kelseyhightower We're now at a stage where we can start to leverage systems like https://flyte.org to give us more of an opinionated end-to-end workflow. What we call #ML can become a real discipline where practitioners can use a common set of terms and practices.
  • stackoverflow.com: How is Flyte tailored to "Data and Machine Learning"?
  • union.ai: Production-Grade ML Pipelines: Flyte™ vs. Kubeflow Kubeflow and Flyte are both production-grade, Kubernetes-native orchestrators for machine learning. Which is best for ML engineers? Check out this head-to-head comparison.
  • mlops.community: MLOps Simplified: orchestrating ML pipelines with infrastructure abstraction. Enabled by Flyte
  • medium.com/@timleonardDS: Who Let the DAGs out? Register an External DAG with Flyte (Chapter 3)

AWS ML

Azure ML

KServe Cloud Native Model Server

Data Science

Machine Learning workloads in kubernetes using Nix and NVIDIA

Other Tools

Debugging ML Jobs

  • betterprogramming.pub: Attach a Visual Debugger to ML-training Jobs on Kubernetes
    • As machine learning models grow in size and complexity, cloud resources are more and more often required for training. However, debugging training jobs running in the cloud can be time-consuming and challenging. In this blog post, well explore how to attach a visual debugger in VSCode to a remote deep learning training environment, making debugging simpler and more efficient.
    • In this tutorial, you'll deploy a local Kubernetes cluster with k3d, install the MLOps workflow orchestration engine Flyte, create a simple training workflow, and finally visually debug it using VSCode and debugpy

Samples

  • fepegar/vesseg Brain vessel segmentation using 3D convolutional neural networks
  • github.com/10tanmay100: MEDICAL-DATA-PROJECT-END2END-WITH-FEW-MLOPS We are on a mission to transform medical data into actionable insights using the power of machine learning. Whether you are a data scientist, healthcare professional, or an enthusiast in the field, your contributions and ideas are invaluable to us. Join us in making a difference!

ML Courses

ML Competitions and Challenges

Polls

??? note "Click to expand!"

<center>
[![MLOps Workflow Scheduler Poll](images/mlops_workflow_scheduler_poll.png)](https://www.linkedin.com/feed/update/urn:li:activity:6923979009311559680)
</center>

Tweets

Click to expand!

To my JVM friends looking to explore Machine Learning techniques - you dont necessarily have to learn Python to do that. There are libraries you can use from the comfort of your JVM environment. 🧵👇

— Maria Khalusova (@mariaKhalusova) November 26, 2020
<script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>

You don't need to go to a university to learn machine learning - you can do it from your living room, for completely free.

Here is an extensive list of curated free courses and tutorials, from beginner to advanced. ↓

(Trust me, you want to bookmark this tweet.)

— Tivadar Danka (@TivadarDanka) September 21, 2021
<script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>

I started taking data science courses last year, after studying and coding for at least 10 hours 6 days a week and doing several ML projects alongside data analysis projects, I finally got my first data analyst offer from a Nigerian bank last week after countless rejections

— Sam (@SamsonTontoye) February 20, 2022
<script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>

Deep Neural Networks are used for many applications. One I'm particularly fond of is medical imaging. A trained model can process the input thanks to the activation functions propagating through a network of perceptrons and generating the output of interest.#NeuralNets #Medical pic.twitter.com/vPwm0TfHnn

— Valerio Pergola (@valerio_pergola) July 6, 2022
<script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>

#3D intracranial artery segmentation using a convolutional neural networks #CNN - #opensource > https://t.co/Z2WDp2UOl3 | #python #TensorFlow #DeepLearning #MachineLearning #Nvidia #GPU #brain #medical #conda #Neurology #Artificial_Intelligence #medical_imaging #Nifti pic.twitter.com/eKrBBuFxSy

— NewUlmDesign (@ulmdesign) July 7, 2022
<script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>

https://t.co/WxspfKvLFS

— nubenetes (@nubenetes) July 22, 2022
<script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>

@kelseyhightower We're now at a stage where we can start to leverage systems like #Flyte to give us more of an opinionated end-to-end workflow. What we call #ML can become a real discipline where practitioners can use a common set of terms and practices.#KelseyTakesFlyte #MLOps

— Flyte (@flyteorg) July 22, 2022
<script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>

If you're not utilizing AI, you're falling behind.

Here are 7 free AI tools that'll save you hours of work:

— Nikki Siapno (@NikkiSiapno) October 24, 2022
<script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>

Machine Learning will be one of the most sought-after professions this decade.

Learn & practice ML for free with these outstanding resources and earn certificates for your resume:

— Simon (@simonholdorf) February 25, 2023
<script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>

Building robust #data and #ML pipelines by tapping into the power of multiple tools and integrating them should not be a challenging task.

With Flyte, you can simplify the entire process of developing data and ML pipelines through access to more than 30 integrations. pic.twitter.com/UBege732tQ

— Flyte (@flyteorg) March 9, 2023
<script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>