librebyte@lemmy.ml (moderator) to LibreByte@lemmy.ml · English · 3 days ago

Navigating Failures in Pods With Devices (Kubernetes)

kubernetes.io

Kubernetes is the de facto standard for container orchestration, but when it comes to handling specialized hardware like GPUs and other accelerators, things get a bit complicated. This blog post dives into the challenges of managing failure modes when operating pods with devices in Kubernetes, based on insights from Sergey Kanzhelev and Mrunal Patel's talk at KubeCon NA 2024. You can follow the links to slides and recording.

The AI/ML boom and its impact on Kubernetes

The rise of AI/ML workloads has brought new challenges to Kubernetes.

LibreByte@lemmy.ml

librebyte@lemmy.ml


Free technologies for the community.

You can submit posts about free technologies to this community in Spanish or English.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

  • 3 users / day
  • 67 users / week
  • 274 users / month
  • 672 users / 6 months
  • 1 local subscriber
  • 160 subscribers
  • 1.55K Posts
  • 135 Comments
  • mods:
  • librebyte@lemmy.ml