Loading…
In-person
18 April 2023
Learn More

The Sched app allows you to build your schedule, but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon Europe 2023, and have an All-Access pass in order to participate in the sessions.

The KubeCon + CloudNativeCon Only virtual pass is still available. With this pass you get all the fantastic content you’ve come to expect from KubeCon + CloudNativeCon but from the comfort of your own home!  *Observability Day + CiliumCon will be available via livestream on the virtual platform, all other co-located events recordings will be available 24-72 hours post-event on the CNCF YouTube channel.

Thank you to our CiliumCon livestream sponsor, Isovalent and our Observability Day livestream sponsor, Lightstep! Join the conversation on Cilium Slack.

Please note: This schedule is automatically displayed in Central European Summer Time (UTC +2). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."

To view the full event schedule for a specific CNCF-hosted Co-located event, you can use the right-hand navigation bar to sort and filter.


The schedule is subject to change.
Tuesday, April 18 • 15:50 - 16:15
SLA Aware Batch Scheduling in Apache YuniKorn with Multi-Tenant Preemption - Sunil Govindan & Craig Condit, Cloudera

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
Multi tenant batch processing is one of the difficult resource-intensive data engineering use cases in Kubernetes. Today, a simple rule that governs both scheduling order and the preemption behavior in the default scheduler is that a pod with a higher priority is more important than a pod with a lower priority, and this crosses namespaces. We need more. When looking at preemption from a batch workload, we need to take into account the possibility of an opt-out from preemption by considering the context of the type of workload, including user quotas, hierarchical queues, Spark driver pod, etc. How do we get that accuracy and flexibility? What are the “The Laws of Preemption” for batch processing? Most of the rules given around tasks and queues are there to prevent a preemption storm or loop. How to define guardrails around preempting a workload from one tenant and giving the resources to another? In a cluster that runs workloads for multiple tenants, preemption should be tenant aware. A tenant could map to a queue hierarchy, thus crossing tenant boundaries. Preemption for YuniKorn is based on the hierarchical queue model and guaranteed resources assigned to a queue. In this talk, we will dive deep into the batch preemption capability of YuniKorn as a K8s scheduler plugin.

Speakers
CC

Craig Condit

Senior Staff Engineer, Cloudera
Craig Condit is an open source, cloud native, and high performance computing enthusiast. He has been a software engineer for over 20 years, and has worked in Big Data for over a decade, with extensive experience in Apache Hadoop, YARN, and YuniKorn. He is an Apache YuniKorn committer... Read More →
avatar for Sunil Govindan

Sunil Govindan

Senior Engineer Manager, Cloudera
Sunil Govindan is Senior Engineering Manager@Cloudera. Sunil is primarily focussing on Resource Scheduling areas and contributing to Apache YuniKorn Scheduler for Kubernetes as a Committer and member of PMC from 2019 onwards. He has been an ASF Member since 2022 and has been contributing... Read More →



Tuesday April 18, 2023 15:50 - 16:15 CEST
Hall 7, Room A | Ground Floor | Europe Complex