Rafay Systems
  • Videos: 189
  • Views: 55,293
AutoML using Katib in Kubeflow
This video presents a thorough exploration of AutoML using Katib within the Kubeflow ecosystem. Drawing from the Getting Started Exercise published by Rafay, we will guide you through the essential steps to implement AutoML effectively. This resource is ideal for individuals aiming to deepen their expertise in machine learning automation and understand the capabilities of Katib in a cloud-native environment. #DataScience #AutoML #Kubeflow
docs.rafay.co/aiml/mlops-kubeflow/gs/katib/overview/
Views: 30

Videos

Temporary Access/Break Glass Workflows for Kubernetes using Rafay
30 views, 19 hours ago
This video delves into the critical topic of Temporary Access and Break Glass Workflows for Kubernetes, showcasing the capabilities of Rafay. We will discuss the challenges organizations face in managing access and how Rafay provides effective solutions. Through detailed examples, we aim to equip viewers with the knowledge to enhance their Kubernetes security posture. #KubernetesSecurity #Rafay...
Distributed Training on Ray using PyTorch
54 views, 14 days ago
Delve into the process of distributed training on Ray utilizing PyTorch. Viewers will learn how to set up parallel training tasks, where each worker independently trains a separate instance of a model. The video is based on Rafay's comprehensive Getting Started Guide, which provides a step-by-step overview of aggregating trained parameters from multiple workers. Join us as we demonstrate the tr...
Distributed Training of a simple TensorFlow model using Ray and TensorFlow's MirroredStrategy.
76 views, 14 days ago
In this video, we explore the distributed training of a simple TensorFlow model utilizing Ray and TensorFlow's MirroredStrategy. Viewers will learn how to effectively set up a Ray Endpoint through Rafay's "Ray as a Service" offering, streamlining the process of model training. This tutorial is based on Rafay's comprehensive Getting Started with Ray Guide, providing essential insights and practi...
Rafay's Zero Code Generative AI Workbench
157 views, 14 days ago
Discover the potential of Rafay's zero code Generative AI workbench in this informative video. We will begin with a brief introduction to the platform, highlighting its user-friendly interface and unique functionalities. The demonstration will illustrate how users can effortlessly create AI models, making advanced technology accessible to everyone. #AIWorkbench #TechInnovation #Rafay #finetuning
Deep Dive into Rafay's Ray as Service Offering
24 views, 14 days ago
Join us in this video as we begin by showcasing the self-service experience from the end user's viewpoint. We will then transition to an exploration of the architecture that supports this experience, specifically focusing on the creation of individual vClusters for users, managed by the KubeRay operator. This design allows for the simultaneous operation of hundreds of Ray Tenants within their o...
Self Service user experience to launch a Multitenant Ray Service on a Shared Host Cluster
19 views, 14 days ago
This video serves as a guide for data scientists and ML researchers on how to access a self-service platform for launching and managing a Ray Tenant within a shared host Kubernetes cluster. The Ray tenant is designed to function in an isolated virtual cluster, providing a secure and efficient environment for your projects. Discover the advantages and functionalities that this setup offers to en...
Use TensorBoard to visualize TensorFlow generated models in Kubeflow
23 views, 21 days ago
This video serves as a guide to leveraging TensorBoard to visualize TensorFlow deep learning models in a Kubeflow Pipeline. Rafay's Kubeflow as a Service MLOps provides a robust turnkey integration of TensorBoard, enabling organizations to rapidly scale their MLOps initiatives. Join us to explore how this integration can streamline your machine learning processes. #DeepLearning #TensorFlow #MLO...
Integrated GPU Metrics via Dashboards in the Rafay Platform
51 views, 1 month ago
Take an in-depth examination of the integrated GPU metrics offered by the Rafay Platform, illustrated through engaging dynamic dashboards. This video will highlight how these visual tools facilitate seamless monitoring and analysis of GPU performance. We will also discuss the critical role these metrics play in fostering improved decision-making and enhancing application performance in various ...
End to End MLOps Pipeline for Training and Inference
86 views, 1 month ago
In this informative video, we present a comprehensive demonstration of an end-to-end MLOps pipeline utilizing Kubeflow, MLflow, and KServe, all orchestrated through Rafay. Viewers will gain insights into the practical application of these powerful tools as we work with the IRIS dataset, following the structured guidance provided in Rafay's Get Started Guide. This tutorial is designed fo...
Create Container Image in Kubeflow Pipeline using Kaniko
35 views, 1 month ago
This detailed tutorial explores the creation of a container image in a Kubeflow Pipeline using Kaniko. This video covers the entire workflow, highlighting key configurations and tips to optimize your image building process. Whether you are new to Kubeflow or looking to refine your existing skills, this guide will equip you with the knowledge needed to effectively use Kaniko in your projects. En...
PyTorch vs TensorFlow in 2024
207 views, 1 month ago
Multi Cluster add-on management using Golden Blueprints
38 views, 1 month ago
Explore Rafay's Golden Blueprints, a comprehensive framework designed to ensure that every cluster within an organization is equipped with a standardized set of essential add-ons mandated by the platform team. Discover how implementing these blueprints can streamline operations, enhance consistency, and improve overall efficiency across your cloud infrastructure. Join us as we delve into the be...
Rafay MLOps Platform based on Kubeflow on Google Cloud
127 views, 2 months ago
Discover the capabilities of the Rafay MLOps Platform, built on Kubeflow and hosted on Google Cloud. This video provides an in-depth exploration of how Rafay streamlines machine learning operations, enhances collaboration, and accelerates deployment processes. Learn about the key features and benefits that make this platform a powerful tool for data scientists and machine learning engineers. Jo...
Scale Upstream Kubernetes Clusters on Nutanix based on Rafay MKS using GitOps
21 views, 2 months ago
In-Place Kubernetes Upgrades of Rafay MKS based Clusters on Nutanix
29 views, 2 months ago
Provision Upstream Kubernetes on Nutanix using Rafay MKS Distribution
26 views, 2 months ago
In Place OS Upgrades for Nodes in Rafay MKS based Upstream Kubernetes Clusters
26 views, 2 months ago
Rafay PaaS For NVIDIA Customers And Partners
314 views, 2 months ago
Developer Access to Amazon EKS Clusters via 3 Supported Modes using Rafay
28 views, 3 months ago
Manage Pod Identity Associations in EKS Clusters using Rafay
23 views, 3 months ago
GitOps based Approach to manage Lifecycle Management of Upstream Kubernetes Clusters
30 views, 3 months ago
Lifecycle Management of Rafay MKS Clusters (Upstream Kubernetes for Bare Metal and VMs)
31 views, 3 months ago
VMs on AWS with a VMware vSphere User Experience
33 views, 4 months ago
Certificate Rotation Workflows for Upstream Kubernetes Clusters using Rafay Kubernetes Manager
48 views, 4 months ago
Developer Self Service of AWS Resources with Rafay Environment Manager
49 views, 4 months ago
Cloud Harmony: Eliminating Infrastructure Siloes
29 views, 4 months ago
Troubleshooting and Debug EKS Cluster Provisioning using Rafay
34 views, 4 months ago
Streamline AI/ML Adoption: Expert Strategies to Conquer IT Hurdles and Accelerate Growth
35 views, 5 months ago
Unleashing Developer & Cloud Ops Superpowers: Boost Productivity with Next-Level Infrastructure
61 views, 6 months ago

Comments

  • @donson3326, 22 days ago

    The failure mode is wrong. When GPUs are underutilized during training, it's usually a hardware issue.

  • @mithunmanoharmithun, 5 months ago

    Great talk. Learned so much.

  • @AkshayDurgade, 1 year ago

    Can you share the Terraform repositories for this? I am hitting multiple issues, such as: Error: unable to complete request code 400, body {"internal":"unable to update and publish cluster rpc error: code = InvalidArgument desc = Could not facilitate the request Error fetching cloud credentials provider ","code":1,"external":"bad request"}. Even when Terraform reports resources such as credentials as created, they do not appear in the UI.

  • @AkshayDurgade, 1 year ago

    Can you please provide the Terraform code or its repository in the description?

  • @akshay_durgade, 1 year ago

    Can you please provide the Terraform code or its repository in the description?

    • @rafaysystems7900, 1 year ago

      github.com/RafaySystems/getstarted/tree/master/terraform/aks

  • @tube77tdf, 1 year ago

    Some audio narration would be helpful.
