Real-Time Forecasting at Scale using Delta Lake and Delta Caching

Поделиться
HTML-код
  • Опубликовано: 26 ноя 2024

Комментарии • 1

  • @HimanshuKhairajani-c6u
    @HimanshuKhairajani-c6u 8 месяцев назад

    In a Spark Cluster, when multiple job runs process the same data and store it back, the cluster should cache the data effectively for each job run. This means that once the data is processed by a job, it can be stored and efficiently accessed by subsequent job runs without needing to reload it from scratch each time.
    Is the above use-case suited for Disk Caching ?