I/O model based on HDF5 - Hua Xu, Gnosis Research Center (IIT) - HUG24

Published: Sep 15, 2024
From the 2024 HDF5 User Group Meeting (#HUG24) held August 5-7, 2024 in Chicago, IL.
I/O model based on HDF5 - Hua Xu, Gnosis Research Center (IIT)
As computer applications become more data-intensive, their demands on storage systems for efficient storage and retrieval have increased significantly. Compute resources on clusters are often allocated exclusively to individual users to maximize performance, whereas storage resources are shared across users for better utilization. In such shared environments, an application's I/O performance can vary significantly because of interference from other jobs. A related problem is scheduling user jobs on a cluster to maximize resource utilization and minimize total execution time. A data acquisition system (DAC) deployed on a cluster can help job schedulers make informed scheduling decisions. In this work we propose a DAC with predictive models that learn the I/O workloads on clusters and predict system performance.
Modeling the performance of the storage layer on clusters is challenging because of multiple interacting software components, sophisticated hardware, variable file types and on-disk layouts, and variable I/O traffic from users. User-observed I/O performance depends on the I/O library and how it uses the file system. The I/O library's metadata APIs and the parallelism available in the file system both affect parallel I/O performance. The file layout on disk (stripe count and stripe size) is another significant factor, because it affects load balance and parallelism in the storage layer. The impact of interference from other users is hard to model accurately, and this interference is one of the reasons why empirical models of I/O performance and storage systems have not been successful for modern HPC systems.
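To illustrate why stripe count and stripe size affect load balance, here is a minimal Python sketch that splits one contiguous request across I/O servers. It assumes plain round-robin striping starting at server 0; the function name `bytes_per_server` and the example sizes are hypothetical and are not taken from the talk or from PVFS2 itself.

```python
from collections import Counter

def bytes_per_server(offset, length, stripe_size, stripe_count):
    """Split a contiguous byte range across servers.

    Assumes simple round-robin striping: stripe k of the file lives on
    server k % stripe_count. Returns {server_index: bytes_served}.
    """
    load = Counter()
    pos = offset
    end = offset + length
    while pos < end:
        stripe_index = pos // stripe_size
        server = stripe_index % stripe_count
        stripe_end = (stripe_index + 1) * stripe_size
        # Take whatever part of the request falls inside this stripe.
        chunk = min(end, stripe_end) - pos
        load[server] += chunk
        pos += chunk
    return dict(load)

# A 256 KiB aligned request over four servers with 64 KiB stripes
# is spread evenly; the same request with one server is not parallel.
even = bytes_per_server(0, 4 * 65536, 65536, 4)
single = bytes_per_server(0, 4 * 65536, 65536, 1)
# An unaligned 64 KiB request straddles two stripes.
straddle = bytes_per_server(32768, 65536, 65536, 4)
```

Under these assumptions, the number of distinct keys in the result is the parallelism available to the request, and the largest value is the load on the busiest server.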
We propose a supervised-learning-based I/O model that updates itself with feedback from the cluster. The model will predict the I/O time (read/write) per process for a given file layout, average I/O request size (number of bytes), number of concurrent readers/writers, number of I/O servers, and number of storage disks. The learning framework will consist of a trained base performance model that is continually updated as new data arrives. Updates will be incorporated into the base model in a way that minimizes the influence of outliers, so that predictions remain accurate in the presence of interference. The predicted I/O performance is an indicator of the current load on the storage servers. It can be used by the job scheduling algorithm to minimize the total I/O time of a set of I/O jobs on the cluster.
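The abstract does not say which learning algorithm is used. As one possible reading of "continually updated" with "minimized influence of outliers", the sketch below uses a linear model trained online with a Huber-style clipped gradient. The class name `OnlineHuberRegressor`, the single normalized feature, and the synthetic measurements are all assumptions made for illustration; they are not the authors' model or data.

```python
class OnlineHuberRegressor:
    """Linear model updated one sample at a time with a clipped gradient.

    Clipping the residual to [-delta, delta] is the gradient of the
    Huber loss, so a single interfered measurement can only move the
    weights by a bounded amount.
    """

    def __init__(self, n_features, lr=0.05, delta=1.0):
        self.w = [0.0] * n_features
        self.b = 0.0
        self.lr = lr
        self.delta = delta

    def predict(self, x):
        return self.b + sum(wi * xi for wi, xi in zip(self.w, x))

    def update(self, x, y):
        residual = self.predict(x) - y
        g = max(-self.delta, min(self.delta, residual))
        self.w = [wi - self.lr * g * xi for wi, xi in zip(self.w, x)]
        self.b -= self.lr * g
        return residual

# Synthetic feedback loop (made-up numbers): the "true" I/O time is
# 2 * x + 1 for a normalized request size x in [0, 1], and every 20th
# measurement is inflated by 50 to mimic interference from other jobs.
model = OnlineHuberRegressor(n_features=1)
for i in range(5000):
    x = [(i % 11) / 10]
    y = 2.0 * x[0] + 1.0
    if i % 20 == 0:
        y += 50.0
    model.update(x, y)

predicted = model.predict([0.5])
```

In a real deployment the feature vector would carry the quantities listed above (file layout, request size, readers/writers, servers, disks), scaled to comparable ranges, and the base model would be trained offline before online updates begin.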
This work will be carried out on the Ares cluster, which consists of one rack of compute nodes. All nodes share a 48TB RAID-5 storage pool composed of eight 8TB 7200 RPM SAS hard drives. Nodes within each rack are connected by 40Gbps Ethernet with RoCE support. The model will be built and analyzed for the HDF5 file format, with ROMIO extensions for MPI-IO and PVFS2 (Parallel Virtual File System). Key parameters in HDF5 and PVFS, such as the number of processes, servers, and clients in PVFS, and the stripe size, are treated as significant parameters for the model.
For more information on this conference including all sessions and slide decks, visit www.hdfgroup.o... To learn more about upcoming HUG events, please visit www.hdfgroup.o...
