2024-09-03 - 2024-09-04
AIDA days technical workshop
AIDA days workshop with technical theme on reproducible machine learning and efficient dataloading.
Day 1: Reproducible Machine Learning Research
Presentation | Reproducible ML |
Code | Reproducible machine learning research |
A deep dive into automating experiments to achieve both reproducibility and extendability in machine learning research. This session will explore MLFlow, a powerful library for experiment tracking, and cover techniques for configuring experiments and optimizing hyperparameters. Learn how to create machine learning science that not only delivers deterministic reproducibility but also yields generalizable results.
- Strategies for reproducible and extendable research
- In-depth exploration of MLFlow for experiment tracking
- Techniques for flexible experiment configuration and hyperparameter optimization
Who Should Attend: This workshop is tailored for experienced Python users familiar with training machine learning models. While the techniques presented are broadly applicable to any Python-based framework, PyTorch will be used as the primary example.
Day 2: Model computational optimization
Presentation | Faster data loading |
Code | Faster data loading with DALI |
Address the common bottleneck of data loading in neural network training. This workshop focuses on data augmentation challenges in imaging, with a particular emphasis on medical imaging. Discover how to use NVIDIA’s DALI framework to offload data augmentation to the GPU, enhancing training efficiency and reducing processing time.
- Practical implementation of NVIDIA DALI for data loading
- Techniques to optimize GPU usage for faster training
- Solutions to overcome data augmentation bottlenecks
Who Should Attend: Ideal for attendees with experience in training machine learning models using PyTorch Vision. While DALI can be integrated with other frameworks like TensorFlow or Jax, the focus will be on its application with PyTorch.