LUMI AI guide

This guide is designed to assist users in migrating their machine learning applications from smaller-scale computing environments to the LUMI supercomputer. We will walk you through a detailed example of training an image classification model using PyTorch's Vision Transformer (VIT) on the ImageNet dataset.

All Python and bash scripts referenced in this guide are accessible in this GitHub repository. We start with a basic python script, visiontransformer.py, that could run on your local machine and modify it over the next chapters to run it efficiently on LUMI.

Even though this guide uses PyTorch, most of the covered topics are independent of the used machine learning framework. We therefore believe this guide is helpful for all new ML users on LUMI while also providing a concrete example that runs on LUMI.

Important

PyTorch containers on LUMI will in the future be provided by the LUMI AI Factory. This guide will soon be updated to utilize these new containers. The containers currently referenced in this guide remain available on LUMI but will no longer receive updates. However, all examples included in this guide will continue to work as they currently do. For more information about the new containers, refer to the LUMI AI Factory AI Software Environment documentation.

Requirements

Before proceeding, please ensure you meet the following prerequisites:

A basic understanding of machine learning concepts and Python programming. This guide will focus primarily on aspects specific to training models on LUMI.
An active user account on LUMI and familiarity with its basic operations.
If you wish to run the included examples, you need to be part of a project with GPU hours on LUMI.

Name		Name	Last commit message	Last commit date
Latest commit History 281 Commits
1-quickstart		1-quickstart
2-setting-up-environment		2-setting-up-environment
3-file-formats		3-file-formats
4-data-storage		4-data-storage
5-multi-gpu-and-node		5-multi-gpu-and-node
6-monitoring-and-profiling		6-monitoring-and-profiling
7-TensorBoard-visualization		7-TensorBoard-visualization
8-MLflow-visualization		8-MLflow-visualization
9-Wandb-visualization		9-Wandb-visualization
assets/images		assets/images
resources		resources
.gitignore		.gitignore
LICENSE		LICENSE
LICENSE-CODE		LICENSE-CODE
README.md		README.md
citation.cff		citation.cff

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

LUMI AI guide

Requirements

Table of contents

Further reading

About

Licenses found

Uh oh!

Releases

Packages

Uh oh!

Contributors 11

Uh oh!

Languages

License

Licenses found

Lumi-supercomputer/LUMI-AI-Guide

Folders and files

Latest commit

History

Repository files navigation

LUMI AI guide

Requirements

Table of contents

Further reading

About

Topics

Resources

License

Licenses found

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 11

Uh oh!

Languages

Packages