███ ██ ███████ ██ ██ ██████ ███████
████ ██ ██ ██ ██ ██ ██ ██
██ ██ ██ █████ █████ ██████ ███████
██ ██ ██ ██ ██ ██ ██ ██ ██
██ ████ ███████ ██ ██ ██ ██ ███████
(c) 2019-2024 UCHICAGO ARGONNE, LLC
nekRS-ML is a fork of the ALCF-managed nekRS v24 computational fluid dynamics (CFD) solver, augmented to provide examples and capabilities for AI-enabled CFD research on HPC systems. It is meant to be a sandbox showcasing ways in which ML methods and in-situ workflows can be used to integrate AI with traditional CFD simulations on HPC systems.
Some key functionalities of nekRS-ML are:
- Graph neural network (GNN) modeling: Dist-GNN is a scalable and consistent GNN for mesh-based modeling of dynamical systems on very large graphs. It relies on tailored neural message-passing layers and loss constructions to guarantee arithmetic consistency on domain-decomposed graphs partitioned similarly to a CFD mesh (a minimal sketch of such a partitioned graph follows this list). It can be used to perform both time-dependent modeling (e.g., advancing the solution field) and time-independent modeling (e.g., predicting one flow quantity from another).
- Conversion tools for mesh-based distributed GNN modeling: nekRS-ML provides a GNN plugin capable of extracting the necessary information from nekRS to construct the partitioned graph needed by Dist-GNN. The same GNN plugin, together with the trajectory generation plugin, can be used to extract field information from nekRS and produce training data for Dist-GNN. Both plugins can also create graphs and the corresponding training data from p-coarsened nekRS meshes, enabling the development of surrogates on coarser discretizations.
- Data streaming with ADIOS2: nekRS v24 comes with ADIOS2 for I/O; nekRS-ML expands this usage of ADIOS2 to stream data between nekRS and GNN training, enabling online (or in-situ) training and fine-tuning of the ML models.
- In-memory data staging with SmartSim: nekRS-ML can also be linked to the SmartRedis library, which, when coupled with a SmartSim workflow, enables online training and inference with in-memory data staging.
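To make the partitioned-graph idea concrete, below is a minimal sketch (not the actual Dist-GNN or GNN-plugin code) of how one rank's portion of a domain-decomposed mesh graph could be represented with PyTorch Geometric. The node counts, field names, and halo marking are illustrative assumptions only.

    # Minimal illustrative sketch, not the Dist-GNN implementation.
    import torch
    from torch_geometric.data import Data
    from torch_geometric.nn import MessagePassing

    class AddConv(MessagePassing):
        """Bare-bones message-passing layer with sum aggregation."""
        def __init__(self, channels):
            super().__init__(aggr="add")
            self.lin = torch.nn.Linear(channels, channels)

        def forward(self, x, edge_index):
            return self.propagate(edge_index, x=self.lin(x))

    # One rank's local subgraph: 4 owned nodes plus 1 halo node shared with a neighbor rank.
    pos = torch.rand(5, 3)                        # node coordinates from the mesh
    x = torch.rand(5, 1)                          # nodal field (e.g., one velocity component)
    edge_index = torch.tensor([[0, 1, 2, 3, 4],
                               [1, 2, 3, 4, 0]])  # local connectivity in COO format
    halo = torch.tensor([0, 0, 0, 0, 1], dtype=torch.bool)  # marks shared nodes
    graph = Data(x=x, pos=pos, edge_index=edge_index, halo=halo)

    out = AddConv(1)(graph.x, graph.edge_index)
    # In the distributed setting, the contributions accumulated at shared (halo)
    # nodes are then exchanged and summed across ranks so the result matches a
    # single-graph computation; that communication step is omitted here.

In nekRS-ML, the actual graph layout and halo bookkeeping are produced by the GNN plugin described above rather than constructed by hand as in this sketch.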
nekRS-ML hosts a series of AI-enabled examples listed below in order of complexity to provide a smooth learning progression. Users can find more details on each of the examples in the README files contained within the respective directories.
- turbChannel_wallModel: Wall-modeled LES of a turbulent channel flow at a friction Reynolds number of 950 with an equilibrium wall model.
- tgv_gnn_offline: Offline training pipeline to generate data and perform time-independent training of the Dist-GNN model.
- tgv_gnn_offline_fine_mesh: Offline training pipeline to generate data and perform time-independent training of the Dist-GNN model on a p-coarsened grid relative to the one used by the nekRS simulation.
- tgv_gnn_traj_offline: Offline training pipeline to generate data and perform time-dependent training of the Dist-GNN model.
- turbChannel_wallModel_ML: Online training and inference workflows of a data-driven wall shear stress model for LES applied to a turbulent channel flow at a friction Reynolds number of 950.
- tgv_gnn_online: Online training workflow using SmartSim to concurrently generate data and perform time-independent training of the Dist-GNN model (a minimal sketch of the in-memory data-staging pattern follows this list).
- tgv_gnn_traj_online: Online training workflow using SmartSim to concurrently generate data and perform time-dependent training of the Dist-GNN model.
- tgv_gnn_traj_online_adios: Online training workflow using ADIOS2 to concurrently generate data and perform time-dependent training of the Dist-GNN model.
- shooting_workflow_smartredis: Online training workflow using SmartSim to shoot the nekRS solution forward in time by leveraging the Dist-GNN model.
- shooting_workflow_adios: Online training workflow using ADIOS2 to shoot the nekRS solution forward in time by leveraging the Dist-GNN model.
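As a rough illustration of the in-memory data-staging pattern used by the SmartSim-driven examples above, the sketch below stages a tensor with SmartRedis and reads it back. It assumes a SmartSim Orchestrator (Redis database) is already running and reachable through the SSDB environment variable; the tensor key name is hypothetical.

    # Minimal staging sketch; assumes a SmartSim-launched database and SSDB set.
    import numpy as np
    from smartredis import Client

    client = Client(cluster=False)  # connects to the database address in $SSDB

    # Producer side (e.g., the solver) stages a field snapshot in memory...
    field = np.random.rand(1024)
    client.put_tensor("u_snapshot_000", field)

    # ...and the consumer side (e.g., the trainer) polls for the key and reads it back.
    if client.poll_tensor("u_snapshot_000", 100, 10):  # poll every 100 ms, up to 10 tries
        data = client.get_tensor("u_snapshot_000")

In the examples themselves, the solver-side staging goes through the SmartRedis client linked into nekRS, while the Python-based training scripts consume the staged tensors.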
Requirements:
- Linux, Mac OS X (Microsoft WSL and Windows are not supported)
- GNU/oneAPI/NVHPC/ROCm compilers (C++17/C99 compatible)
- MPI-3.1 or later
- CMake version 3.21 or later
Optional requirements:
- PyTorch and PyTorch Geometric (for the examples using the GNN)
- SmartSim and SmartRedis (for the examples using SmartSim as a workflow driver)
To build nekRS and the required dependencies, first clone our GitHub repository:
https://github.com/argonne-lcf/nekRS-ML.git
The main (default) branch always points to the latest stable version of the code. Other branches available in the repository should be considered experimental.
Then, simply execute one of the build scripts contained in the repository. The HPC systems currently supported are:
- Polaris @ Argonne LCF
- Aurora @ Argonne LCF
- Crux @ Argonne LCF (limited support for ML-enabled examples)
For example, to build nekRS-ML on Aurora without the SmartRedis client, execute from a compute node
./BuildMeOnAurora
If instead the SmartRedis client is desired, execute
ENABLE_SMARTREDIS=ON ./BuildMeOnAurora
If a build script for a specific HPC system is not available, please submit an issue or feel free to contribute a PR (see below for details on both).
To run any of the AI-enabled examples listed above, simply cd to the example directory of interest and from a compute node execute
./gen_run_script <system_name> </path/to/nekRS>
or
./gen_run_script <system_name> </path/to/nekRS> </path/to/venv/bin/activate>
if you have the necessary packages already installed in a Python virtual environment.
The script will produce a run.sh script specifically tailored to the desired system and the desired nekRS install directory.
Finally, the examples are run from the compute nodes by executing
./run.sh
For documentation on the nekRS solver, see the readthedocs page. Please note these pages are a work in progress. For documentation on the specific nekRS-ML examples, we encourage users to follow the README files within each example directory.
For nekRS-specific questions, please visit the GitHub Discussions page, where nekRS developers help, find solutions, share ideas, and follow discussions.
Our project is hosted on GitHub. To learn how to contribute, see CONTRIBUTING.md.
All bugs are reported and tracked through Issues. If you are having trouble installing the code or getting your case to run properly, please submit an issue.
nekRS is released under the BSD 3-clause license (see the LICENSE file).
All new contributions must be made under the BSD 3-clause license.
This research was supported by the Exascale Computing Project (17-SC-20-SC), a joint project of the U.S. Department of Energy's Office of Science and National Nuclear Security Administration, responsible for delivering a capable exascale ecosystem, including software, applications, and hardware technology, to support the nation's exascale computing imperative.