feat(deps): Use mlflow-skinny instead of mlflow #418

gmertes · 2025-07-11T15:18:58Z

Description

Base mlflow pulls in extra dependencies required to run the mlflow server locally, which we don't actually need. We can use the mlflow-skinny package, which only contains the logging functionality. This cuts down on a few dependencies and saves some bytes in our CI/CD pipelines.

For users that do run a local mlflow server (do we have any?), they can just install the regular mlflow. Installing mlflow on top of mlflow-skinny doesn't hurt (they both install the same mlflow module under the hood).
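A quick way to see which of the two distributions is present in an environment is to query the installed package metadata. The snippet below is a minimal sketch using only the standard library; the helper name `installed_mlflow_distributions` is hypothetical and not part of this PR or of mlflow.

```python
from importlib import metadata


def installed_mlflow_distributions(names=("mlflow", "mlflow-skinny")):
    """Return {distribution name: installed version or None} for each name.

    Hypothetical helper for illustration; it only inspects package metadata
    and does not import mlflow itself.
    """
    found = {}
    for name in names:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The distribution is not installed in this environment.
            found[name] = None
    return found


if __name__ == "__main__":
    for name, version in installed_mlflow_distributions().items():
        print(f"{name}: {version or 'not installed'}")
```

If both names report a version, the full mlflow has been installed on top of mlflow-skinny, which is the case the description says is harmless.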

I confirmed that both online and offline logging still work with mlflow-skinny.

As a contributor to the Anemoi framework, please ensure that your changes include unit tests, updates to any affected dependencies and documentation, and have been tested in a parallel setting (i.e., with multiple GPUs). As a reviewer, you are also responsible for verifying these aspects and requesting changes if they are not adequately addressed. For guidelines about those please refer to https://anemoi.readthedocs.io/en/latest/

By opening this pull request, I affirm that all authors agree to the Contributor License Agreement (https://github.com/ecmwf/codex/blob/main/Legal/contributor_license_agreement.md).

anaprietonem · 2025-07-14T08:15:48Z

Thanks for the PR @gmertes! Looks good, and logging online and offline seems to work okay. Just a comment: for the mlflow sync we'd need to run `pip install git+https://github.com/mlflow/mlflow-export-import/#egg=mlflow-export-import`, and I think with that one we might get the dependencies we are 'saving using skinny' so ...

Also, I checked, and we still need scikit-learn for graphs and torch_geometric.

gmertes · 2025-07-14T13:43:21Z

> and I think with that one we might get the dependencies we are 'saving using skinny' so ...

Hmm yes you are right, for the users that also install mlflow-sync there won't be any savings in the dependencies, only in the CI/CD but it's minimal.

mlflow-sync does install mlflow-skinny[databricks] under the hood, so it would be consistent to use skinny here (and in utils) too.
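To check a claim like this in a given environment, one can read the requirements a distribution declares in its installed metadata. The following is a rough sketch under stated assumptions: the helper names `requirement_names` and `depends_on` are hypothetical, the name parsing is a simplified regex rather than a full PEP 508 parser, and the distribution name you pass in has to be whatever the sync tooling is actually installed as in your environment.

```python
import re
from importlib import metadata


def _normalize(name):
    """Normalize a project name (lowercase, runs of -, _ and . become -)."""
    return re.sub(r"[-_.]+", "-", name).lower()


def requirement_names(requirements):
    """Extract normalized project names from requirement strings.

    Simplified: takes the leading name and ignores extras, version
    specifiers and environment markers.
    """
    names = set()
    for req in requirements or []:
        match = re.match(r"\s*([A-Za-z0-9][A-Za-z0-9._-]*)", req)
        if match:
            names.add(_normalize(match.group(1)))
    return names


def depends_on(dist_name, target):
    """Return True/False if dist_name declares target, None if not installed."""
    try:
        declared = metadata.requires(dist_name)
    except metadata.PackageNotFoundError:
        return None
    return _normalize(target) in requirement_names(declared)
```

For example, `depends_on("<installed sync package>", "mlflow-skinny")` would report whether that package declares mlflow-skinny directly; it does not follow transitive dependencies.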

I think at this point it's less about the dependencies and just a minor consistency issue. What do you think, should we still merge this or leave it?

mchantry

LGTM thanks


🤖 Automated Release PR

This PR was created by `release-please` to prepare the next release. Once merged:

1. A new version tag will be created
2. A GitHub release will be published
3. The changelog will be updated

Changes to be included in the next release:

---

<details><summary>training: 0.6.0</summary>

## [0.6.0](training-0.5.1...training-0.6.0) (2025-08-01)

### ⚠ BREAKING CHANGES

* for schemas of data processors ([#433](#433))
* BaseGraphModule and tasks introduced in anemoi-core ([#399](#399))

### Features

* Add metadata back to pl checkpoint. ([#303](#303)) ([0193b28](0193b28))
* BaseGraphModule and tasks introduced in anemoi-core ([#399](#399)) ([f8ab962](f8ab962))
* **deps:** Use mlflow-skinny instead of mlflow ([#418](#418)) ([6a8beb3](6a8beb3))
* Log FTT2 loss + Fourier Correlation loss ([#148](#148)) ([345b0ab](345b0ab))
* **model:** Postprocessors for leaky boundings ([#315](#315)) ([b54562b](b54562b))
* **models:** Checkpointed Mapper Chunking ([#406](#406)) ([8577772](8577772))
* **models:** Mapper edge sharding ([#366](#366)) ([326751d](326751d))
* Variable filtering ([#208](#208)) ([fba5e47](fba5e47))

### Bug Fixes

* Dropping 3.9 ([#436](#436)) ([f6c0214](f6c0214))
* For schemas of data processors ([#433](#433)) ([539939b](539939b))
* Mlflow hp params limit ([#424](#424)) ([138bc3a](138bc3a))
* Mlflowlogger duplicated key ([#414](#414)) ([cb64a1c](cb64a1c))
* **models,traininig:** Hierarchical model + integration test ([#400](#400)) ([71dfd89](71dfd89))
* **models:** Add removed sharded_input_key in PR400 ([#425](#425)) ([089fe6f](089fe6f))
* New checkpoint ([#445](#445)) ([a25df93](a25df93))
* Plotting error when precip related params are not diagnostic ([#369](#369)) ([010cfa3](010cfa3))
* **training:** Address issues with [#208](#208) ([#417](#417)) ([665f462](665f462))
* **training:** Scaler memory usage ([#391](#391)) ([a9d30e1](a9d30e1))
* Update import mflow utils unit tests ([#427](#427)) ([70ecdd9](70ecdd9))
* Update level retrieval logic ([#405](#405)) ([f393bc3](f393bc3))
* Use transforms: Variable for ExtractVariableGroupAndLevel ([#321](#321)) ([7649f4f](7649f4f))
* Warm restart ([#443](#443)) ([ff96236](ff96236))

### Documentation

* **graphs:** Documenting some missing features ([#423](#423)) ([8addbd8](8addbd8))

</details>

<details><summary>graphs: 0.6.3</summary>

## [0.6.3](graphs-0.6.2...graphs-0.6.3) (2025-08-01)

### Features

* **graphs:** Add lat weighted attribute ([#223](#223)) ([5dd32ca](5dd32ca))
* **graphs:** Support to export edges to npz ([#395](#395)) ([e21738f](e21738f))

### Bug Fixes

* Dropping 3.9 ([#436](#436)) ([f6c0214](f6c0214))
* **graphs:** Revert PR [#379](#379) ([#409](#409)) ([d51219f](d51219f))
* **graphs:** Throw error instead of raising warning when graph exists. ([#379](#379)) ([6ec6c18](6ec6c18))
* **graphs:** Undo masking when torch-cluster is installed ([#375](#375)) ([9f75c06](9f75c06))

### Documentation

* **graphs:** Documenting some missing features ([#423](#423)) ([8addbd8](8addbd8))

</details>

<details><summary>models: 0.9.0</summary>

## [0.9.0](models-0.8.1...models-0.9.0) (2025-08-01)

### ⚠ BREAKING CHANGES

* for schemas of data processors ([#433](#433))

### Features

* **model:** Postprocessors for leaky boundings ([#315](#315)) ([b54562b](b54562b))
* **models:** Checkpointed Mapper Chunking ([#406](#406)) ([8577772](8577772))
* **models:** Mapper edge sharding ([#366](#366)) ([326751d](326751d))

### Bug Fixes

* Dropping 3.9 ([#436](#436)) ([f6c0214](f6c0214))
* For schemas of data processors ([#433](#433)) ([539939b](539939b))
* **models,traininig:** Hierarchical model + integration test ([#400](#400)) ([71dfd89](71dfd89))
* **models:** Remove repeated lines ([#377](#377)) ([1f0b861](1f0b861))
* **models:** Uneven channel sharding ([#385](#385)) ([dd095c4](dd095c4))
* Pydantic model validator not working in transformer schema ([#422](#422)) ([42f437a](42f437a))
* Remove dead code and fix typo ([#357](#357)) ([8c615ba](8c615ba))

</details>

---

> [!IMPORTANT]
> Please do not change the PR title, manifest file, or any other automatically generated content in this PR unless you understand the implications. Changes here can break the release process.
>
> ⚠️ Merging this PR will:
> - Create a new release
> - Trigger deployment pipelines
> - Update package versions

**Before merging:**

- Ensure all tests pass
- Review the changelog carefully
- Get required approvals

[Release-please documentation](https://github.com/googleapis/release-please)

gmertes added 2 commits July 11, 2025 15:57

feat(deps): Use mlflow-skinny instead of mlflow

1a8f3a1

utils

016c037

gmertes requested a review from a team as a code owner July 11, 2025 15:18

github-actions bot added the training and enhancement labels Jul 11, 2025

gmertes added the ATS Approval Needed label Jul 11, 2025

mchantry added the ATS Approved label Jul 16, 2025

mchantry approved these changes Jul 16, 2025


anaprietonem added 3 commits July 21, 2025 10:03

Merge branch 'main' into chore/mlflow-skinny

b74121b

Merge branch 'main' into chore/mlflow-skinny

cf4b950

Merge branch 'main' into chore/mlflow-skinny

6c64ee4

anaprietonem merged commit 6a8beb3 into main Jul 21, 2025
18 checks passed

anaprietonem deleted the chore/mlflow-skinny branch July 21, 2025 15:24

DeployDuck mentioned this pull request Jul 21, 2025

chore: Release main #380

Merged
