Add onnx export by julianpollmann · Pull Request #294 · matchms/ms2deepscore

julianpollmann · 2026-06-23T15:24:22Z

This adds export to onnx to the model and training script.

@florian-huber @niekdejonge does this fit or should this be added to a converter script in ms2deepscore/models?

Should I adapt the Pipeline to use onnxruntime for inference?

niekdejonge · 2026-06-24T09:18:24Z

@julianpollmann Great! Thanks for developing an onnx converter and for adding it here.

Good that you added a save option for the settings. Right now it is a separate JSON file, right? Ideally the settings are part of the ONNX file itself, which makes the model easier to use and reduces the risk of pairing a model with the wrong settings. I think this is possible in ONNX as well, using metadata_props (suggested by Claude).
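A minimal sketch of the metadata_props idea, assuming the settings are a JSON-serializable dict. The key name `ms2deepscore_settings` and the example setting names are hypothetical, not taken from this PR. The `onnx` import is done lazily inside the file helpers so the serialization part can be exercised without the package installed.

```python
import json
from typing import Dict

# Hypothetical key under which the settings JSON is stored in the ONNX file.
SETTINGS_KEY = "ms2deepscore_settings"


def settings_to_props(settings: dict) -> Dict[str, str]:
    """Serialize settings to the string key/value form that metadata_props holds."""
    return {SETTINGS_KEY: json.dumps(settings, sort_keys=True)}


def props_to_settings(props: Dict[str, str]) -> dict:
    """Recover the settings dict from metadata key/value pairs."""
    return json.loads(props[SETTINGS_KEY])


def attach_settings(onnx_path: str, settings: dict, out_path: str) -> None:
    """Write the settings into the model's metadata_props and save a copy."""
    import onnx  # imported lazily; only needed when touching real files

    model = onnx.load(onnx_path)
    for key, value in settings_to_props(settings).items():
        entry = model.metadata_props.add()
        entry.key = key
        entry.value = value
    onnx.save(model, out_path)


def read_settings(onnx_path: str) -> dict:
    """Load a model and return the settings stored in its metadata_props."""
    import onnx

    model = onnx.load(onnx_path)
    return props_to_settings({p.key: p.value for p in model.metadata_props})
```

Since metadata_props only stores strings, a single JSON blob keeps nested settings and numeric types intact on the way back out.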

For inference we would like to keep matchms compatibility, which makes it a bit more work... The way this is solved now is with the class MS2DeepScore(BaseSimilarity), which inherits from the matchms BaseSimilarity to make it matchms compatible. It contains some torch-specific code (like eval) and takes in a SiameseSpectralModel (also torch specific).

So what I think is needed to make it run with onnxruntime is a new MS2DeepScoreONNX class, a SiameseSpectralModelONNX class and a compute_embedding_array_onnx function to enable inference. In fact, you might be able to put all of this functionality into a single MS2DeepScoreONNX class: SiameseSpectralModelONNX is not really needed as a separate class, since we don't do ONNX training. I don't think Pipeline needs changing. It lives on the matchms end and, if I remember correctly, will run anything that inherits from BaseSimilarity, so if you give it the new MS2DeepScoreONNX, which inherits from BaseSimilarity, it should work fine with Pipeline.
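A rough sketch of the single-class idea, under stated assumptions: the similarity class only needs something that turns a spectrum into an embedding, so the backend (torch or onnxruntime) can be swapped behind one callable. `EmbeddingSimilarity`, `make_onnx_embedder` and `tensorize` are hypothetical names. The real class would subclass the matchms BaseSimilarity, which is omitted here to keep the sketch dependency-free, and cosine similarity between embeddings is assumed as the score.

```python
import math
from typing import Callable, List, Sequence

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine similarity of two embeddings; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class EmbeddingSimilarity:
    """Scores spectra via embeddings produced by any backend (hypothetical class)."""

    def __init__(self, embed: Callable[[object], Vector]):
        self.embed = embed

    def pair(self, reference, query) -> float:
        return cosine_similarity(self.embed(reference), self.embed(query))

    def matrix(self, references, queries) -> List[List[float]]:
        # Embed each spectrum once, then compare all pairs.
        ref_emb = [self.embed(r) for r in references]
        query_emb = [self.embed(q) for q in queries]
        return [[cosine_similarity(r, q) for q in query_emb] for r in ref_emb]


def make_onnx_embedder(model_path: str, tensorize: Callable) -> Callable:
    """Build an embed() callable backed by onnxruntime.

    `tensorize` is a hypothetical function returning one array per model
    input, in the order the exported model declares its inputs.
    """
    import onnxruntime as ort  # imported lazily; only needed for real models

    session = ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])
    input_names = [inp.name for inp in session.get_inputs()]

    def embed(spectrum):
        feeds = dict(zip(input_names, tensorize(spectrum)))
        # First output, first row: the embedding of this single spectrum.
        return session.run(None, feeds)[0][0]

    return embed
```

With this split, no separate ONNX model class is required: the session lives inside the embedder, and the scoring code stays identical for both backends.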

A bit more work than I realized when I commented on your LinkedIn post, sorry... But I think it is a nice change that is worth the effort, since ONNX is faster and also interoperable across programming languages.

julianpollmann · 2026-06-24T13:04:47Z

@niekdejonge I will have time on Friday to look into that!
Weights and settings can both be kept in the ONNX file (settings via metadata_props); however, some providers (e.g., Hugging Face) keep separate config/settings.json files, for example for model cards. I don't know which use case is more suitable for us.

Also: right now the ONNX conversion is done on every checkpoint. Is there a specific "last checkpoint" function, or would this be better integrated into training_wrapper_functions.py?
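One possible shape for the "export once" alternative, as a sketch only: track the best validation loss during training and run the conversion a single time afterwards. `BestModelExporter` and `export_checkpoint_to_onnx` are hypothetical names, and how checkpoints are actually selected in the training code is an assumption here.

```python
from typing import Callable, Optional


class BestModelExporter:
    """Remembers the best epoch and triggers one export at the end (hypothetical)."""

    def __init__(self, export_fn: Callable[[int], None]):
        self.export_fn = export_fn
        self.best_loss = float("inf")
        self.best_epoch: Optional[int] = None

    def update(self, epoch: int, val_loss: float) -> bool:
        """Record an epoch; return True if it is the new best."""
        if val_loss < self.best_loss:
            self.best_loss = val_loss
            self.best_epoch = epoch
            return True
        return False

    def finalize(self) -> None:
        """Export once, from the best epoch seen, if any epoch was recorded."""
        if self.best_epoch is not None:
            self.export_fn(self.best_epoch)


def export_checkpoint_to_onnx(model, example_inputs: tuple, out_path: str) -> None:
    """Convert a trained torch model to ONNX (lazy torch import)."""
    import torch

    model.eval()  # export in inference mode
    torch.onnx.export(model, example_inputs, out_path)
```

In a training loop, `update` would be called after each validation step and `finalize` after the last epoch, with `export_fn` loading the matching .pt checkpoint before calling `export_checkpoint_to_onnx`.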

florian-huber

Great addition, thanks @julianpollmann !
I added a few comments to check (feel free to ignore them if they are beyond the point).

niekdejonge · 2026-07-03T14:07:45Z

 ## 1) Compute spectral similarities
 We provide a model which was trained on > 500,000 MS/MS combined spectra from [GNPS](https://gnps.ucsd.edu/), [Mona](https://mona.fiehnlab.ucdavis.edu/), MassBank and MSnLib. 

 This model can be downloaded from [from zenodo here](https://zenodo.org/records/17826815). Only the ms2deepscore_model.pt is needed.


We should update this and upload the latest onnx model to a new zenodo link.

Not quite sure, but I think @florian-huber is training a model on new data. Maybe we should add this one then?

niekdejonge · 2026-07-03T14:16:12Z

+from ms2deepscore import MS2DeepScore, MS2DeepScoreONNX
+
 cleaned_spectra = pipeline.spectra_queries



I think the two lines below should be removed, since this continues from the code above, which uses an ONNX model, so I guess it won't work with MS2DeepScore. If we remove these two lines it should be fine.

So you mean removing:

ms2ds_model = MS2DeepScore(model)
ms2ds_embeddings = ms2ds_model.get_embedding_array(cleaned_spectra)

?

niekdejonge

@julianpollmann Great work! Looks good to me! Very complete PR, both in tests and README.

The tests with the onnx model have the same output as the ones for the .pt models, so it looks all good to me!

It will be great to have the CPU speed-up from ONNX, since the majority of users will just want to run it on their laptop.

I left a few comments in the tutorial, for further clarity and some small things that won't run.
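The parity check between the ONNX and .pt outputs mentioned above can be sketched as an element-wise comparison within a tolerance, since the two runtimes rarely agree bit for bit. `embeddings_match` is a hypothetical helper and the tolerance values are illustrative, not the ones used in this PR's tests.

```python
import math
from typing import Sequence


def embeddings_match(
    reference: Sequence[Sequence[float]],
    candidate: Sequence[Sequence[float]],
    rel_tol: float = 1e-4,
    abs_tol: float = 1e-5,
) -> bool:
    """Return True if two embedding arrays agree element-wise within tolerance."""
    if len(reference) != len(candidate):
        return False
    for row_ref, row_cand in zip(reference, candidate):
        if len(row_ref) != len(row_cand):
            return False
        for x, y in zip(row_ref, row_cand):
            # abs_tol matters for values near zero, where rel_tol alone is too strict.
            if not math.isclose(x, y, rel_tol=rel_tol, abs_tol=abs_tol):
                return False
    return True
```

A test would compute embeddings for the same spectra with both models and assert that `embeddings_match` holds, loosening the tolerances only as far as reduced-precision backends require.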

julianpollmann added 3 commits June 23, 2026 16:12

Add export to onnx

4837643

Add onnxscript dependency

4a87423

Add export to training script

0e7b929

julianpollmann requested a review from florian-huber June 23, 2026 15:24

julianpollmann added 3 commits June 23, 2026 17:29

Fix tests

723b86c

Merge branch 'main' into feature/add-onnx-export

969a3bc

Fix linting

bcc638e

julianpollmann removed the request for review from florian-huber June 24, 2026 13:09

julianpollmann added 20 commits June 27, 2026 00:02

Attach model settings directly to onnx model, fix dynamic shapes issue

affbd21

adjust logging

28385cf

adjust tests

c52a574

Fix method signature

0047840

Add compute_embedding_array_onnx function

ff5ad85

Add SiameseSpectralModelONNX class

963794c

move compute_embedding_array to SiameseSpectralModelONNX class

c59fdea

Adjust check for metadata tensors and add flag for settings validation

cb2b402

Add MS2DeepScoreONNX

d1606d6

Add onnxruntime

bcf0032

Add precision to intel gpu support

6214529

Fix linting

43ccb9f

Modify precision handling

18f2aa2

Small fixes

6138a1c

Add tests

5d35c7f

Update setup.py for gpu and intel support

f9de157

Update README

d0d2b71

Add onnxruntime to dev for tests

a7387c0

Add onnx testmodel

aadcd60

Fix tolerance in tests

e838294

julianpollmann requested review from florian-huber and niekdejonge June 30, 2026 19:19

julianpollmann and others added 2 commits July 2, 2026 11:05

Merge branch 'main' into feature/add-onnx-export

9653ed2

Linting isort

c42253c

florian-huber reviewed Jul 3, 2026

Comment thread ms2deepscore/models/SiameseSpectralModelONNX.py Outdated

florian-huber reviewed Jul 3, 2026

Comment thread tests/test_siamese_spectral_model_onnx.py

florian-huber reviewed Jul 3, 2026

Comment thread ms2deepscore/models/SiameseSpectralModel.py

florian-huber requested changes Jul 3, 2026

julianpollmann added 3 commits July 3, 2026 11:19

Fix tensorize_spectra in batch loop

1057462

Fix metadata check and add tests

033d405

Fix move Model to original device after export

02f76ef

niekdejonge reviewed Jul 3, 2026

Comment thread README.md

niekdejonge reviewed Jul 3, 2026

Comment thread README.md

niekdejonge approved these changes Jul 3, 2026

Update README

7cc3214

julianpollmann wants to merge 32 commits into main from feature/add-onnx-export


3 participants
