Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Comparing isotope pattern #16

Open
michaelwitting opened this issue May 28, 2021 · 7 comments
Open

Comparing isotope pattern #16

michaelwitting opened this issue May 28, 2021 · 7 comments

Comments

@michaelwitting
Copy link
Collaborator

Hi,

I thought one other important aspect would be to compare two isotope pattern (e.g. one measured and one simulated) against each other. We could use the matchedSpectra here, assuming that the supplied spectra are MS1 spectra.
We would need a scoring function for the isotope pattern matching, since the dotproduct would not work here. What would be a good place for that? MsCoreUtils or MetaboCoreUtils?
I'm searching for a fitting scoring function, e.g. adopting the one from Rdisop

Best,

Michael

@jorainer
Copy link
Member

Hm, I guess isotope matching would also be important for proteomics and MS in general, thus MsCoreUtils might be the place to go.

@michaelwitting
Copy link
Collaborator Author

I will check the C++ code from Rdisop. I guess it can be tweaked...

@stanstrup
Copy link

As far as I remember the Rdisop score some times gives weird result where a worse match is scored higher... So perhaps it is not a good reference implementation...

@michaelwitting
Copy link
Collaborator Author

I know that in Sirius there is a good implementation. Maybe we can check their code. I have a meeting with Sebastian Böcker tomorrow, so I can ask about the exact algorithm behind it.

@jorainer
Copy link
Member

Just thinking out loud:

  • theoretical isotope pattern could be calculated with enviPat - but you would need to know the chemical formula beforehand.
  • if one would then restrict the MS1 spectrum to peaks representing isotopologues of the compound of interest (i.e. isotopologues for a given m/z of interest), shouldn't dotproduct or any of the other similarity scores be good enough to compare this measured pattern against the theoretical one?

note that @andreavicini is currently refining our approach to find likely isotopologues in an MS1 spectrum.

@michaelwitting
Copy link
Collaborator Author

The dotproduct would overestimate similarity, since it would be mainly driven by the monoisotope, which is the highest peak. It was originally develope for GC-MS, where you have a lot of peaks distributed over a m/z range, each peak with some contribution to the spectrum.
I guess we really need here another score, but I'm opne for any suggestion.

@stanstrup
Copy link

There is a suggestion here I think: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4321342/#__sec4title

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants