Hiccup of FSU computer network

Yesterday (27 April 2020) our university computer network experienced some issues and was unavailable for several hours. Not unexpectedly, this also resulted in the unavailability of the CSI:FingerID web service, website etc. As usual, computer problems cause more computer problems: It looks like today (28 April 2020) we still have certain issues restarting the CSI:FingerID workers. That is hopefully resolved soon. We apologize for any inconvenience.

 

SIRIUS 4.4 beta released

Some of you may have noticed that yesterday, April 17, the SIRIUS 4.4 beta has been released. This update is huge so we are particularly careful not to break too many things. (We will definitely break some things so please report bugs using the SIRIUS GitHub repository or .) Some facts of what you can expect:

  • The official SIRIUS 4.4 release will happen in a few days.
  • Even after SIRIUS 4.4 has been officially deployed, you can continue to use SIRIUS 4.0.1 and the corresponding CSI:FingerID web service. We hope that this allows for a smooth transition.
  • SIRIUS 4.4 integrates ZODIAC for better molecular formulas.
  • SIRIUS 4.4 integrates CANOPUS for compound class assignments.
  • SIRIUS 4.4 now reads mzML files (“centroided” data) and processes complete LC-MS/MS datasets.
  • CSI:FingerID had some massive updates, including more and larger molecular properties and standardization of molecular structures.
  • SIRIUS 4.4 also supports multi-run datasets and aligns runs.
  • SIRIUS 4.4 uses the same project space for the command-line and the user interface version, allowing you to use the SIRIUS GUI to browse through results computed with the CLI.
  • passatutto is integrated into SIRIUS 4.4, allowing you to generate your own spectral library decoy database for FDR estimation.
  • If you wonder why we jump from version 4.0.1 to 4.4: There have been several internal releases in between.
  • A word of warning: Many features and changes have accumulated and there will be a few more releases (4.4.x) until the quiver is empty. For example, the structure database will change again as we have massive issues with the way PubChem handles structure standardization.

IMPRS application call for PhD students

The International Max Planck Research School at the Max Planck Institute for Chemical Ecology in Jena is looking for PhD students. One of the projects is from our group on “making SIRIUS and CSI:FingerID GCMS-ready”. Deadline is May 08, 2020.

SIRIUS and CSI:FingerID are the best-of-class tools for MS-based compound identification in metabolomics, natural products and related fields. More than one million compound queries have been submitted to our web service, from over 3000 users and 47 countries. See our recent publication in Nature Methods (Dührkop et al., 2019).

Currently, our tools can only process tandem mass spectrometry data; extending them to Gas Chromatography Electron Ionization appears natural, but comes with numerous challenging problems from algorithmics and machine learning. This will be done in cooperation with the group of Georg Pohnert, see his recent publication in Nature (Thume et al., 2018).

We are searching for motivated candidates from bioinformatics, machine learning, cheminformatics and/or computer science who want to work in this exciting, quickly evolving interdisciplinary field. Please contact Sebastian Böcker in case of questions.

Half a position is being paid by the IMPRS; this will be supplemented by funding from our chair to 2/3 TV-L E13. (Note that the cost of living in East Germany is still considerably lower than in West Germany.) Jena is a beautiful city and wine is grown in the region: https://www.youtube.com/watch?v=DQPafhqkabc.

IMPRS: http://imprs.ice.mpg.de/
MPI-CE: http://www.ice.mpg.de/
SIRIUS & CSI:FingerID: https://bio.informatik.uni-jena.de/software/sirius/
Literature: https://bio.informatik.uni-jena.de/publications/ and https://bio.informatik.uni-jena.de/textbook-algoms/

Jena: https://www.google.de/search?q=jena&tbm=isch&
https://www.study-in.de/en/discover-germany/german-cities/jena_26976.php
https://www.google.com/search?q=jena&tbm=isch

SIRIUS 4.4 is coming soon!

It’s been a while since SIRIUS 4 received its last update. We are excited to announce that SIRIUS 4.4 is coming soon.
It comes with many new features, e.g.:

  • Project-Space: A standardized persistence layer shared by CLI and GUI that makes both fully compatible.
  • Redesigned Command Line Interface: SIRIUS is now a toolbox that contains different sub-tools that can be combined to “tool-chains”.
  • New (and newly integrated) tools:
    • ZODIAC: Improve Molecular Formula Identifications by re-ranking SIRIUS molecular formula annotations using Bayesian statistics. ZODIAC optimizes annotations on a whole dataset taking advantage of the fact that compounds usually co-occur in a network of derivatives.
    • PASSATUTTO: Is now part of SIRIUS and allows you to generate dataset specific decoy spectral libraries from computed fragmentation trees.
    • lcms-align: SIRIUS supports mzML/mzXML format to process whole LC-MS/MS runs. The lcms-align preprocessing tool performs feature detection and feature alignment based on the available MS/MS spectra.
    • Other handy standalone tools, e.g. compound similarity calculation.

To provide user friendly but also flexible and customizable access to the different tools we completely redesigned the command line interface (CLI).
We know that this might break your workflows and therefore we provide you an early access version of the CLI that can be used for testing and adapting your workflows:
https://bio.informatik.uni-jena.de/repository/list/dist-snapshot-local/de/unijena/bioinf/ms/sirius/4.4.0-SNAPSHOT/
You will also find an updated version of the manual which is still work-in-progress but contains already an updated section on the new CLI.

No worries, even when SIRIUS 4.4. will be released (as soon as the GUI is ready) version 4.0.1 will still be available for some time.

If you find bugs or have any feedback feel free to open an issue on the SIRIUS GitHub repository or contact us via .

Preprint of ZODIAC now on bioRxiv

A preprint of our paper “ZODIAC: database-independent molecular formula annotation using Gibbs sampling reveals unknown small molecules.” is now available: https://doi.org/10.1101/842740

ZODIAC takes advantage of the fact that an organism produces related metabolites. ZODIAC builds upon SIRIUS and reranks molecular formula candidates, optimizing annotations on whole datasets. By applying ZODIAC to multiple datasets we greatly increased the number of correct annotations and identified novel molecular formulas which are not present even present in PubChem.

ZODIAC will be made available in an upcoming release of the SIRIUS software.

 

 

IMPRS application call for PhD students

The International Max Planck Research School at the Max Planck Institute for Chemical Ecology in Jena is looking for PhD students. One of the projects is from our group on “making SIRIUS and CSI:FingerID GCMS-ready”. Deadline is May 24, 2019.

SIRIUS and CSI:FingerID are the best-of-class tools for MS-based compound identification in metabolomics, natural products and related fields. More than one million compound queries have been submitted to our web service, from over 3000 users and 47 countries. See our recent publication in Nature Methods (Dührkop et al., 2019).

Currently, our tools can only process tandem mass spectrometry data; extending them to Gas Chromatography Electron Ionization appears natural, but comes with numerous challenging problems from algorithmics and machine learning. This will be done in cooperation with the group of Georg Pohnert, see his recent publication in Nature (Thume et al., 2018).

We are searching for motivated candidates from bioinformatics, machine learning, cheminformatics and/or computer science who want to work in this exciting, quickly evolving interdisciplinary field. Please contact Sebastian Böcker in case of questions.

Half a position is being paid by the IMPRS; this will be supplemented by funding from our chair to 2/3 TV-L E13. (Note that the cost of living in East Germany is still considerably lower than in West Germany.) Jena is a beautiful city and wine is grown in the region: https://www.youtube.com/watch?v=DQPafhqkabc.

IMPRS: http://imprs.ice.mpg.de/
MPI-CE: http://www.ice.mpg.de/
SIRIUS & CSI:FingerID: https://bio.informatik.uni-jena.de/software/sirius/
Literature: https://bio.informatik.uni-jena.de/publications/ and https://bio.informatik.uni-jena.de/textbook-algoms/

Jena: https://www.google.de/search?q=jena&tbm=isch&
https://www.study-in.de/en/discover-germany/german-cities/jena_26976.php
https://www.google.com/search?q=jena&tbm=isch

Our SIRIUS 4 paper is now available at Nature Methods

We are happy to announce that our paper “SIRIUS 4: a rapid tool for turning tandem mass spectra into metabolite structure information” is now available online at Nature Methods.

  • K. Dührkop, M. Fleischauer, M. Ludwig, A. A. Aksenov, A. V. Melnik, M. Meusel, P. C. Dorrestein, J. Rousu, and S. Böcker, “Sirius 4: Turning tandem mass spectra into metabolite structure information,” Nature Methods, doi 10.1038/s41592-019-0344-8, 2019.

View-only access to the paper is available here.


SIRIUS and CSI:FingerID user meeting?

With SIRIUS and CSI:FingerID gathering interest in the community, we are thinking about a SIRIUS and CSI:FingerID user meeting (a SIRIUS user meeting, so to say) in Jena. This would be a 2-3 day come-together with the possibility to show what your are doing with our tools, discuss with the developers, give us feedback on what is SIRIUSly needed etc. We are open to suggestions.

But most importantly: Are you interested in such a meeting? Would you come to Jena for 2-3 days? When would be a good time? (September is the default, but this is usually packed.)

In case you are interested, please let us know. You can leave your comment below, but please also send an email to the SIRIUS email address.


SIRIUS 4.0.1 released

A new version of SIRIUS 4 is available for download.
SIRIUS 4.0.1 brings many bugfixes, user interface polishing and improved stability of the CSI:FingerID backend.

  • SIRIUS 4.0.1 now supports JAVA 9 and higher
  • The structures used to train CSI:FingerID are now available via the web service:
    • https://www.csi-fingerid.uni-jena.de/webapi/trainingstructures.csv?predictor=pos
    • https://www.csi-fingerid.uni-jena.de/webapi/trainingstructures.csv?predictor=neg

See our changelog for further details .

You can download SIRIUS with CSI:FingerID here.

CSI:FingerID successfully applied to dAPCI GC-MS data

Larson et al. (J. Am. Soc. Mass Spectrom., 2018) have applied CSI:FingerID to dopant-assisted atmospheric pressure chemical ionization (dAPCI)  gas chromatography mass spectrometry (GC-MS) data. They identified almost three times as many compounds with SIRIUS and CSI:FingerID as when searching in the NIST spectral library. Find their study here.

SIRIUS fix for compounds in negative ion mode

We have fixed a bug in SIRIUS when analyzing compounds in negative ion mode. These were wrongly treated as intrinsically charged. If you have analyzed negative ion mode data with SIRIUS 4 and CSI:FingerID, you might want to reanalyze the data with the newest version.

 

Stress test on April 12: 260k compound queries on a single day

Some of you might have experienced problems to reach the CSI:FingerID web service on Thursday, April 12, 2018. The reason is: After a “relatively quiet” Wednesday with “only” 120k compound queries (irony warning), the CSI:FingerID web service had to handle a real-world stress test on Thursday, with 260k compound queries submitted on a single day. That is 3 compound queries per second on average.

This brought our job database into some trouble: The database runs on a redundant twin server and is regularly mirrored between the two servers, in case one of them is hosed. Unfortunately, the log files used for mirroring jobs became too big, stalling the web server.

The problem resided until Friday 13th noon. Bad luck.

We are sorry if this caused any inconvenience. All systems should be up and running again.

As a courtesy towards other users: In case you want to submit complete databases with hundreds of runs, it would be great if you could distribute your computations over the course of several days. Also, you might want to contact us in advance, so that we do not accidentally block you.

By the way: We have obviously passed 1.5 million compound queries after these two days.

 

IMPRS application call for PhD students

The International Max Planck Research School at the MPI for Chemical Ecology in Jena is looking for PhD students, and one of the projects is on “making SIRIUS and CSI:FingerID GCMS-ready”. Only a half position is being paid by the IMPRS, but this can be supplemented by funding from our chair. We are searching for motivated candidates from bioinformatics, cheminformatics and computer science who want to work in this exciting, quickly evolving interdisciplinary field. Please see here for details, and apply here. Application deadline is May 16th, 2018. Contact Sebastian in case your have questions.

SIRIUS 3 is not longer supported

We found a major bug in the web service of SIRIUS 3 which can also affect the stability of the new SIRIUS 4. Therefore we decided to shut down the web service of SIRIUS 3 immediately.

Please contact us () if you need to finish work that can only be done with SIRIUS 3. We will try to find a solution then.