This script converts Waters raw data files to imzML format using ProteoWizard (through Docker). It takes a folder containing raw Waters datasets as input and converts each raw file to mzMLb format using ProteoWizard. Then, it extracts the x and y coordinates from the .inf file associated with each raw file and converts the mzMLb file to imzML format using pyteomics and pyimzml libraries.
- Python==3.11.8
- chardet==5.2.0
- pyimzML==1.5.3
- pyteomics==4.7.2
- ProteoWizard (installed as a Docker container)
Nastase, A.M., waters2imzml, 10.5281/zenodo.13951708
-
Install the required Python libraries:
pip install -r requirements.txt
-
Install Docker Desktop from the official Docker website: https://www.docker.com/products/docker-desktop
-
Run the script:
python waters_to_imzml.py
-
Follow the prompts to provide the path to the folder containing the raw Waters datasets, the polarity (positive or negative), and the position of the ID in the file name.
Path to folder containing raw Waters datasets: /path/to/folder Input polarity positive/negative: positive Position of id in file name: 7
-
The name of the folders containing the raw data should not have spaces and should be separated by '_'. The folder name should contain SAMPLE_idnumber.
-
The script will convert each raw file to mzMLb format using ProteoWizard and then convert it to imzML format using pyteomics and pyimzml libraries. The converted imzML files will be saved in the 'imzml' folder within the raw data folder.
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.