Chapter 3 Pretreatment

Pretreatment strongly affects metabolomics results and covers the workflow from crude samples to injection vials for instrumental analysis. The goal is to preserve biologically meaningful compounds while removing as much irrelevant matrix background as possible. In metabolomics, however, the compounds of interest are often not known in advance, so pretreatment must be chosen according to study purpose, sample matrix, and analytical platform. For example, Gel Permeation Chromatography (GPC), Florisil, alumina, and silica gel can remove lipids, while alcohols and strong acids or bases can denature proteins and release additional compounds. If the study focuses on small lipids or peptides, these choices may need to change substantially. In practice, sample collection, quenching, extraction, derivatization, and storage should all be optimized as part of pretreatment.

3.1 Collection

Those papers investigated different fecal collection methods(Loftfield et al. 2016; Deda et al. 2017).

This paper discusses the influence of sample normalization(Wu and Li 2016).

3.2 Quenching

Quenching solvent is always used to stop enzymatic activity.

In this review(Lu et al. 2017), authors said:

A classical approach, which works well for many analytes, is boiling ethanol. Although the boiling solvent raises concerns about thermal degradation, it reliably denatures enzymes. In contrast, cold organic solvent may not fully denature enzymes or may do so too slowly such that some metabolic reactions continue, interconverting metabolites during the quenching process.

This review(Kim et al. 2020) summarized the urease-dependent metabolome sample preparation and found:

activities of urease and endogenous urinary enzymes and metabolite contaminants from the urease preparations introduce artefacts into metabolite profiles, thus leading to misinterpretation.

3.3 Extraction

According to this research(Bennett et al. 2009):

The total metabolome concentration is approximately 300 mM, whereas the protein concentration is approximately 7 mM., which implies that most cellular metabolites are in free form.

Dmitri et al.(Sitnikov et al. 2016) thought the most orthogonal methods to methanol-based precipitation were ion-exchange solid-phase extraction and liquid-liquid extraction using methyl-tertbutyl ether.

Another study used stable isotope labeled sample and found the use of a water-methanol-acetonitrile mixture for global metabolite extraction instead of aqueous methanol or aqueous acetonitrile alone (Doppler et al. 2016).

Metabolic information was highly influenced by the extraction solvent(Ibáñez et al. 2017).

Tissue samples need to first be pulverized into fine powders.

Feces collected with 95% ethanol or FOBT would be more reproducible and stable.

In this review(Lu et al. 2017), authors said:

In our experience, for both cell and tissue specimens, 40:40:20 acetonitrile:methanol:water with 0.1 M formic acid (and subsequent neutralization with ammonium bicarbonate) is generally an effective solvent system for both quenching and extraction, including for ATP and other high-energy phosphorylated compounds. We typically use approximately 1 mL of solvent mix to extract 25 mg of biological specimen. …Thus, although drying is acceptable for most metabolites, care must be taken with redox-active species.

nano LC-MS could be used to analyze small numbers of cells(Luo and Li 2017).

For plant like soybeans(Mahmud et al. 2017), ammonium acetate/methanol could be selected as extraction strategies compared with water/methanol and sodium phosphate/methanol. For general plant samples, check this comprehensive investigation(Bijttebier et al. 2016).

For blood plasma and serum sample, a comprehensive evaluation of 12 sample preparation methods (SPM) using phospholipid and protein removal plates (PLR), solid phase extraction plates (SPE), supported liquid extraction cartridge (SLE), and conventionally used protein precipitation (PPT) were performed. Results show PPT and PLR on the same samples by implementing a simple analytical workflow as their complementarity would allow the broadening of the visible chemical space (Chaker et al. 2022).

3.3.1 Extraction selection flowchart

In practice, extraction should be selected according to the study purpose, sample matrix, and metabolite class of interest rather than by habit alone. A simple decision logic is:

  1. Is the study untargeted or targeted?
    Untargeted studies usually prefer broader and more orthogonal extraction conditions, while targeted studies may prioritize recovery for a defined chemical class.

  2. Is the major interest polar metabolites, lipids, or both?
    Polar metabolites often favor aqueous organic mixtures such as methanol/acetonitrile/water, while lipids often require biphasic extraction or non-polar solvents.

  3. Does the matrix contain high protein or phospholipid background?
    Plasma, serum, and tissue extracts may need additional cleanup such as protein precipitation, phospholipid removal, SPE, or liquid-liquid extraction.

  4. Is sample amount limited?
    Small-volume studies may need simpler and more reproducible extraction strategies with fewer transfers.

  5. Is absolute quantification or high-throughput screening required?
    High-throughput studies usually favor simpler workflows, while quantitative studies may justify more selective cleanup or isotope-labeled standards.

As a rough guide:

  • Broad untargeted polar metabolomics: water/methanol/acetonitrile mixtures are often a good starting point.

  • Lipid-focused or broad lipid coverage: LLE or biphasic extraction is often more suitable.

  • Clinical plasma/serum screening: protein precipitation is often the simplest first-pass method, followed by SPE or phospholipid removal if matrix effects are severe.

  • Very dirty matrices: SPE or multi-step cleanup may improve selectivity at the cost of coverage.

3.3.2 Protein precipitation, SPE, and LLE

The three common strategies have different goals and trade-offs:

Method Main purpose Strengths Limitations Common use
Protein precipitation (PPT) Remove proteins quickly with organic solvent Fast, cheap, high-throughput, simple to automate Limited selectivity, matrix effects may remain, lipid/phospholipid background can still be high Plasma, serum, routine untargeted screening
Solid-phase extraction (SPE) Selective cleanup or enrichment on sorbent Better cleanup, can enrich target classes, reduces matrix complexity More steps, more cost, analyte loss possible, method development required Targeted assays, dirty matrices, class-selective enrichment
Liquid-liquid extraction (LLE) Partition compounds by polarity into different phases Good orthogonality, useful for lipids and broad chemical space separation More labor-intensive, emulsion risk, reproducibility can suffer if handling is inconsistent Lipidomics, biphasic extraction, broad coverage workflows

PPT is often the most practical starting point for biofluids because it is simple and scalable. SPE is useful when background suppression is severe or when the study targets a narrower compound class. LLE is often preferred when the aim is to separate polar and non-polar metabolite fractions or to improve lipid coverage.

3.4 Derivatization

Derivatization is always used in GC-based metabolomics study because many metabolites are too polar, thermally unstable, or non-volatile to be directly analyzed by GC-MS. This paper(Miyagawa and Bamba 2019) compared sequential derivatization methods and found different compounds would show different fluctuations during oximation or silylation process. This paper summarized derivatization methods for LC-MS (Zhao and Li 2020).

For GC-MS metabolomics, derivatization is not a minor technical detail. It is a major source of variation and should be planned as part of pretreatment. In many workflows, oximation is performed first to stabilize carbonyl compounds and reduce multiple derivative forms, followed by silylation to increase volatility and improve chromatographic behavior. If the timing, reagent freshness, residual water, or temperature is not controlled, the same metabolite may show unstable peak areas or multiple derivative peaks across samples.

The main practical points are:

  • Control water content because moisture can strongly interfere with silylation efficiency.

  • Keep derivatization timing consistent across all samples because reaction time can change peak intensity.

  • Use the same reagent lot and temperature conditions within a study when possible.

  • Treat batch processing carefully because derivatized samples may not remain stable for long waiting times.

  • Expect compound-dependent behavior since sugars, amino acids, organic acids, and carbonyl compounds may respond differently.

Therefore, derivatization should be optimized and documented with the same care as extraction solvent, especially in GC-MS studies or whenever semi-quantitative comparison is important.

3.5 Isotope label

You might try heavy water to exchange oxygen atom with samples to track certain metabolites(Osipenko et al. 2022) or MS-IDF(Wang et al. 2022).

3.6 Microsampling techniques

Traditional blood sampling requires large volumes (several mL) drawn by trained personnel, which limits the temporal resolution and ecological validity of metabolomics studies. Microsampling techniques collect much smaller volumes (typically 10–30 \(\mu\)L) and can often be performed by participants themselves at home, making them especially attractive for longitudinal and remote studies.

3.6.1 Solid-phase microextraction (SPME)

Solid-phase microextraction (SPME) uses a coated fiber or thin film to extract analytes directly from a biological matrix without the need for solvent extraction(Pawliszyn 2012). The coating selectively absorbs or adsorbs analytes based on their affinity to the coating material. SPME has been coupled with both GC-MS and LC-MS for metabolomics applications, and in vivo SPME probes have been developed for direct tissue sampling with minimal invasiveness(Reyes-Garcés et al. 2018). A key advantage is that SPME provides a balanced coverage of metabolites across a wide polarity range, since the extraction is equilibrium-based rather than exhaustive(Li et al. 2022).

3.6.2 Dried blood spots (DBS)

Dried blood spots involve collecting a small volume of capillary blood (typically from a finger prick) onto filter paper cards. After drying, the cards are stable at room temperature and can be shipped by mail, making DBS ideal for large-scale population studies and remote sampling(Déglon et al. 2012). DBS have been successfully used in newborn screening for decades and are now being applied to metabolomics studies. Challenges include hematocrit effects on spot size and analyte recovery, and the relatively small amount of material available for analysis(Manier et al. 2021).

3.6.3 Volumetric absorptive microsampling (VAMS)

Volumetric absorptive microsampling (VAMS, commercially available as Mitra devices) collects a fixed volume of blood (typically 10 or 20 \(\mu\)L) onto an absorbent polymer tip, overcoming the hematocrit-dependent volume variability of DBS(Denniff and Spooner 2014). VAMS devices are straightforward for self-collection and the fixed volume simplifies quantitative analysis. They have been validated for a range of metabolomics and pharmacokinetic applications.

3.7 Automation

Automation is increasingly important in metabolomics pretreatment, especially for large cohorts, longitudinal studies, and multi-batch projects. Automated liquid handling can improve reproducibility by reducing manual pipetting error, controlling timing more tightly, and standardizing solvent addition, mixing, and aliquoting across many samples.

Automation is especially useful for:

  • protein precipitation in plates

  • SPE plate workflows

  • addition of internal standards

  • batch-wise derivatization with controlled timing

  • aliquoting pooled QC and blanks

However, automation does not automatically solve pretreatment problems. Plate layout, dead volume, edge effects, carryover, tip selection, solvent compatibility, and evaporation still need to be checked carefully. In some cases, a simple manual workflow may be more robust than a poorly optimized automated one. Therefore, automation should be treated as a reproducibility tool that still requires validation.

3.8 Storage

Samples should be stored after sample collection or sample pretreatment. -80°C or -20°C is always preferred to store samples. Dry ice should be used during sample pretreatment. However, comprehensive investigation of storage influences found the metabolites profile will change after one day storage at -80°C(Yu et al. 2020) . Rapid analysis of samples should be considered to capture more accurate information in the samples.

Storage conditions such as temperature and time can affect the metabolite composition of various samples. Laparre et al.(Laparre et al. 2017) noted that the metabolite profiles of urine samples were significantly changed after 5 days of storage at 4°C , while Wandro and colleagues(Wandro et al. 2017) observed that the metabolomic profiles of cystic fibrosis sputum samples underwent notable changes after only 1 day of storage at 4°C . Likewise, Roszkowska et al. demonstrated that various signaling molecules were lost from the lipidome profile of tissue after storing the samples for one year at 80°C (Roszkowska et al. 2018). To date, most metabolomics studies involving storage of samples prior to the analysis have used a storage temperature of 80°C , as previous investigations have shown that low temperatures or freeze-thaw cycles do not significantly change the metabolite profile of certain samples(Lin et al. 2007) .

For gut microbiota, this paper could be checked for storage issue(Zubeldia-Varela et al. 2020).

For blood sample storage, you could check this paper(Hernandes et al. 2017).

For urine sample storage, check this(Laparre et al. 2017).

This piece reviewed the stability of energy metabolites(Gil et al. 2015).

References

Bennett, Bryson D., Elizabeth H. Kimball, Melissa Gao, Robin Osterhout, Stephen J. Van Dien, and Joshua D. Rabinowitz. 2009. “Absolute Metabolite Concentrations and Implied Enzyme Active Site Occupancy in Escherichia Coli.” Nature Chemical Biology 5 (8): 593–99. https://doi.org/10.1038/nchembio.186.
Bijttebier, Sebastiaan, Anastasia Van der Auwera, Kenn Foubert, Stefan Voorspoels, Luc Pieters, and Sandra Apers. 2016. “Bridging the Gap Between Comprehensive Extraction Protocols in Plant Metabolomics Studies and Method Validation.” Analytica Chimica Acta 935 (September): 136–50. https://doi.org/10.1016/j.aca.2016.06.047.
Chaker, Jade, David Møbjerg Kristensen, Thorhallur Ingi Halldorsson, et al. 2022. “Comprehensive Evaluation of Blood Plasma and Serum Sample Preparations for HRMS-Based Chemical Exposomics: Overlaps and Specificities.” Analytical Chemistry 94 (2): 866–74. https://doi.org/10.1021/acs.analchem.1c03638.
Deda, Olga, Anastasia Chrysovalantou Chatziioannou, Stella Fasoula, et al. 2017. “Sample Preparation Optimization in Fecal Metabolic Profiling.” Journal of Chromatography B, Advances in mass spectrometry-based applications, vol. 1047 (March): 115–23. https://doi.org/10.1016/j.jchromb.2016.06.047.
Déglon, Jérôme, Aurelie Thomas, Patrice Mangin, and Christian Staub. 2012. “Dried Blood Spots for Quantification of Pharmaceutical Compounds Using LC-MS.” Journal of Pharmaceutical and Biomedical Analysis 49 (4): 1034–39. https://doi.org/10.1016/j.jpba.2009.01.002.
Denniff, Philip, and Neil Spooner. 2014. “Volumetric Absorptive Microsampling – a Dried Sample Collection Technique for Quantitative Bioanalysis.” Analytical Chemistry 86 (16): 8489–95. https://doi.org/10.1021/ac5022562.
Doppler, Maria, Bernhard Kluger, Christoph Bueschl, et al. 2016. “Stable Isotope-Assisted Evaluation of Different Extraction Solvents for Untargeted Metabolomics of Plants.” International Journal of Molecular Sciences 17 (7). https://doi.org/10.3390/ijms17071017.
Gil, Andres, David Siegel, Hjalmar Permentier, Dirk-Jan Reijngoud, Frank Dekker, and Rainer Bischoff. 2015. “Stability of Energy Metabolites—An Often Overlooked Issue in Metabolomics Studies: A Review.” ELECTROPHORESIS 36 (18): 2156–69. https://doi.org/10.1002/elps.201500031.
Hernandes, Vinicius Veri, Coral Barbas, and Danuta Dudzik. 2017. “A Review of Blood Sample Handling and Pre-Processing for Metabolomics Studies.” ELECTROPHORESIS 38 (18): 2232–41. https://doi.org/10.1002/elps.201700086.
Ibáñez, Clara, Carolina Simó, Mine Palazoglu, and Alejandro Cifuentes. 2017. GC-MS Based Metabolomics of Colon Cancer Cells Using Different Extraction Solvents.” Analytica Chimica Acta 986 (September): 48–56. https://doi.org/10.1016/j.aca.2017.07.019.
Kim, Jungyeon, Joong Kyong Ahn, Yu Eun Cheong, Sung-Joon Lee, Hoon-Suk Cha, and Kyoung Heon Kim. 2020. “Systematic Re-Evaluation of the Long-Used Standard Protocol of Urease-Dependent Metabolome Sample Preparation.” PloS One 15 (3): e0230072. https://doi.org/10.1371/journal.pone.0230072.
Laparre, Jérôme, Zied Kaabia, Mark Mooney, et al. 2017. “Impact of Storage Conditions on the Urinary Metabolomics Fingerprint.” Analytica Chimica Acta 951 (January): 99–107. https://doi.org/10.1016/j.aca.2016.11.055.
Li, Xin, Chuang Wang, and Janusz Pawliszyn. 2022. “Recent Developments in Solid-Phase Microextraction for in Vivo Sampling of Living Systems.” Analytica Chimica Acta 1225: 340244. https://doi.org/10.1016/j.aca.2022.340244.
Lin, Ching Yu, Huifeng Wu, Ronald S. Tjeerdema, and Mark R. Viant. 2007. “Evaluation of Metabolite Extraction Strategies from Tissue Samples Using NMR Metabolomics.” Metabolomics 3 (1): 55–67. https://doi.org/10.1007/s11306-006-0043-1.
Loftfield, Erikka, Emily Vogtmann, Joshua N. Sampson, et al. 2016. “Comparison of Collection Methods for Fecal Samples for Discovery Metabolomics in Epidemiologic Studies.” Cancer Epidemiology and Prevention Biomarkers 25 (11): 1483–90. https://doi.org/10.1158/1055-9965.EPI-16-0409.
Lu, Wenyun, Xiaoyang Su, Matthias S. Klein, Ian A. Lewis, Oliver Fiehn, and Joshua D. Rabinowitz. 2017. “Metabolite Measurement: Pitfalls to Avoid and Practices to Follow.” Annual Review of Biochemistry 86 (1): 277–304. https://doi.org/10.1146/annurev-biochem-061516-044952.
Luo, Xian, and Liang Li. 2017. “Metabolomics of Small Numbers of Cells: Metabolomic Profiling of 100, 1000, and 10000 Human Breast Cancer Cells.” Analytical Chemistry 89 (21): 11664–71. https://doi.org/10.1021/acs.analchem.7b03100.
Mahmud, Iqbal, Sandi Sternberg, Michael Williams, and Timothy J. Garrett. 2017. “Comparison of Global Metabolite Extraction Strategies for Soybeans Using UHPLC-HRMS.” Analytical and Bioanalytical Chemistry 409 (26): 6173–80. https://doi.org/10.1007/s00216-017-0557-6.
Manier, Svenja K, Annalena Keller, and Markus R Meyer. 2021. “Dried Blood Spots for Metabolomics.” Analytical Chemistry 93 (1): 593–600. https://doi.org/10.1021/acs.analchem.0c04282.
Miyagawa, Hiromi, and Takeshi Bamba. 2019. “Comparison of Sequential Derivatization with Concurrent Methods for GC/MS-based Metabolomics.” Journal of Bioscience and Bioengineering 127 (2): 160–68. https://doi.org/10.1016/j.jbiosc.2018.07.015.
Osipenko, Sergey, Alexander Zherebker, Lidiia Rumiantseva, Oxana Kovaleva, Evgeny N. Nikolaev, and Yury Kostyukevich. 2022. “Oxygen Isotope Exchange Reaction for Untargeted LCMS Analysis.” Journal of the American Society for Mass Spectrometry 33 (2): 390–98. https://doi.org/10.1021/jasms.1c00383.
Pawliszyn, Janusz. 2012. “Theory of Solid-Phase Microextraction.” Handbook of Solid Phase Microextraction, 13–59. https://doi.org/10.1016/B978-0-12-416017-0.00002-4.
Reyes-Garcés, Nathaly, Emanuela Gionfriddo, German Augusto Gómez-Rı́os, et al. 2018. “Advances in in Vivo Solid-Phase Microextraction (SPME) for Metabolomics.” Analytical Chemistry 90 (1): 302–60. https://doi.org/10.1021/acs.analchem.7b04502.
Roszkowska, Anna, Miao Yu, Vincent Bessonneau, Leslie Bragg, Mark Servos, and Janusz Pawliszyn. 2018. “Tissue Storage Affects Lipidome Profiling in Comparison to in Vivo Microsampling Approach.” Scientific Reports 8 (1): 6980. https://doi.org/10.1038/s41598-018-25428-2.
Sitnikov, Dmitri G., Cian S. Monnin, and Dajana Vuckovic. 2016. “Systematic Assessment of Seven Solvent and Solid-Phase Extraction Methods for Metabolomics Analysis of Human Plasma by LC-MS.” Scientific Reports 6 (December). https://doi.org/10.1038/srep38885.
Wandro, Stephen, Lisa Carmody, Tara Gallagher, John J. LiPuma, and Katrine Whiteson. 2017. “Making It Last: Storage Time and Temperature Have Differential Impacts on Metabolite Profiles of Airway Samples from Cystic Fibrosis Patients.” mSystems 2 (6). https://doi.org/10.1128/mSystems.00100-17.
Wang, Suping, Xiaojuan Jiang, Rong Ding, et al. 2022. MS-IDF: A Software Tool for Nontargeted Identification of Endogenous Metabolites After Chemical Isotope Labeling Based on a Narrow Mass Defect Filter.” Analytical Chemistry 94 (7): 3194–202. https://doi.org/10.1021/acs.analchem.1c04719.
Wu, Yiman, and Liang Li. 2016. “Sample Normalization Methods in Quantitative Metabolomics.” Journal of Chromatography A, Editors’ Choice X, vol. 1430 (January): 80–95. https://doi.org/10.1016/j.chroma.2015.12.007.
Yu, Miao, Sofia Lendor, Anna Roszkowska, et al. 2020. “Metabolic Profile of Fish Muscle Tissue Changes with Sampling Method, Storage Strategy and Time.” Analytica Chimica Acta 1136 (November): 42–50. https://doi.org/10.1016/j.aca.2020.08.050.
Zhao, Shuang, and Liang Li. 2020. “Chemical Derivatization in LC-MS-based Metabolomics Study.” TrAC Trends in Analytical Chemistry 131 (October): 115988. https://doi.org/10.1016/j.trac.2020.115988.
Zubeldia-Varela, Elisa, Domingo Barber, Coral Barbas, Marina Perez-Gordo, and David Rojo. 2020. “Sample Pre-Treatment Procedures for the Omics Analysis of Human Gut Microbiota: Turning Points, Tips and Tricks for Gene Sequencing and Metabolomics.” Journal of Pharmaceutical and Biomedical Analysis 191 (November): 113592. https://doi.org/10.1016/j.jpba.2020.113592.