Volume 18, Issue 5
Research Article

Correlation optimized warping and dynamic time warping as preprocessing methods for chromatographic data

Giorgio Tomasi

Corresponding Author

E-mail address: gt@kvl.dk

The Royal Veterinary and Agricultural University (KVL), Department of Food Science, Food Technology, Rolighedsvej 30, DK‐1958 Frederiksberg C, Denmark

The Royal Veterinary and Agricultural University (KVL), Department of Food Science, Food Technology, Rolighedsvej 30, DK‐1958 Frederiksberg C, Denmark.Search for more papers by this author
Frans van den Berg

The Royal Veterinary and Agricultural University (KVL), Department of Food Science, Food Technology, Rolighedsvej 30, DK‐1958 Frederiksberg C, Denmark

Search for more papers by this author
Claus Andersson

The Royal Veterinary and Agricultural University (KVL), Department of Food Science, Food Technology, Rolighedsvej 30, DK‐1958 Frederiksberg C, Denmark

Search for more papers by this author
First published: 16 July 2004
Citations: 439

Abstract

Two different algorithms for time‐alignment as a preprocessing step in linear factor models are studied. Correlation optimized warping and dynamic time warping are both presented in the literature as methods that can eliminate shift‐related artifacts from measurements by correcting a sample vector towards a reference. In this study both the theoretical properties and the practical implications of using signal warping as preprocessing for chromatographic data are investigated. The connection between the two algorithms is also discussed. The findings are illustrated by means of a case study of principal component analysis on a real data set, including manifest retention time artifacts, of extracts from coffee samples stored under different packaging conditions for varying storage times. We concluded that for the data presented here dynamic time warping with rigid slope constraints and correlation optimized warping are superior to unconstrained dynamic time warping; both considerably simplify interpretation of the factor model results. Unconstrained dynamic time warping was found to be too flexible for this chromatographic data set, resulting in an overcompensation of the observed shifts and suggesting the unsuitability of this preprocessing method for this type of signals. Copyright © 2004 John Wiley & Sons, Ltd.

Number of times cited according to CrossRef: 439

  • A Malware Obfuscation AI Technique to Evade Antivirus Detection in Counter Forensic Domain, Enabling AI Applications in Data Science, 10.1007/978-3-030-52067-0_27, (597-615), (2021).
  • Mass spectrometry metabolomic data handling for biomarker discovery, Proteomic and Metabolomic Approaches to Biomarker Discovery, 10.1016/B978-0-12-818607-7.00021-9, (369-388), (2020).
  • Defining a standardized methodology for the determination of the antioxidant capacity: case study of Pistacia atlantica leaves , The Analyst, 10.1039/C9AN01643K, (2020).
  • Management and interpretation of capillary chromatography-mass spectrometry data, Hyphenations of Capillary Chromatography with Mass Spectrometry, 10.1016/B978-0-12-809638-3.00012-0, (449-480), (2020).
  • Suspect and non-target screening of acutely toxic Prymnesium parvum, Science of The Total Environment, 10.1016/j.scitotenv.2020.136835, 715, (136835), (2020).
  • Real-time monitoring and fault detection of pulsed-spray fluid-bed granulation using near-infrared spectroscopy and multivariate process trajectories, Particuology, 10.1016/j.partic.2020.02.003, (2020).
  • Recent applications of chemometrics in one‐ and two‐dimensional chromatography, Journal of Separation Science, 10.1002/jssc.202000011, 43, 9-10, (1678-1727), (2020).
  • Pre-processing Methods, Reference Module in Chemistry, Molecular Sciences and Chemical Engineering, 10.1016/B978-0-12-409547-2.14878-4, (2020).
  • Multivariate Statistical Process Control and Process Control, Using Latent Variables, Reference Module in Chemistry, Molecular Sciences and Chemical Engineering, 10.1016/B978-0-12-409547-2.14887-5, (2020).
  • Variable Shift and Alignment, Reference Module in Chemistry, Molecular Sciences and Chemical Engineering, 10.1016/B978-0-12-409547-2.14886-3, (2020).
  • New Algorithm for Aligning Biological Data, Embedded Systems and Artificial Intelligence, 10.1007/978-981-15-0947-6_68, (713-721), (2020).
  • Introducing a novel procedure for peak alignment in one-dimensional 1 H-NMR spectroscopy: a prerequisite for chemometric analyses of wine samples , Analytical Methods, 10.1039/D0AY01011A, (2020).
  • Compliance with EU vs. extra-EU labelled geographical provenance in virgin olive oils: A rapid untargeted chromatographic approach based on volatile compounds, LWT, 10.1016/j.lwt.2020.109566, (109566), (2020).
  • Exploring Correlation Network for Cheating Detection, ACM Transactions on Intelligent Systems and Technology, 10.1145/3364221, 11, 1, (1-23), (2020).
  • An innovative chemometric approach for simultaneous determination of polycyclic aromatic hydrocarbons in oil-contaminated waters based on dispersive micro-solid phase extraction followed by gas chromatography, Microchemical Journal, 10.1016/j.microc.2020.105407, (105407), (2020).
  • Possible metabolic switch between environmental and pathogenic Pseudomonas aeruginosa strains: 1H NMR based metabolomics study, Journal of Pharmaceutical and Biomedical Analysis, 10.1016/j.jpba.2020.113369, (113369), (2020).
  • The Impact of Delayed Storage on the Measured Proteome and Metabolome of Human Cerebrospinal Fluid, Clinical Chemistry, 10.1373/clinchem.2011.167601, 57, 12, (1703-1711), (2020).
  • Evaluation of MDA-MB-468 Cell Culture Media Analysis in Predicting Triple-Negative Breast Cancer Patient Sera Metabolic Profiles, Metabolites, 10.3390/metabo10050173, 10, 5, (173), (2020).
  • Flash Gas Chromatography in Tandem with Chemometrics: A Rapid Screening Tool for Quality Grades of Virgin Olive Oils, Foods, 10.3390/foods9070862, 9, 7, (862), (2020).
  • Evaluation of partial least-squares regression with multivariate analytical figures of merit for determination of 10 pesticides in milk, International Journal of Environmental Analytical Chemistry, 10.1080/03067319.2020.1745198, (1-11), (2020).
  • Phenolic Metabolites from Barley in Contribution to Phenome in soil Moisture Deficit, International Journal of Molecular Sciences, 10.3390/ijms21176032, 21, 17, (6032), (2020).
  • Multi-reference factor analysis: low-rank covariance estimation under unknown translations, Information and Inference: A Journal of the IMA, 10.1093/imaiai/iaaa019, (2020).
  • Food Phenotyping: Recording and Processing of Non-Targeted Liquid Chromatography Mass Spectrometry Data for Verifying Food Authenticity, Molecules, 10.3390/molecules25173972, 25, 17, (3972), (2020).
  • Recent Advances and Challenges in Steroid Metabolomics for Biomarker Discovery, Current Medicinal Chemistry, 10.2174/0929867324666171113120810, 26, 1, (29-45), (2019).
  • Potentiality of PARAFAC approaches for simultaneous determination of N-acetylcysteine and acetaminophen based on the second-order data obtained from differential pulse voltammetry, Talanta, 10.1016/j.talanta.2018.08.092, 192, (439-447), (2019).
  • NMR Spectroscopy Methods in Metabolic Phenotyping, The Handbook of Metabolic Phenotyping, 10.1016/B978-0-12-812293-8.00002-5, (53-96), (2019).
  • Multi-way chromatographic calibration - A review, Journal of Chromatography A, 10.1016/j.chroma.2019.01.012, (2019).
  • Gas Chromatographic Fingerprint Analysis of Secondary Metabolites of Stachys lanata (Stachys byzantine C. Koch) Combined with Antioxidant Activity Modelling Using Multivariate Chemometric Methods, Journal of Chromatography A, 10.1016/j.chroma.2019.06.002, (2019).
  • Contribution to second-order calibration based on multivariate curve resolution with and without previous chromatographic synchronization, Analytica Chimica Acta, 10.1016/j.aca.2019.06.038, (2019).
  • A tiered analytical approach for target, non-target and suspect screening analysis of polar transformation products of polycyclic aromatic compounds, Chemosphere, 10.1016/j.chemosphere.2019.06.149, (2019).
  • Sub-second quantum cascade laser based infrared spectroscopic ellipsometry, Optics Letters, 10.1364/OL.44.003426, 44, 14, (3426), (2019).
  • Peak Alignment of Gas Chromatography-Mass Spectrometry Data with Deep Learning, Journal of Chromatography A, 10.1016/j.chroma.2019.460476, (460476), (2019).
  • Wavelet functional principal component analysis for batch process monitoring, Chemometrics and Intelligent Laboratory Systems, 10.1016/j.chemolab.2019.103897, (103897), (2019).
  • Fingerprints of volatile flavor compounds from southern stinky tofu brine with headspace solid‐phase microextraction/gas chromatography–mass spectrometry and chemometric methods, Food Science & Nutrition, 10.1002/fsn3.943, 7, 2, (890-896), (2019).
  • Monitoring type 2 diabetes from volatile faecal metabolome in Cushing’s syndrome and single Afmid mouse models via a longitudinal study, Scientific Reports, 10.1038/s41598-019-55339-9, 9, 1, (2019).
  • undefined, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW), 10.1109/ICCVW.2019.00198, (1589-1598), (2019).
  • Differentiation of weathered chemically dispersed oil from weathered crude oil, Environmental Monitoring and Assessment, 10.1007/s10661-019-7392-5, 191, 5, (2019).
  • Analytical Methods for Detection of Plant Metabolomes Changes in Response to Biotic and Abiotic Stresses, International Journal of Molecular Sciences, 10.3390/ijms20020379, 20, 2, (379), (2019).
  • Balancing Resolution with Analysis Time for Biodiesel–Diesel Fuel Separations Using GC, PCA, and the Mahalanobis Distance, Separations, 10.3390/separations6020028, 6, 2, (28), (2019).
  • Untargeted Metabolomic Profile for the Detection of Prostate Carcinoma—Preliminary Results from PARAFAC2 and PLS–DA Models, Molecules, 10.3390/molecules24173063, 24, 17, (3063), (2019).
  • Tenuazonic acid-induced change in volatile emission from rose plants and its chemometrical analysis, Journal of Plant Diseases and Protection, 10.1007/s41348-019-00269-x, (2019).
  • Application of nuclear magnetic resonance spectroscopy for the detection of metabolic disorders in patients with moderate kidney insufficiency, Journal of Pharmaceutical and Biomedical Analysis, 10.1016/j.jpba.2017.10.037, 149, (1-8), (2018).
  • An image analysis of TLC patterns for quality control of saffron based on soil salinity effect: A strategy for data (pre)-processing, Food Chemistry, 10.1016/j.foodchem.2017.07.012, 239, (831-839), (2018).
  • Development of a fast HPLC-DAD method for simultaneous quantitation of three immunosuppressant drugs in whole blood samples using intelligent chemometrics resolving of coeluting peaks in the presence of blood interferences, Journal of Chromatography B, 10.1016/j.jchromb.2017.12.012, 1073, (69-79), (2018).
  • Time-series averaging using constrained dynamic time warping with tolerance, Pattern Recognition, 10.1016/j.patcog.2017.08.015, 74, (77-89), (2018).
  • Exploring the effects of sparsity constraint on the ranges of feasible solutions for resolution of GC-MS data, Chemometrics and Intelligent Laboratory Systems, 10.1016/j.chemolab.2017.12.015, 173, (30-40), (2018).
  • A chemometric approach for characterization of serum transthyretin in familial amyloidotic polyneuropathy type I (FAP-I) by electrospray ionization-ion mobility mass spectrometry, Talanta, 10.1016/j.talanta.2017.12.072, 181, (87-94), (2018).
  • The Pixel-Based Chemometric Approach for Oil Spill Identification and Hydrocarbon Source Differentiation, Oil Spill Environmental Forensics Case Studies, 10.1016/B978-0-12-804434-6.00021-5, (443-463), (2018).
  • Mass Spectrometry-Based Metabolomic Analysis, Reference Module in Life Sciences, 10.1016/B978-0-12-809633-8.20252-5, (2018).
  • Automatic data analysis workflow for ultra-high performance liquid chromatography-high resolution mass spectrometry-based metabolomics, Journal of Chromatography A, 10.1016/j.chroma.2018.11.070, (2018).
  • Anchor assisted warping of the chromatograms: A novel procedure to correct the drifts in the chromatographic peak positions, Talanta, 10.1016/j.talanta.2018.11.096, (2018).
  • Removal of volatile gasoline compounds by indoor potted plants studied by pixel-based fingerprinting analysis, Chemosphere, 10.1016/j.chemosphere.2018.12.125, (2018).
  • Optimizing the process of reference selection for correlation optimised warping (COW) and interval correlation shifting (icoshift) analysis: automating the chromatographic alignment procedure, Analytical Methods, 10.1039/C7AY02340E, 10, 2, (190-203), (2018).
  • Predicting fishiness off-flavour and identifying compounds of lipid oxidation in dairy powders by SPME-GC/MS and machine learning, International Dairy Journal, 10.1016/j.idairyj.2017.09.009, 77, (19-28), (2018).
  • Comparison of Quantitative and Semiquantitative Methods in Source Identification Following the OSPAR Oil Spill, in Paraná, Brazil, Oil Spill Environmental Forensics Case Studies, 10.1016/B978-0-12-804434-6.00025-2, (515-561), (2018).
  • Recommended strategies for spectral processing and post-processing of 1D 1H-NMR data of biofluids with a particular focus on urine, Metabolomics, 10.1007/s11306-018-1321-4, 14, 3, (2018).
  • NMR and multivariate data analysis to assess traceability of argentine citrus, Microchemical Journal, 10.1016/j.microc.2018.05.037, 141, (264-270), (2018).
  • Potentiality of independent component regression in assessment of the peaks responsible for antimicrobial activity of Satureja hortensis L. and Oliveria decumbens Vent. using GC–MS, Journal of the Iranian Chemical Society, 10.1007/s13738-018-1398-8, 15, 9, (2007-2016), (2018).
  • Food Texture Quantification Using a Magnetic Food Texture Sensor and Dynamic Time Warping, Food Science and Technology Research, 10.3136/fstr.24.257, 24, 2, (257-263), (2018).
  • Chemometric assisted correlation optimized warping of chromatograms: optimizing the computational time for correcting the drifts in chromatographic peak positions, Analytical Methods, 10.1039/C8AY00084K, 10, 9, (1006-1014), (2018).
  • Modeling second-order data for classification issues: Data characteristics, algorithms, processing procedures and applications, TrAC Trends in Analytical Chemistry, 10.1016/j.trac.2018.07.022, 107, (151-168), (2018).
  • Quality control of saffron and evaluation of potential adulteration by means of thin layer chromatography-image analysis and chemometrics methods, Food Control, 10.1016/j.foodcont.2018.02.026, 90, (48-57), (2018).
  • Introducing an integral optimised warping (IOW) approach for achieving swift alignment of drifted chromatographic peaks: an optimisation of the correlation optimised warping (COW) technique, Analytical Methods, 10.1039/C8AY00963E, 10, 23, (2764-2774), (2018).
  • Multivariate Data Analysis for Enhancing Process Understanding, Monitoring, and Control—Active Pharmaceutical Ingredient Manufacturing Case Studies, Multivariate Analysis in the Pharmaceutical Industry, 10.1016/B978-0-12-811065-2.00009-6, (185-210), (2018).
  • Multiway Calibration Approaches for Quality Control of Food Samples, Food Safety and Preservation, 10.1016/B978-0-12-814956-0.00006-8, (143-165), (2018).
  • Mathematical Pre-processing, Introduction to Multivariate Calibration, 10.1007/978-3-319-97097-4, (139-158), (2018).
  • Chemometric Strategies for Peak Detection and Profiling from Multidimensional Chromatography, PROTEOMICS, 10.1002/pmic.201700327, 18, 18, (2018).
  • Metabolomics of Body Fluids, Integration of Omics Approaches and Systems Biology for Clinical Applications, 10.1002/9781119183952, (173-195), (2018).
  • On the Characterization and Correlation of Compositional, Antioxidant and Colour Profile of Common and Balsamic Vinegars, Antioxidants, 10.3390/antiox7100139, 7, 10, (139), (2018).
  • Determination of Three Main Chlorogenic Acids in Water Extracts of Coffee Leaves by Liquid Chromatography Coupled to an Electrochemical Detector, Antioxidants, 10.3390/antiox7100143, 7, 10, (143), (2018).
  • Practical Methods for Vehicle Speed Estimation Using a Microprocessor-Embedded System with AMR Sensors, Sensors, 10.3390/s18072225, 18, 7, (2225), (2018).
  • Current challenges in second‐order calibration of hyphenated chromatographic data for analysis of highly complex samples, Journal of Chemometrics, 10.1002/cem.2976, 32, 12, (2017).
  • Total Ion Spectra versus Segmented Total Ion Spectra as Preprocessing Tools for Gas Chromatography – Mass Spectrometry Data, Journal of Forensic Sciences, 10.1111/1556-4029.13657, 63, 4, (1059-1068), (2017).
  • Multivariate statistical process control (MSPC) using Raman spectroscopy for in-line culture cell monitoring considering time-varying batches synchronized with correlation optimized warping (COW), Analytica Chimica Acta, 10.1016/j.aca.2016.11.064, 952, (9-17), (2017).
  • Development and validation of a method for the determination of regulated fragrance allergens by High-Performance Liquid Chromatography and Parallel Factor Analysis 2, Journal of Chromatography A, 10.1016/j.chroma.2017.10.034, 1526, (82-92), (2017).
  • Statistical process control of cocrystallization processes: A comparison between OPLS and PLS, International Journal of Pharmaceutics, 10.1016/j.ijpharm.2017.01.052, 520, 1-2, (29-38), (2017).
  • NMRSpec: An integrated software package for processing and analyzing one dimensional nuclear magnetic resonance spectra, Chemometrics and Intelligent Laboratory Systems, 10.1016/j.chemolab.2017.01.005, 162, (142-148), (2017).
  • Automatic time-shift alignment method for chromatographic data analysis, Scientific Reports, 10.1038/s41598-017-00390-7, 7, 1, (2017).
  • Biodiversity in targeted metabolomics analysis of filamentous fungal pathogens by 1H NMR-based studies, World Journal of Microbiology and Biotechnology, 10.1007/s11274-017-2285-7, 33, 7, (2017).
  • Mass-spectra-based peak alignment for automatic nontargeted metabolic profiling analysis for biomarker screening in plant samples, Journal of Chromatography A, 10.1016/j.chroma.2017.07.044, 1513, (201-209), (2017).
  • Discrete wavelet assisted correlation optimised warping of chromatograms: optimizing the computational time for correcting the drifts in peak positions, Analytical Methods, 10.1039/C7AY00268H, 9, 13, (2049-2058), (2017).
  • Metabolomic analysis of the effects of cadmium and copper treatment in Oryza sativa L. using untargeted liquid chromatography coupled to high resolution mass spectrometry and all-ion fragmentation, Metallomics, 10.1039/C6MT00279J, 9, 6, (660-675), (2017).
  • Metabolic profiles of exudates from chronic leg ulcerations, Journal of Pharmaceutical and Biomedical Analysis, 10.1016/j.jpba.2017.01.018, 137, (13-22), (2017).
  • Serum and urine 1H NMR-based metabolomics in the diagnosis of selected thyroid diseases, Scientific Reports, 10.1038/s41598-017-09203-3, 7, 1, (2017).
  • Constraint randomised non-negative factor analysis (CRNNFA): an alternate chemometrics approach for analysing the biochemical data sets, The Analyst, 10.1039/C7AN00274B, 142, 11, (1916-1928), (2017).
  • AntDAS: Automatic Data Analysis Strategy for UPLC–QTOF-Based Nontargeted Metabolic Profiling Analysis, Analytical Chemistry, 10.1021/acs.analchem.7b03160, 89, 20, (11083-11090), (2017).
  • Selecting local constraint for alignment of batch process data with dynamic time warping, Chemometrics and Intelligent Laboratory Systems, 10.1016/j.chemolab.2017.05.019, 167, (161-170), (2017).
  • undefined, 2017 IEEE 6th Non-Volatile Memory Systems and Applications Symposium (NVMSA), 10.1109/NVMSA.2017.8064474, (1-6), (2017).
  • Marker discovery in volatolomics based on systematic alignment of GC-MS signals: Application to food authentication, Analytica Chimica Acta, 10.1016/j.aca.2017.08.019, 991, (58-67), (2017).
  • Joint Bounding of Peaks Across Samples Improves Differential Analysis in Mass Spectrometry-Based Metabolomics, Analytical Chemistry, 10.1021/acs.analchem.6b04719, 89, 6, (3517-3523), (2017).
  • Nuclear Magnetic Resonance Strategies for Metabolic Analysis, Metabolomics: From Fundamentals to Clinical Applications, 10.1007/978-3-319-47656-8_3, (45-76), (2017).
  • Recognition and alignment of variables from UV–vis chromatograms and application to industrial enzyme digests classification, Chemometrics and Intelligent Laboratory Systems, 10.1016/j.chemolab.2017.04.005, 165, (46-55), (2017).
  • An Exemplar-Based Approach to Frequency Warping for Voice Conversion, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 10.1109/TASLP.2017.2723721, 25, 10, (1863-1876), (2017).
  • Analysis of Volatile Compounds by Advanced Analytical Techniques and Multivariate Chemometrics, Chemical Reviews, 10.1021/acs.chemrev.6b00698, 117, 9, (6399-6422), (2017).
  • Metabolic disruption of zebrafish (Danio rerio) embryos by bisphenol A. An integrated metabolomic and transcriptomic approach, Environmental Pollution, 10.1016/j.envpol.2017.07.095, 231, (22-36), (2017).
  • Automated Integration of a UPLC Glycomic Profile, High-Throughput Glycomics and Glycoproteomics, 10.1007/978-1-4939-6493-2_17, (217-233), (2017).
  • Data analysis, Liquid Chromatography, 10.1016/B978-0-12-805393-5.00021-X, (515-531), (2017).
  • Recognizing methods for epicenter-neighboring orbits with ionospheric information from DEMETER satellite data, Advances in Space Research, 10.1016/j.asr.2017.05.044, 60, 5, (980-990), (2017).
  • Untargeted metabolomic profiling of seminal plasma in nonobstructive azoospermia men: A noninvasive detection of spermatogenesis, Biomedical Chromatography, 10.1002/bmc.3931, 31, 8, (2017).
  • Application of a sparseness constraint in multivariate curve resolution – Alternating least squares, Analytica Chimica Acta, 10.1016/j.aca.2017.08.021, (2017).
  • See more

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.