Volume 75, Issue 1
BIOMETRIC METHODOLOGY

Registration for exponential family functional data

Julia Wrobel

Corresponding Author

E-mail address: jw3134@cumc.columbia.edu

Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, New York, U.S.A.

email: jw3134@cumc.columbia.eduSearch for more papers by this author
Vadim Zipunnikov

Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, Maryland, U.S.A.

Search for more papers by this author
Jennifer Schrack

Department of Epidemiology, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, Maryland, U.S.A.

Longitudinal Studies Section, Translational Gerontology Branch, National Institute on Aging, National Institutes of Health, Bethesda, Maryland, U.S.A.

Search for more papers by this author
Jeff Goldsmith

Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, New York, U.S.A.

Search for more papers by this author
First published: 21 August 2018
Citations: 5

Summary

We introduce a novel method for separating amplitude and phase variability in exponential family functional data. Our method alternates between two steps: the first uses generalized functional principal components analysis to calculate template functions, and the second estimates smooth warping functions that map observed curves to templates. Existing approaches to registration have primarily focused on continuous functional observations, and the few approaches for discrete functional data require a pre‐smoothing step; these methods are frequently computationally intensive. In contrast, we focus on the likelihood of the observed data and avoid the need for preprocessing, and we implement both steps of our algorithm in a computationally efficient way. Our motivation comes from the Baltimore Longitudinal Study on Aging, in which accelerometer data provides valuable insights into the timing of sedentary behavior. We analyze binary functional data with observations each minute over 24 hours for 592 participants, where values represent activity and inactivity. Diurnal patterns of activity are obscured due to misalignment in the original data but are clear after curves are aligned. Simulations designed to mimic the application indicate that the proposed methods outperform competing approaches in terms of estimation accuracy and computational efficiency. Code for our method and simulations is publicly available.

Number of times cited according to CrossRef: 5

  • Phase Variation and Fréchet Means, An Invitation to Statistics in Wasserstein Space, 10.1007/978-3-030-38438-8_4, (75-116), (2020).
  • Intensity warping for multisite MRI harmonization, NeuroImage, 10.1016/j.neuroimage.2020.117242, (117242), (2020).
  • Nonnegative decomposition of functional count data, Biometrics, 10.1111/biom.13220, 0, 0, (2020).
  • mwarp1d: Manual one-dimensional data warping in Python and PyQt, Journal of Open Source Software, 10.21105/joss.01870, 4, 44, (1870), (2019).
  • Accelerometry Data in Health Research: Challenges and Opportunities, Statistics in Biosciences, 10.1007/s12561-018-9227-2, (2019).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.