93,024 results for "Principal components analysis"
Multivariate Functional Principal Component Analysis for Data Observed on Different (Dimensional) Domains
Existing approaches for multivariate functional principal component analysis are restricted to data on the same one-dimensional interval. The presented approach focuses on multivariate functional data on different domains that may differ in dimension, such as functions and images. The theoretical basis for multivariate functional principal component analysis is given in terms of a Karhunen-Loève Theorem. For the practically relevant case of a finite Karhunen-Loève representation, a relationship between univariate and multivariate functional principal component analysis is established. This offers an estimation strategy to calculate multivariate functional principal components and scores based on their univariate counterparts. For the resulting estimators, asymptotic results are derived. The approach can be extended to general finite univariate expansions in arbitrary, not necessarily orthonormal, bases. It is also applicable to sparse functional data or data with measurement error. A flexible R implementation is available on CRAN. The new method is shown to be competitive with existing approaches for data observed on a common one-dimensional domain. The motivating application is a neuroimaging study, where the goal is to explore how longitudinal trajectories of a neuropsychological test score covary with FDG-PET brain scans at baseline. Supplementary material, including detailed proofs, additional simulation results, and software, is available online.
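The estimation strategy sketched in the abstract (multivariate components built from univariate ones) can be illustrated in a few lines. The following is a minimal sketch under strong assumptions: densely observed curves on regular grids, univariate FPCA approximated by a plain PCA of the discretized curves, and multivariate scores obtained by eigendecomposing the covariance of the stacked univariate scores. All names are illustrative; this is not the CRAN package's API.

```python
# Minimal sketch of the univariate-to-multivariate FPCA estimation idea, assuming
# densely observed curves on regular grids; variable names are illustrative.
import numpy as np

def univariate_fpca(X, n_comp):
    """PCA of discretized curves X (n_subjects x n_grid); returns scores and basis."""
    Xc = X - X.mean(axis=0)
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    scores = U[:, :n_comp] * s[:n_comp]          # univariate FPC scores
    return scores, Vt[:n_comp]                   # scores, eigenfunctions (rows)

def multivariate_fpca(elements, n_comp_uni=5, n_comp_multi=3):
    """elements: list of arrays, one per functional element (possibly different grids)."""
    all_scores, bases = [], []
    for X in elements:
        sc, V = univariate_fpca(X, n_comp_uni)
        all_scores.append(sc)
        bases.append(V)
    Z = np.hstack(all_scores)                    # stacked univariate scores
    C = np.cov(Z, rowvar=False)                  # covariance of stacked scores
    w, A = np.linalg.eigh(C)
    order = np.argsort(w)[::-1][:n_comp_multi]
    rho = Z @ A[:, order]                        # multivariate FPC scores
    # Reconstruction of multivariate eigenfunctions from `bases` and A is omitted here.
    return rho, A[:, order], bases

# Example with two elements observed on grids of different lengths
rng = np.random.default_rng(0)
X1 = rng.standard_normal((40, 100))
X2 = rng.standard_normal((40, 64))
rho, A, bases = multivariate_fpca([X1, X2])
print(rho.shape)   # (40, 3)
```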
Principal Component Analysis of High-Frequency Data
We develop the necessary methodology to conduct principal component analysis at high frequency. We construct estimators of realized eigenvalues, eigenvectors, and principal components, and provide the asymptotic distribution of these estimators. Empirically, we study the high-frequency covariance structure of the constituents of the S&P 100 Index using as little as one week of high-frequency data at a time, and examine whether it is compatible with the evidence accumulated over decades of lower frequency returns. We find a surprising consistency between the low- and high-frequency structures. During the recent financial crisis, the first principal component becomes increasingly dominant, explaining up to 60% of the variation on its own, while the second principal component drives the common variation of financial sector stocks. Supplementary materials for this article are available online.
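As a rough illustration of the objects involved, the sketch below forms a realized covariance matrix from synthetic intraday returns and eigendecomposes it into realized eigenvalues, eigenvectors, and a first principal component path. It deliberately ignores microstructure noise and asynchronous observation, which the paper's estimators account for.

```python
# Simplified illustration of realized PCA on intraday returns: build a realized
# covariance matrix and extract its eigenvalues/eigenvectors.
import numpy as np

rng = np.random.default_rng(1)
n_assets, n_intervals = 10, 390             # e.g. one trading day of 1-minute returns
returns = 0.001 * rng.standard_normal((n_intervals, n_assets))

realized_cov = returns.T @ returns          # realized covariance over the day
eigvals, eigvecs = np.linalg.eigh(realized_cov)
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

explained = eigvals / eigvals.sum()
pc1_path = np.cumsum(returns @ eigvecs[:, 0])   # realized first principal component path
print("share of variation explained by PC1:", round(explained[0], 3))
```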
Detecting and dating structural breaks in functional data without dimension reduction
Methodology that is ‘fully functional’, in the sense that it does not rely on dimension reduction techniques, is proposed to uncover structural breaks in functional data. A thorough asymptotic theory is developed for the fully functional break detection procedure as well as for a break date estimator, under both a fixed and a shrinking break size. The latter result is utilized to derive confidence intervals for the unknown break date. The main results highlight that the fully functional procedures perform best under conditions in which analogous estimators based on functional principal component analysis are at their worst, namely when the feature of interest is orthogonal to the leading principal components of the data. The theoretical findings are confirmed by a Monte Carlo simulation study in finite samples. An application to annual temperature curves illustrates the practical relevance of the proposed procedures.
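To make the ‘fully functional’ idea concrete, here is a generic sketch of a projection-free CUSUM detector: the statistic is the squared L2 norm (approximated by a grid average) of the partial-sum process of the discretized curves, and its argmax serves as a candidate break date. This is an illustrative sketch, not the paper's exact statistic or its asymptotic calibration.

```python
# Hedged sketch of a fully functional CUSUM-type break detector: no projection onto
# principal components; the argmax of the statistic gives a candidate break date.
import numpy as np

def functional_cusum(X):
    """X: (n_curves, n_grid) discretized functional observations."""
    n = X.shape[0]
    total = X.sum(axis=0)
    stats = np.empty(n - 1)
    partial = np.zeros(X.shape[1])
    for k in range(1, n):
        partial += X[k - 1]
        Sk = (partial - (k / n) * total) / np.sqrt(n)
        stats[k - 1] = np.mean(Sk ** 2)         # squared L2 norm (grid average)
    return stats.max(), stats.argmax() + 1      # test statistic, estimated break date

rng = np.random.default_rng(2)
X = rng.standard_normal((200, 50))
X[120:] += 0.5                                  # mean shift after observation 120
stat, khat = functional_cusum(X)
print(stat, khat)                               # khat should land near 120
```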
Independent Component Analysis via Distance Covariance
This article introduces a novel statistical framework for independent component analysis (ICA) of multivariate data. We propose methodology for estimating mutually independent components, and a versatile resampling-based procedure for inference, including misspecification testing. Independent components are estimated by combining a nonparametric probability integral transformation with a generalized nonparametric whitening method based on distance covariance that simultaneously minimizes all forms of dependence among the components. We prove the consistency of our estimator under minimal regularity conditions and detail conditions for consistency under model misspecification, all while placing assumptions on the observations directly, not on the latent components. U statistics of certain Euclidean distances between sample elements are combined to construct a test statistic for mutually independent components. The proposed measures and tests are based on both necessary and sufficient conditions for mutual independence. We demonstrate the improvements of the proposed method over several competing methods in simulation studies, and we apply the proposed ICA approach to two real examples and contrast it with principal component analysis.
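The kind of dependence measure being minimized can be illustrated with the sample distance covariance. The sketch below computes the simpler V-statistic form for two univariate samples (the paper combines U-statistics over multivariate components); it returns a clearly positive value for dependent-but-uncorrelated variables and a near-zero value for independent ones.

```python
# Minimal sketch of sample distance covariance (the V-statistic form), the kind of
# dependence measure a distance-covariance-based whitening step minimizes.
import numpy as np

def distance_covariance_sq(x, y):
    """Squared sample distance covariance between two 1-d samples of equal length."""
    x = np.asarray(x, dtype=float)[:, None]
    y = np.asarray(y, dtype=float)[:, None]
    a = np.abs(x - x.T)                          # pairwise distance matrices
    b = np.abs(y - y.T)
    A = a - a.mean(axis=0) - a.mean(axis=1, keepdims=True) + a.mean()
    B = b - b.mean(axis=0) - b.mean(axis=1, keepdims=True) + b.mean()
    return (A * B).mean()

rng = np.random.default_rng(3)
u = rng.standard_normal(500)
v = u ** 2 + 0.1 * rng.standard_normal(500)      # dependent but uncorrelated
print(distance_covariance_sq(u, v))              # clearly positive
print(distance_covariance_sq(u, rng.standard_normal(500)))  # near zero
```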
Deterministic parallel analysis
Factor analysis and principal component analysis are used in many application areas. The first step, choosing the number of components, remains a serious challenge. Our work proposes improved methods for this important problem. One of the most popular state-of-the-art methods is parallel analysis (PA), which compares the observed factor strengths with simulated strengths under a noise-only model. We first derandomize PA, proposing deterministic PA, which is faster and more reproducible than PA. Both PA and deterministic PA are prone to a shadowing phenomenon in which a strong factor makes it difficult to detect smaller but more interesting factors. We propose deflation to counter shadowing. We also propose to raise the decision threshold to improve estimation accuracy. We prove several consistency results for our methods, and test them in simulations. We also illustrate our methods on data from the Human Genome Diversity Project, where they significantly improve accuracy.
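For context, a minimal sketch of classic randomized parallel analysis, the baseline being derandomized here: a component is retained only if its observed correlation-matrix eigenvalue exceeds a high quantile of eigenvalues computed from noise-only data of the same shape. Counting all exceedances, as below, is a simplification of the usual sequential rule.

```python
# Sketch of classic (randomized) parallel analysis for choosing the number of components.
import numpy as np

def parallel_analysis(X, n_sims=200, quantile=0.95, seed=0):
    rng = np.random.default_rng(seed)
    n, p = X.shape
    obs_eig = np.linalg.eigvalsh(np.corrcoef(X, rowvar=False))[::-1]
    sim_eig = np.empty((n_sims, p))
    for s in range(n_sims):
        noise = rng.standard_normal((n, p))          # noise-only model
        sim_eig[s] = np.linalg.eigvalsh(np.corrcoef(noise, rowvar=False))[::-1]
    threshold = np.quantile(sim_eig, quantile, axis=0)
    return int(np.sum(obs_eig > threshold))          # estimated number of components

rng = np.random.default_rng(4)
factor = rng.standard_normal((300, 1))
X = factor @ rng.standard_normal((1, 20)) + rng.standard_normal((300, 20))
print(parallel_analysis(X))                          # ideally 1 for this one-factor example
```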
ECA: High-Dimensional Elliptical Component Analysis in Non-Gaussian Distributions
We present elliptical component analysis (ECA), a robust alternative to principal component analysis (PCA), for analyzing high-dimensional, elliptically distributed data. ECA estimates the eigenspace of the covariance matrix of the elliptical data. To cope with heavy-tailed elliptical distributions, a multivariate rank statistic is exploited. At the model level, we consider two settings: the leading eigenvectors of the covariance matrix are either nonsparse or sparse. Methodologically, we propose ECA procedures for both nonsparse and sparse settings. Theoretically, we provide both nonasymptotic and asymptotic analyses quantifying the performance of ECA. In the nonsparse setting, we show that ECA's performance is highly related to the effective rank of the covariance matrix. In the sparse setting, the results are twofold: (i) we show that the sparse ECA estimator based on a combinatoric program attains the optimal rate of convergence; (ii) based on some recent developments in estimating sparse leading eigenvectors, we show that a computationally efficient sparse ECA estimator attains the optimal rate of convergence under a suboptimal scaling. Supplementary materials for this article are available online.
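One concrete example of a multivariate rank statistic used for elliptical data is the sample multivariate Kendall's tau matrix, whose leading eigenvectors track those of the covariance matrix under ellipticity while being insensitive to heavy tails. Whether this is exactly the statistic exploited in the paper is an assumption; the sketch below is purely illustrative.

```python
# Hedged sketch of a multivariate rank statistic: the sample multivariate Kendall's tau.
import numpy as np

def multivariate_kendalls_tau(X):
    """X: (n, d) data matrix; returns the d x d sample multivariate Kendall's tau."""
    n, d = X.shape
    K = np.zeros((d, d))
    count = 0
    for i in range(n):
        for j in range(i + 1, n):
            diff = X[i] - X[j]
            norm2 = diff @ diff
            if norm2 > 0:
                K += np.outer(diff, diff) / norm2
                count += 1
    return K / count

rng = np.random.default_rng(5)
X = rng.standard_t(df=3, size=(300, 5)) @ np.diag([3.0, 2.0, 1.0, 1.0, 1.0])  # heavy-tailed
eigvals, eigvecs = np.linalg.eigh(multivariate_kendalls_tau(X))
print(eigvecs[:, ::-1][:, :2])                       # leading eigenvector estimates
```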
Deep-tissue anatomical imaging of mice using carbon nanotube fluorophores in the second near-infrared window
Fluorescent imaging in the second near-infrared window (NIR II, 1-1.4 μm) holds much promise due to minimal autofluorescence and tissue scattering. Here, using well-functionalized biocompatible single-walled carbon nanotubes (SWNTs) as NIR II fluorescent imaging agents, we performed high-frame-rate video imaging of mice during intravenous injection of SWNTs and investigated the path of SWNTs through the mouse anatomy. We observed in real time SWNT circulation through the lungs and kidneys several seconds postinjection, and spleen and liver at slightly later time points. Dynamic contrast-enhanced imaging through principal component analysis (PCA) was performed and found to greatly increase the anatomical resolution of organs as a function of time postinjection. Importantly, PCA was able to discriminate organs such as the pancreas, which could not be resolved from real-time raw images. Tissue phantom studies were performed to compare imaging in the NIR II region to the traditional NIR I biological transparency window (700-900 nm). Examination of the feature sizes of a common NIR I dye (indocyanine green) showed a more rapid loss of feature contrast and integrity with increasing feature depth as compared to SWNTs in the NIR II region. The effects of increased scattering in the NIR I versus NIR II region were confirmed by Monte Carlo simulation. In vivo fluorescence imaging in the NIR II region combined with PCA may represent a powerful approach to high-resolution optical imaging through deep tissues, useful for a wide range of applications from biomedical research to disease diagnostics.
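The dynamic contrast-enhanced PCA idea can be sketched generically: treat each pixel's intensity time course in a registered video as a feature vector, run PCA across pixels, and reshape the per-pixel scores back into images so regions with distinct kinetics separate into different components. The sketch below uses placeholder data and is not the study's processing pipeline.

```python
# Illustrative sketch of dynamic contrast-enhanced PCA on a fluorescence video.
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(6)
n_frames, height, width = 120, 64, 64
video = rng.random((n_frames, height, width))        # placeholder for registered frames

pixels_by_time = video.reshape(n_frames, -1).T       # (n_pixels, n_frames)
pca = PCA(n_components=3)
scores = pca.fit_transform(pixels_by_time)           # per-pixel scores on temporal PCs
component_images = scores.T.reshape(3, height, width)
print(component_images.shape)                        # (3, 64, 64) pseudo-color channels
```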
Applying dimension reduction to EEG data by Principal Component Analysis reduces the quality of its subsequent Independent Component decomposition
Independent Component Analysis (ICA) has proven to be an effective data-driven method for analyzing EEG data, separating signals from temporally and functionally independent brain and non-brain source processes and thereby increasing their definition. Dimension reduction by Principal Component Analysis (PCA) has often been recommended before ICA decomposition of EEG data, to reduce both the amount of data required and the computation time. Here we compared ICA decompositions of fourteen 72-channel single-subject EEG data sets obtained (i) after applying preliminary dimension reduction by PCA, (ii) after applying no such dimension reduction, or (iii) after applying PCA only. Reducing the data rank by PCA (even to remove only 1% of data variance) adversely affected both the numbers of dipolar independent components (ICs) and their stability under repeated decomposition. For example, decomposing a principal subspace retaining 95% of original data variance reduced the mean number of recovered ‘dipolar’ ICs from 30 to 10 per data set and reduced median IC stability from 90% to 76%. PCA rank reduction also decreased the numbers of near-equivalent ICs across subjects. For instance, decomposing a principal subspace retaining 95% of data variance reduced the number of subjects represented in an IC cluster accounting for frontal midline theta activity from 11 to 5. PCA rank reduction also increased uncertainty in the equivalent dipole positions and spectra of the IC brain effective sources. These results suggest that when applying ICA decomposition to EEG data, PCA rank reduction is best avoided.
Highlights:
• It is currently common practice to apply dimension reduction to EEG data using PCA before performing ICA decomposition.
• We tested the quality of Independent Components (ICs) after different levels of rank reduction to a principal subspace.
• PCA rank reduction adversely affected the dipolarity and stability of ICs accounting for brain and known non-brain processes.
• PCA rank reduction also increased inter-subject variance in IC source locations (by equivalent dipole fitting) and spectra.
• For EEG data at least, PCA rank reduction should be avoided or carefully tested before applying it as a preprocessing step.
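A generic illustration (not the study's own EEG pipeline) of the preprocessing choice being evaluated is given below: ICA after PCA rank reduction retaining roughly 95% of the variance versus ICA on the full-rank channel data, here with scikit-learn's FastICA on synthetic mixtures.

```python
# Generic sketch of "PCA rank reduction before ICA" versus "ICA on full-rank data".
import numpy as np
from sklearn.decomposition import PCA, FastICA

rng = np.random.default_rng(7)
n_samples, n_channels = 20000, 72
sources = rng.laplace(size=(n_samples, 8))            # a few non-Gaussian sources
mixing = rng.standard_normal((8, n_channels))
eeg = sources @ mixing + 0.5 * rng.standard_normal((n_samples, n_channels))

# (i) PCA rank reduction retaining ~95% of variance, then ICA on the reduced data
reduced = PCA(n_components=0.95, svd_solver="full").fit_transform(eeg)
ics_reduced = FastICA(random_state=0, max_iter=1000).fit_transform(reduced)

# (ii) ICA on the full-rank channel data
ics_full = FastICA(random_state=0, max_iter=1000).fit_transform(eeg)
print(ics_reduced.shape, ics_full.shape)
```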
Partially Linear Functional Additive Models for Multivariate Functional Data
We investigate a class of partially linear functional additive models (PLFAM) that predicts a scalar response by both parametric effects of a multivariate predictor and nonparametric effects of a multivariate functional predictor. We jointly model multiple functional predictors that are cross-correlated using multivariate functional principal component analysis (mFPCA), and model the nonparametric effects of the principal component scores as additive components in the PLFAM. To address the high-dimensional nature of functional data, we let the number of mFPCA components diverge to infinity with the sample size, and adopt the component selection and smoothing operator (COSSO) penalty to select relevant components and regularize the fitting. A fundamental difference between our framework and the existing high-dimensional additive models is that the mFPCA scores are estimated with error, and the magnitude of measurement error increases with the order of mFPCA. We establish the asymptotic convergence rate for our estimator, while allowing the number of components to diverge. When the number of additive components is fixed, we also establish the asymptotic distribution for the partially linear coefficients. The practical performance of the proposed methods is illustrated via simulation studies and a crop yield prediction application. Supplementary materials for this article are available online.
Functional models for time-varying random objects
Functional data analysis provides a popular toolbox of functional models for the analysis of samples of real-valued random functions. In recent years, samples of time-varying object data such as time-varying networks that are not in a vector space have been increasingly collected. These data can be viewed as elements of a general metric space that lacks local or global linear structure, and therefore common approaches that have been used with great success for the analysis of functional data, such as functional principal component analysis, cannot be applied. We propose metric covariance, a novel association measure for paired object data lying in a metric space (Ω, d), which we use to define a metric autocovariance function for a sample of random Ω-valued curves, where Ω generally will not have a vector space or manifold structure. The proposed metric autocovariance function is non-negative definite when the squared semimetric d² is of negative type. Then the eigenfunctions of the linear operator with the autocovariance function as kernel can be used as building blocks for an object functional principal component analysis for Ω-valued functional data, including time-varying probability distributions, covariance matrices and time dynamic networks. Analogues of functional principal components for time-varying objects are obtained by applying Fréchet means and projections of distance functions of the random object trajectories in the directions of the eigenfunctions, leading to real-valued Fréchet scores. Using the notion of generalized Fréchet integrals, we construct object functional principal components that lie in the metric space Ω. We establish asymptotic consistency of the sample-based estimators for the corresponding population targets under mild metric entropy conditions on Ω and continuity of the Ω-valued random curves. These concepts are illustrated with samples of time-varying probability distributions for human mortality, time-varying covariance matrices derived from trading patterns and time-varying networks that arise from New York taxi trips.