Robust depth-based estimation of the functional autoregressive model
KAUST DepartmentComputer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division
Online Publication Date2018-06-14
Print Publication Date2019-03
Permanent link to this recordhttp://hdl.handle.net/10754/628526
MetadataShow full item record
AbstractA robust estimator for functional autoregressive models is proposed, the Depth-based Least Squares (DLS) estimator. The DLS estimator down-weights the influence of outliers by using the functional directional outlyingness as a centrality measure. It consists of two steps: identifying the outliers with a two-stage functional boxplot, then down-weighting the outliers using the functional directional outlyingness. Theoretical properties of the DLS estimator are investigated such as consistency and boundedness of its influence function. Through a Monte Carlo study, it is shown that the DLS estimator performs better than estimators based on Principal Component Analysis (PCA) and robust PCA, which are the most commonly used. To illustrate a practical application, the DLS estimator is used to analyze a dataset of ambient CO concentrations in California.
CitationMartínez-Hernández I, Genton MG, González-Farías G (2018) Robust depth-based estimation of the functional autoregressive model. Computational Statistics & Data Analysis. Available: http://dx.doi.org/10.1016/j.csda.2018.06.003.
SponsorsThis research was partially supported by (1) CONACYT, México, scholarship as visiting research student, (2) CONACYT, México, CB-2015-01-252996, and (3) King Abdullah University of Science and Technology (KAUST). The authors thank the two anonymous referees for their valuable comments.
Showing items related by title, author, creator and subject.
Filtrated Common Functional Principal Components for Multivariate Functional dataJiao, Shuhao; Frostig, Ron D.; Ombao, Hernando (arXiv, 2021-06-02) [Preprint]Local field potentials (LFPs) are signals that measure electrical activity in localized cortical regions from multiple implanted tetrodes in the human or animal brain. They can be treated as multivariate functional data (i.e., curves observed at many tetrodes spread across a patch on the surface of the cortex). Most multivariate functional data contain both global features (which are shared in common to all curves) as well isolated features (common only to a small subset of curves). The goal is this paper is to develop a procedure for capturing this common features. We propose a novel tree-structured functional principal component (filt-fPC) model through low-dimensional functional representation, specifically via filtration. A popular approach to dimension reduction of functional data is functional principal components analysis (fPCA). Ordinary fPCA can only capture the major information of one population, but fail to reveal the similarity of variation pattern of different groups, which is potentially related to functional connectivity of brain. One major advantage of the proposed filt-fPC method is the ability to extracting components that are common to multiple groups, and meanwhile preserves the idiosyncratic individual features of different groups, leading to a parsimonious and interpretable low dimensional representation of multivariate functional data. Another advantage is that the extracted functional principal components satisfy the orthonormal property for each set, making filt-fPC scores easy to be obtained. The proposed filt-fPC method was employed to study the impact of a shock (induced stroke) on the functional organization structure of the rat brain. Finally we point to further directions as this filtration idea can also be generalized to other functional statistical models, such as functional regression, classification and functional times series models.
Spatial modelling of gully erosion using evidential belief function, logistic regression, and a new ensemble of evidential belief function-logistic regression algorithmArabameri, Alireza; Pradhan, Biswajeet; Rezaei, Khalil; Yamani, Mojtaba; Pourghasemi, Hamid Reza; Lombardo, Luigi (Land Degradation & Development, Wiley, 2018-09-25) [Article]This study aims to assess gully erosion susceptibility and delineate gully erosion-prone areas in Toroud Watershed, Semnan Province, Iran. Two different methods, namely, logistic regression (LR) and evidential belief function (EBF), were evaluated, and a new ensemble method was proposed using the combination of both methods. We initially created a gully erosion inventory map using different resources, including early reports, Google Earth images, and Global Positioning System-aided field surveys. We subsequently split this information randomly and selected 70% (90) of the gullies for calibration and 30% (38) for validation. The method was constructed using a combination of morphometric and thematic predictors that include 16 conditioning parameters. We also assessed the following: (a) potential multicollinearity issues using tolerance and variance inflation factor indices and (b) covariate effects using LR coefficients and EBF class weights. Results show that land use/land cover, lithology, and distance to roads dominate the method with the greatest effect on gully occurrences. We produced three susceptibility maps and evaluated their predictive power through area under the curve (AUC) and seed cell area index analyses. AUC results revealed that the ensemble method presented a considerably higher performance (AUC = 0.909) than did the individual LR (0.802) and EBF (0.821) methods. Similarly, seed cell area index displayed a constant decrease from the ensemble to single methods. The resulted gully erosion-susceptibility map could be used by decision makers and local managers for soil conservation, and for minimising damages in development activities including construction of infrastructures such as roads and the route of gas and electricity transmission lines.
QAUST: protein function prediction using structure similarity search, protein interaction and functional sequence motifsSmaili, Fatima Z.; Tian, Shuye; Roy, Ambrish; Alazmi, Meshari; Arold, Stefan T.; Mukherjee, Srayanta; Hefty, P. Scott; Chen, Wei; Gao, Xin (Accepted by Genomics, Proteomics, and Bioinformatics, Elsevier BV, 2020) [Article]The number of available protein sequences in public databases is increasing exponentially. However, a significant percentage of these sequences lack functional annotation, which is essential for the understanding of how biological systems operate. We propose a novel method, QAUST, to infer protein functions, specifically Gene Ontology (GO) terms and Enzyme Commission (EC) numbers. Our method uses three sources of information: structure information encoded by global and local structure similarity search, biological network information inferred by protein-protein interaction data, and sequence information extracted from functionally discriminative sequence motifs. The three pieces of information are combined by consensus averaging to make the final prediction. Our approach has been tested on 500 protein targets from the CAFA benchmark set. The results show that our method provides accurate functional annotation and outperforms other prediction methods based on sequence similarity search or threading. We further demonstrate that a previously unknown function of TRIM22 protein predicted by QAUST can be experimentally validated. Availability: http://www.cbrc.kaust.edu.sa/qaust/submit/.