# A nonparametric two-sample hypothesis testing problem for random dot product graphs

@article{Tang2014ANT, title={A nonparametric two-sample hypothesis testing problem for random dot product graphs}, author={Minh Tang and Avanti Athreya and Daniel Lewis Sussman and Vince Lyzinski and Carey E. Priebe}, journal={arXiv: Statistics Theory}, year={2014} }

nite-dimensional random dot product graphs have generating latent positions that are independently drawn from the same distribution, or distributions that are related via scaling or projection. We propose a test statistic that is a kernel-based function of the adjacency spectral embedding for each graph. We obtain a limiting distribution for our test statistic under the null and we show that our test procedure is consistent across a broad range of alternatives. 1. Introduction. The… Expand

#### 27 Citations

A Semiparametric Two-Sample Hypothesis Testing Problem for Random Graphs

- Mathematics
- 2017

ABSTRACT Two-sample hypothesis testing for random graphs arises naturally in neuroscience, social networks, and machine learning. In this article, we consider a semiparametric problem of two-sample… Expand

Empirical Bayes estimation for random dot product graph representation of the stochastic blockmodel

- Mathematics
- 2015

Network models are increasingly used to model datasets that involve interacting units, particularly random graph models where the vertices represent individual entities and the edges represent the… Expand

A central limit theorem for an omnibus embedding of multiple random graphs and implications for multiscale network inference

- Computer Science
- 2017

An "omnibus" embedding in which multiple graphs on the same vertex set are jointly embedded into a single space with a distinct representation for each graph is described, which achieves near-optimal inference accuracy and allows the identification of specific brain regions associated with population-level differences. Expand

On Estimation and Inference in Latent Structure Random Graphs

- Mathematics
- 2018

We define a latent structure model (LSM) random graph as a random dot product graph (RDPG) in which the latent position distribution incorporates both probabilistic and geometric constraints,… Expand

Improving Power of 2-Sample Random Graph Tests with Applications in Connectomics

- Computer Science
- 2019

It is shown that adapting multiscale graph correlation (MGC) to answer this question results in an equivalent test which outperforms several existing methods, and is demonstrated that on a real brain network, MGC is able to detect differences between two sides of a larval Drosophila brain network. Expand

STATISTICS OF DYNAMIC RANDOM NETWORKS: A DEPTH FUNCTION APPROACH.

- Computer Science, Physics
- 2015

This paper focuses on graphs with a fixed number of labeled nodes and introduces natural notions of center and a depth function for graphs that evolve in time, to develop several statistical techniques including testing, supervised and unsupervised classification, and a notion of principal component sets in the space of graphs. Expand

Beyond the adjacency matrix: random line graphs and inference for networks with edge attributes

- Computer Science, Mathematics
- ArXiv
- 2021

It is established that although naive spectral decompositions can fail to extract necessary signal for edge clustering, there exist signal-preserving singular subspaces of the line graph that can be recovered through a carefully-chosen projection, and one can consistently estimate edge latent positions in a random line graph. Expand

On spectral embedding performance and elucidating network structure in stochastic blockmodel graphs

- Mathematics, Computer Science
- Network Science
- 2019

Abstract Statistical inference on graphs often proceeds via spectral methods involving low-dimensional embeddings of matrix-valued graph representations such as the graph Laplacian or adjacency… Expand

Asymptotically efficient estimators for stochastic blockmodels: the naive MLE, the rank-constrained MLE, and the spectral

- Mathematics
- 2017

We establish asymptotic normality results for estimation of the block probability matrix $\mathbf{B}$ in stochastic blockmodel graphs using spectral embedding when the average degrees grows at the… Expand

Information Recovery in Shuffled Graphs via Graph Matching

- Mathematics, Computer Science
- IEEE Transactions on Information Theory
- 2018

An information theoretic foundation is provided for understanding the practical impact that errorfully observed vertex correspondences can have on subsequent inference, and the capacity of graph matching methods to recover the lost vertex alignment and inferential performance. Expand

#### References

SHOWING 1-10 OF 78 REFERENCES

Two-sample Hypothesis Testing for Random Dot Product Graphs via Adjacency Spectral Embedding

- Computer Science
- 2014

A valid test is proposed for the hypothesis that two finite-dimensional random dot product graphs on a common vertex set have the same generating latent positions or have Generating latent positions that are scaled or diagonal transformations of one another. Expand

A Kernel Two-Sample Test

- Mathematics, Computer Science
- J. Mach. Learn. Res.
- 2012

This work proposes a framework for analyzing and comparing distributions, which is used to construct statistical tests to determine if two samples are drawn from different distributions, and presents two distribution free tests based on large deviation bounds for the maximum mean discrepancy (MMD). Expand

A Consistent Adjacency Spectral Embedding for Stochastic Blockmodel Graphs

- Computer Science, Mathematics
- 2011

It is proved that this method to estimate block membership of nodes in a random graph generated by a stochastic blockmodel is consistent for assigning nodes to blocks, as only a negligible number of nodes will be misassigned. Expand

Consistent Latent Position Estimation and Vertex Classification for Random Dot Product Graphs

- Computer Science, Mathematics
- IEEE Transactions on Pattern Analysis and Machine Intelligence
- 2014

If class labels are observed for a number of vertices tending to infinity, then it is shown that the remaining vertices can be classified with error converging to Bayes optimal using the $(k)$-nearest-neighbors classification rule. Expand

Nonparametric estimation and testing of exchangeable graph models

- Computer Science
- AISTATS
- 2014

A specific estimator is built using the proposed 3-step procedure, which combines probability matrix estimation by Universal Singular Value Thresholding (USVT) and empirical degree sorting of the observed adjacency matrix, and it is proved that this estimation is consistent. Expand

A comparative power analysis of the maximum degree and size invariants for random graph inference

- Mathematics
- 2011

Abstract Let p , s ∈ ( 0 , 1 ] with s > p , let m , n ∈ N with 1 m n , and define V={1,…,n}. Let ER(n,p) denote the random graph model on V where each edge is independently included in the graph with… Expand

Universally consistent vertex classification for latent positions graphs

- Mathematics
- 2012

In this work we show that, using the eigen-decomposition of the adjacency matrix, we can consistently estimate feature maps for latent position graphs with positive definite link function $\kappa$,… Expand

A Limit Theorem for Scaled Eigenvectors of Random Dot Product Graphs

- Mathematics
- 2013

Abstract We prove a central limit theorem for the components of the largest eigenvectors of the adjacency matrix of a finite-dimensional random dot product graph whose true latent positions are… Expand

Equivalence of distance-based and RKHS-based statistics in hypothesis testing

- Mathematics, Computer Science
- ArXiv
- 2012

It is shown that the energy distance most commonly employed in statistics is just one member of a parametric family of kernels, and that other choices from this family can yield more powerful tests. Expand

Graph Kernels

- Computer Science, Mathematics
- J. Mach. Learn. Res.
- 2007

A unified framework to study graph kernels is presented and a kernel that is close to the optimal assignment kernel of kernel of Frohlich et al. (2006) yet provably positive semi-definite is provided. Expand