Should we condition on the number of points when modelling spatial point patterns?

International Statistical Review

Published On 2022/12

We discuss the practice of directly or indirectly assuming a model for the number of points when modelling spatial point patterns even though it is rarely possible to validate such a model in practice because most point pattern data consist of only one pattern. We therefore explore the possibility to condition on the number of points instead when fitting and validating spatial point process models. In a simulation study with different popular spatial point process models, we consider model validation using global envelope tests based on functional summary statistics. We find that conditioning on the number of points will for some functional summary statistics lead to more narrow envelopes and thus stronger tests and that it can also be useful for correcting for some conservativeness in the tests when testing composite hypothesis. However, for other functional summary statistics, it makes little or no difference to condition …

Journal

International Statistical Review

Published On

2022/12

Volume

90

Issue

3

Page

551-562

Authors

Jesper Møller

Jesper Møller

Aalborg Universitet

Position

Professor in Statistics

H-Index(all)

46

H-Index(since 2020)

23

I-10 Index(all)

0

I-10 Index(since 2020)

0

Citation(all)

0

Citation(since 2020)

0

Cited By

0

Research Interests

Mathematical Statistics

Probability Theory

University Profile Page

Other Articles from authors

Jesper Møller

Jesper Møller

Aalborg Universitet

arXiv preprint arXiv:2404.09525

Coupling results and Markovian structures for number representations of continuous random variables

A general setting for nested subdivisions of a bounded real set into intervals defining the digits of a random variable with a probability density function is considered. Under the weak condition that is almost everywhere lower semi-continuous, a coupling between and a non-negative integer-valued random variable is established so that have an interpretation as the ``sufficient digits'', since the distribution of conditioned on does not depend on . Adding a condition about a Markovian structure of the lengths of the intervals in the nested subdivisions, becomes a Markov chain of a certain order . If then are IID with a known distribution. When and the Markov chain is uniformly geometric ergodic, a coupling is established between and a random time so that the chain after time is stationary and follows a simple known distribution. The results are related to several examples of number representations generated by a dynamical system, including base- expansions, generalized L\"uroth series, -expansions, and continued fraction representations. The importance of the results and some suggestions and open problems for future research are discussed.

Jesper Møller

Jesper Møller

Aalborg Universitet

arXiv preprint arXiv:2404.08387

The asymptotic distribution of the scaled remainder for pseudo golden ratio expansions of a continuous random variable

Let be the base- expansion of a continuous random variable on the unit interval where is the positive solution to for an integer (i.e., is a generalization of the golden mean for which ). We study the asymptotic distribution and convergence rate of the scaled remainder when tends to infinity.

Jesper Møller

Jesper Møller

Aalborg Universitet

Methodology and Computing in Applied Probability

How many digits are needed?

Let be the digits in the base-q expansion of a random variable X defined on [0, 1) where is an integer. For , we study the probability distribution of the (scaled) remainder : If X has an absolutely continuous CDF then converges in the total variation metric to the Lebesgue measure on the unit interval. Under weak smoothness conditions we establish first a coupling between X and a non-negative integer valued random variable N so that follows and is independent of , and second exponentially fast convergence of and its PDF . We discuss how many digits are needed and show examples of our results.

Jesper Møller

Jesper Møller

Aalborg Universitet

arXiv preprint arXiv:2312.09652

The asymptotic distribution of the remainder in a certain base- expansion

Let be the base- expansion of a continuous random variable on the unit interval where is the golden ratio. We study the asymptotic distribution and convergence rate of the scaled remainder when tends to infinity.

2023/12/15

Article Details
Jesper Møller

Jesper Møller

Aalborg Universitet

Proceedings of the London Mathematical Society

Realizability and tameness of fusion systems

A saturated fusion system over a finite p$p$‐group S$S$ is a category whose objects are the subgroups of S$S$ and whose morphisms are injective homomorphisms between the subgroups satisfying certain axioms. A fusion system over S$S$ is realized by a finite group G$G$ if S$S$ is a Sylow p$p$‐subgroup of G$G$ and morphisms in the category are those induced by conjugation in G$G$. One recurrent question in this subject is to find criteria as to whether a given saturated fusion system is realizable or not. One main result in this paper is that a saturated fusion system is realizable if all of its components (in the sense of Aschbacher) are realizable. Another result is that all realizable fusion systems are tame: a finer condition on realizable fusion systems that involves describing automorphisms of a fusion system in terms of those of some group that realizes it. Stated in this way, these results depend on the …

Jesper Møller

Jesper Møller

Aalborg Universitet

ACM Transactions on Spatial Algorithms and Systems

Stochastic Routing with Arrival Windows

Arriving at a destination within a specific time window is important in many transportation settings. For example, trucks may be penalized for early or late arrivals at compact terminals, and early and late arrivals at general practitioners, dentists, and so on, are also discouraged, in part due to COVID. We propose foundations for routing with arrival-window constraints. In a setting where the travel time of a road segment is modeled by a probability distribution, we define two problems where the aim is to find a route from a source to a destination that optimizes or yields a high probability of arriving within a time window while departing as late as possible. In this setting, a core challenge is to enable comparison between paths that may potentially be part of a result path with the goal of determining whether a path is uninteresting and can be disregarded given the existence of another path. We show that existing solutions …

2023/11/21

Article Details
Jesper Møller

Jesper Møller

Aalborg Universitet

Spatial Statistics

Fitting the grain orientation distribution of a polycrystalline material conditioned on a Laguerre tessellation

The description of distributions related to grain microstructure helps physicists to understand the processes in materials and their properties. This paper presents a general statistical methodology for the analysis of crystallographic orientations of grains in a 3D Laguerre tessellation dataset which represents the microstructure of a polycrystalline material. We introduce complex stochastic models which may substitute expensive laboratory experiments: conditional on the Laguerre tessellation, we suggest interaction models for the distribution of cubic crystal lattice orientations, where the interaction is between pairs of orientations for neighbouring grains in the tessellation. We discuss parameter estimation and model comparison methods based on maximum pseudolikelihood as well as graphical procedures for model checking using simulations. Our methodology is applied for analysing a dataset representing a nickel …

Jesper Møller

Jesper Møller

Aalborg Universitet

Methodology and Computing in Applied Probability

Singular distribution functions for random variables with stationary digits

Let F be the cumulative distribution function (CDF) of the base-q expansion , where is an integer and is a stationary stochastic process with state space . In a previous paper we characterized the absolutely continuous and the discrete components of F. In this paper we study special cases of models, including stationary Markov chains of any order and stationary renewal point processes, where we establish a law of pure types: F is then either a uniform or a singular CDF on [0, 1]. Moreover, we study mixtures of such models. In most cases expressions and plots of F are given.

Jesper Møller

Jesper Møller

Aalborg Universitet

arXiv preprint arXiv:2212.08402

Cox processes driven by transformed Gaussian processes on linear networks

There is a lack of point process models on linear networks. For an arbitrary linear network, we use isotropic covariance functions with respect to the geodesic metric or the resistance metric to construct new models for isotropic Gaussian processes and hence new models for various Cox processes with isotropic pair correlation functions. In particular we introduce three model classes given by log Gaussian, interrupted, and permanental Cox processes on linear networks, and consider for the first time statistical procedures and applications for parametric families of such models. Moreover, we construct new simulation algorithms for Gaussian processes on linear networks and discuss whether the geodesic metric or the resistance metric should be used for the kind of Cox processes studied in this paper.

2022/12/16

Article Details
Jesper Møller

Jesper Møller

Aalborg Universitet

Stat

Determinantal shot noise Cox processes

We present a new class of cluster point process models, which we call determinantal shot noise Cox processes (DSNCP), with repulsion between cluster centres. They are the special case of generalized shot noise Cox processes where the cluster centres are determinantal point processes. We establish various moment results and describe how these can be used to easily estimate unknown parameters in two particularly tractable cases, namely, when the offspring density is isotropic Gaussian and the kernel of the determinantal point process of cluster centres is Gaussian or like in a scaled Ginibre point process. Through a simulation study and the analysis of a real point pattern data set, we see that when modelling clustered point patterns, a much lower intensity of cluster centres may be needed in DSNCP models as compared to shot noise Cox processes.

Jesper Møller

Jesper Møller

Aalborg Universitet

Journal of Applied Probability

Characterization of random variables with stationary digits

Let be an integer, a stochastic process with state space , and F the cumulative distribution function (CDF) of . We show that stationarity of is equivalent to a functional equation obeyed by F, and use this to characterize the characteristic function of X and the structure of F in terms of its Lebesgue decomposition. More precisely, while the absolutely continuous component of F can only be the uniform distribution on the unit interval, its discrete component can only be a countable convex combination of certain explicitly computable CDFs for probability distributions with finite support. We also show that is a Rajchman measure if and only if F is the uniform CDF on [0, 1].

Jesper Møller

Jesper Møller

Aalborg Universitet

Spatial Statistics

Fitting three-dimensional Laguerre tessellations by hierarchical marked point process models

We present a general statistical methodology for analysing a Laguerre tessellation data set viewed as a realization of a marked point process model. In the first step, for the points, we use a nested sequence of multiscale processes which constitute a flexible parametric class of pairwise interaction point process models. In the second step, for the marks/radii conditioned on the points, we consider various exponential family models where the canonical sufficient statistic is based on tessellation characteristics. For each step, parameter estimation based on maximum pseudolikelihood methods is tractable. For model selection, we consider maximized log pseudolikelihood functions for models of the radii conditioned on the points. Model checking is performed using global envelopes and corresponding tests in both steps and moreover by comparing observed and simulated tessellation characteristics in the second step …

Jesper Møller

Jesper Møller

Aalborg Universitet

Translational psychiatry

Layer III pyramidal cells in the prefrontal cortex reveal morphological changes in subjects with depression, schizophrenia, and suicide

Brodmann Area 46 (BA46) has long been regarded as a hotspot of disease pathology in individuals with schizophrenia (SCH) and major depressive disorder (MDD). Pyramidal neurons in layer III of the Brodmann Area 46 (BA46) project to other cortical regions and play a fundamental role in corticocortical and thalamocortical circuits. The AutoCUTS-LM pipeline was used to study the 3-dimensional structural morphology and spatial organization of pyramidal cells. Using quantitative light microscopy, we used stereology to calculate the entire volume of layer III in BA46 and the total number and density of pyramidal cells. Volume tensors estimated by the planar rotator quantified the volume, shape, and nucleus displacement of pyramidal cells. All of these assessments were carried out in four groups of subjects: controls (C, n = 10), SCH (n = 10), MDD (n = 8), and suicide subjects with a history of depression (SU …

Jesper Møller

Jesper Møller

Aalborg Universitet

Graphs and Combinatorics

Equivariant Euler characteristics of symplectic buildings

We compute the equivariant Euler characteristics of the buildings for the symplectic groups over finite fields.

Jesper Møller

Jesper Møller

Aalborg Universitet

Journal of Computational and Graphical Statistics

MCMC computations for Bayesian mixture models using repulsive point processes

Repulsive mixture models have recently gained popularity for Bayesian cluster detection. Compared to more traditional mixture models, repulsive mixture models produce a smaller number of well-separated clusters. The most commonly used methods for posterior inference either require to fix a priori the number of components or are based on reversible jump MCMC computation. We present a general framework for mixture models, when the prior of the “cluster centers” is a finite repulsive point process depending on a hyperparameter, specified by a density which may depend on an intractable normalizing constant. By investigating the posterior characterization of this class of mixture models, we derive a MCMC algorithm which avoids the well-known difficulties associated to reversible jump MCMC computation. In particular, we use an ancillary variable method, which eliminates the problem of having intractable …

Jesper Møller

Jesper Møller

Aalborg Universitet

Scandinavian Journal of Statistics

Approximate Bayesian inference for a spatial point process model exhibiting regularity and random aggregation

In this article, we propose a doubly stochastic spatial point process model with both aggregation and repulsion. This model combines the ideas behind Strauss processes and log Gaussian Cox processes. The likelihood for this model is not expressible in closed form but it is easy to simulate realizations under the model. We therefore explain how to use approximate Bayesian computation (ABC) to carry out statistical inference for this model. We suggest a method for model validation based on posterior predictions and global envelopes. We illustrate the ABC procedure and model validation approach using both simulated point patterns and a real data example.

Jesper Møller

Jesper Møller

Aalborg Universitet

Journal of Algebraic Combinatorics

Equivariant Euler characteristics of unitary buildings

The (p-primary) equivariant Euler characteristics of the buildings for the general unitary groups over finite fields are determined.

Jesper Møller

Jesper Møller

Aalborg Universitet

Communications Biology

Cellular 3D-reconstruction and analysis in the human cerebral cortex using automatic serial sections

Techniques involving three-dimensional (3D) tissue structure reconstruction and analysis provide a better understanding of changes in molecules and function. We have developed AutoCUTS-LM, an automated system that allows the latest advances in 3D tissue reconstruction and cellular analysis developments using light microscopy on various tissues, including archived tissue. The workflow in this paper involved advanced tissue sampling methods of the human cerebral cortex, an automated serial section collection system, digital tissue library, cell detection using convolution neural network, 3D cell reconstruction, and advanced analysis. Our results demonstrated the detailed structure of pyramidal cells (number, volume, diameter, sphericity and orientation) and their 3D spatial organization are arranged in a columnar structure. The pipeline of these combined techniques provides a detailed analysis of tissues …

Jesper Møller

Jesper Møller

Aalborg Universitet

AMERICAN MATHEMATICAL SOCIETY

THE NUMBER OF p-ELEMENTS IN FINITE GROUPS OF LIE TYPE OF CHARACTERISTIC p

The combinatorics of the poset of p-radical p-subgroups of a finite group is used to count the number of p-elements.

Other articles from International Statistical Review journal

Dankmar Boehning

Dankmar Boehning

University of Southampton

International Statistical Review

One-inflation and zero-truncation count data modelling revisited with a view on Horvitz-Thompson estimation of population size

One-inflation and zero-truncation count data modelling revisited with a view on Horvitz-Thompson estimation of population size — Graz University of Technology Skip to main navigation Skip to search Skip to main content Graz University of Technology Home Graz University of Technology Logo English Deutsch Home Persons Publications Organisations Projects Activities Prizes Press/Media Search by expertise, name or affiliation One-inflation and zero-truncation count data modelling revisited with a view on Horvitz-Thompson estimation of population size Dankmar Böhning * , Herwig Friedl * Corresponding author for this work Institute of Statistics (5060) Research output: Contribution to journal › Article › peer-review Overview Original language English Journal International Statistical Review Publication status Accepted/In press - 27 Jan 2024 Cite this APA Standard Harvard Vancouver Author BIBTEX RIS Powered …

Thai Nguyen

Thai Nguyen

Université Laval

International Statistical Review

On the inversion-free Newton’s method and its applications

In this paper, we survey the recent development of inversion‐free Newton's method, which directly avoids computing the inversion of Hessian, and demonstrate its applications in estimating parameters of models such as linear and logistic regression. A detailed review of existing methodology is provided, along with comparisons of various competing algorithms. We provide numerical examples that highlight some deficiencies of existing approaches, and demonstrate how the inversion‐free methods can improve performance. Motivated by recent works in literature, we provide a unified subsampling framework that can be combined with the inversion‐free Newton's method to estimate model parameters including those of linear and logistic regression. Numerical examples are provided for illustration.

Francesco Bartolucci

Francesco Bartolucci

Università degli Studi di Perugia

International Statistical Review

Maximum Likelihood Estimation of Multivariate Regime Switching Student‐t Copula Models

We propose a multivariate regime switching model based on a Student‐t$$ t $$ copula function with parameters controlling the strength of correlation between variables and that are governed by a latent Markov process. To estimate model parameters by maximum likelihood, we consider a two‐step procedure carried out through the Expectation–Maximisation algorithm. To address the main computational burden related to the estimation of the matrix of dependence parameters and the number of degrees of freedom of the Student‐t$$ t $$ copula, we show a novel use of the Lagrange multipliers, which simplifies the estimation process. The simulation study shows that the estimators have good finite sample properties and the estimation procedure is computationally efficient. An application concerning log‐returns of five cryptocurrencies shows that the model permits identifying bull and bear market periods based on …

Yajuan Si

Yajuan Si

University of Michigan

International Statistical Review

Nonresponse Bias Analysis in Longitudinal Studies: A Comparative Review with an Application to the Early Childhood Longitudinal Study

Longitudinal studies are subject to nonresponse when individuals fail to provide data for entire waves or particular questions of the survey. We compare approaches to nonresponse bias analysis (NRBA) in longitudinal studies and illustrate them on the Early Childhood Longitudinal Study, Kindergarten Class of 2010–2011 (ECLS‐K:2011). Wave nonresponse with attrition often yields a monotone missingness pattern, and the missingness mechanism can be missing at random (MAR) or missing not at random (MNAR). We discuss weighting, multiple imputation (MI), incomplete data modelling and Bayesian approaches to NRBA for monotone patterns. Weighting adjustments can be effective when the constructed weights are correlated with the survey outcome of interest. MI allows for variables with missing values to be included in the imputation model, yielding potentially less biased and more efficient estimates. We …

Louis-Paul Rivest

Louis-Paul Rivest

Université Laval

International Statistical Review

Small Sample Inference for Two‐Way Capture‐Recapture Experiments

The properties of the generalised Waring distribution defined on the non‐negative integers are reviewed. Formulas for its moments and its mode are given. A construction as a mixture of negative binomial distributions is also presented. Then we turn to the Petersen model for estimating the population size N$$ N $$ in a two‐way capture‐recapture experiment. We construct a Bayesian model for N$$ N $$ by combining a Waring prior with the hypergeometric distribution for the number of units caught twice in the experiment. Credible intervals for N$$ N $$ are obtained using quantiles of the posterior, a generalised Waring distribution. The standard confidence interval for the population size constructed using the asymptotic variance of Petersen estimator and 0.5 logit transformed interval are shown to be special cases of the generalised Waring credible interval. The true coverage of this interval is shown to be bigger …

Xiaofeng Shao

Xiaofeng Shao

University of Illinois at Urbana-Champaign

International Statistical Review

A Slicing-Free Perspective to Sufficient Dimension Reduction: Selective Review and Recent Developments

Since the pioneering work of sliced inverse regression, sufficient dimension reduction has been growing into a mature field in statistics and it has broad applications to regression diagnostics, data visualisation, image processing and machine learning. In this paper, we provide a review of several popular inverse regression methods, including sliced inverse regression (SIR) method and principal hessian directions (PHD) method. In addition, we adopt a conditional characteristic function approach and develop a new class of slicing‐free methods, which are parallel to the classical SIR and PHD, and are named weighted inverse regression ensemble (WIRE) and weighted PHD (WPHD), respectively. Relationship with recently developed martingale difference divergence matrix is also revealed. Numerical studies and a real data example show that the proposed slicing‐free alternatives have superior performance than …

Nhu Ngoc Nguyen

Nhu Ngoc Nguyen

University of Connecticut

International Statistical Review

On the Inversion‐Free Newton's Method and Its Applications

In this paper, we survey the recent development of inversion‐free Newton's method, which directly avoids computing the inversion of Hessian, and demonstrate its applications in estimating parameters of models such as linear and logistic regression. A detailed review of existing methodology is provided, along with comparisons of various competing algorithms. We provide numerical examples that highlight some deficiencies of existing approaches, and demonstrate how the inversion‐free methods can improve performance. Motivated by recent works in literature, we provide a unified subsampling framework that can be combined with the inversion‐free Newton's method to estimate model parameters including those of linear and logistic regression. Numerical examples are provided for illustration.

Justin Lars Kirkby

Justin Lars Kirkby

Georgia Institute of Technology

International Statistical Review

ON THE INVERSION-FREE NEWTON’S METHOD AND ITS APPLICATIONS

In this paper, we survey the recent development of inversion‐free Newton's method, which directly avoids computing the inversion of Hessian, and demonstrate its applications in estimating parameters of models such as linear and logistic regression. A detailed review of existing methodology is provided, along with comparisons of various competing algorithms. We provide numerical examples that highlight some deficiencies of existing approaches, and demonstrate how the inversion‐free methods can improve performance. Motivated by recent works in literature, we provide a unified subsampling framework that can be combined with the inversion‐free Newton's method to estimate model parameters including those of linear and logistic regression. Numerical examples are provided for illustration.

Roderick Little

Roderick Little

University of Michigan-Dearborn

International Statistical Review

Nonresponse Bias Analysis in Longitudinal Studies: A Comparative Review with an Application to the Early Childhood Longitudinal Study

Longitudinal studies are subject to nonresponse when individuals fail to provide data for entire waves or particular questions of the survey. We compare approaches to nonresponse bias analysis (NRBA) in longitudinal studies and illustrate them on the Early Childhood Longitudinal Study, Kindergarten Class of 2010–2011 (ECLS‐K:2011). Wave nonresponse with attrition often yields a monotone missingness pattern, and the missingness mechanism can be missing at random (MAR) or missing not at random (MNAR). We discuss weighting, multiple imputation (MI), incomplete data modelling and Bayesian approaches to NRBA for monotone patterns. Weighting adjustments can be effective when the constructed weights are correlated with the survey outcome of interest. MI allows for variables with missing values to be included in the imputation model, yielding potentially less biased and more efficient estimates. We …

Helton Saulo

Helton Saulo

Universidade de Brasília

International Statistical Review

Parametric Quantile Beta Regression Model

In this paper, we develop a fully parametric quantile regression model based on the generalised three‐parameter beta (GB3) distribution. Beta regression models are primarily used to model rates and proportions. However, these models are usually specified in terms of a conditional mean. Therefore, they may be inadequate if the observed response variable follows an asymmetrical distribution. In addition, beta regression models do not consider the effect of the covariates across the spectrum of the dependent variable, which is possible through the conditional quantile approach. In order to introduce the proposed GB3 regression model, we first reparameterise the GB3 distribution by inserting a quantile parameter, and then we develop the new proposed quantile model. We also propose a simple interpretation of the predictor–response relationship in terms of percentage increases/decreases of the quantile. A …

Duy Nguyen

Duy Nguyen

Marist College

International Statistical Review

On the inversion-free Newton’s method and its applications

In this paper, we survey the recent development of inversion‐free Newton's method, which directly avoids computing the inversion of Hessian, and demonstrate its applications in estimating parameters of models such as linear and logistic regression. A detailed review of existing methodology is provided, along with comparisons of various competing algorithms. We provide numerical examples that highlight some deficiencies of existing approaches, and demonstrate how the inversion‐free methods can improve performance. Motivated by recent works in literature, we provide a unified subsampling framework that can be combined with the inversion‐free Newton's method to estimate model parameters including those of linear and logistic regression. Numerical examples are provided for illustration.

Takis Merkouris

Takis Merkouris

Athens University of Economics and Business

International Statistical Review

On the Most Effective Use of Continuous Auxiliary Variables in Regression Estimation in Survey Sampling

Auxiliary variables with known population totals are extensively used in survey sampling to construct generalised regression (GR) estimators or optimal regression (OR) estimators of totals or means of study variables. This article explores the possibility of improving the efficiency of such estimators when continuous auxiliary variables are used in the regression estimation jointly with appropriate power functions of them, provided that the values of the auxiliary variables are known for all units in the population. The efficiency gain is determined analytically in the case of the OR estimator. A practical criterion for choosing the power functions that maximise the efficiency gain, involving the coefficient of determination in the regression fit of the study variable, is proposed for both the OR estimation and the more practicable, but generally less efficient, GR estimation. Furthermore, the effect of adding a power function of a …

Mamadou Yauck

Mamadou Yauck

McGill University

International Statistical Review

Small Sample Inference for Two‐Way Capture‐Recapture Experiments

The properties of the generalised Waring distribution defined on the non‐negative integers are reviewed. Formulas for its moments and its mode are given. A construction as a mixture of negative binomial distributions is also presented. Then we turn to the Petersen model for estimating the population size N$$ N $$ in a two‐way capture‐recapture experiment. We construct a Bayesian model for N$$ N $$ by combining a Waring prior with the hypergeometric distribution for the number of units caught twice in the experiment. Credible intervals for N$$ N $$ are obtained using quantiles of the posterior, a generalised Waring distribution. The standard confidence interval for the population size constructed using the asymptotic variance of Petersen estimator and 0.5 logit transformed interval are shown to be special cases of the generalised Waring credible interval. The true coverage of this interval is shown to be bigger …

Thinh Nguyen

Thinh Nguyen

Oregon State University

International Statistical Review

On the Inversion‐Free Newton's Method and Its Applications

In this paper, we survey the recent development of inversion‐free Newton's method, which directly avoids computing the inversion of Hessian, and demonstrate its applications in estimating parameters of models such as linear and logistic regression. A detailed review of existing methodology is provided, along with comparisons of various competing algorithms. We provide numerical examples that highlight some deficiencies of existing approaches, and demonstrate how the inversion‐free methods can improve performance. Motivated by recent works in literature, we provide a unified subsampling framework that can be combined with the inversion‐free Newton's method to estimate model parameters including those of linear and logistic regression. Numerical examples are provided for illustration.

David A. Stephens

David A. Stephens

McGill University

International Statistical Review

Survival modelling for data from combined cohorts: opening the door to meta survival analyses and survival analysis using electronic health records

Non‐parametric estimation of the survival function using observed failure time data depends on the underlying data generating mechanism, including the ways in which the data may be censored and/or truncated. For data arising from a single source or collected from a single cohort, a wide range of estimators have been proposed and compared in the literature. Often, however, it may be possible, and indeed advantageous, to combine and then analyse survival data that have been collected under different study designs. We review non‐parametric survival analysis for data obtained by combining the most common types of cohort. We have two main goals: (i) to clarify the differences in the model assumptions and (ii) to provide a single lens through which some of the proposed estimators may be viewed. Our discussion is relevant to the meta‐analysis of survival data obtained from different types of study, and to the …

David Tyler Frazier

David Tyler Frazier

Monash University

International Statistical Review

ABC of the future

Approximate Bayesian computation (ABC) has advanced in two decades from a seminal idea to a practically applicable inference tool for simulator‐based statistical models, which are becoming increasingly popular in many research domains. The computational feasibility of ABC for practical applications has been recently boosted by adopting techniques from machine learning to build surrogate models for the approximate likelihood or posterior and by the introduction of a general‐purpose software platform with several advanced features, including automated parallelisation. Here we demonstrate the strengths of the advances in ABC by going beyond the typical benchmark examples and considering real applications in astronomy, infectious disease epidemiology, personalised cancer therapy and financial prediction. We anticipate that the emerging success of ABC in producing actual added value and …

Gael M. Martin

Gael M. Martin

Monash University

International Statistical Review

ABC of the future

Approximate Bayesian computation (ABC) has advanced in two decades from a seminal idea to a practically applicable inference tool for simulator‐based statistical models, which are becoming increasingly popular in many research domains. The computational feasibility of ABC for practical applications has been recently boosted by adopting techniques from machine learning to build surrogate models for the approximate likelihood or posterior and by the introduction of a general‐purpose software platform with several advanced features, including automated parallelisation. Here we demonstrate the strengths of the advances in ABC by going beyond the typical benchmark examples and considering real applications in astronomy, infectious disease epidemiology, personalised cancer therapy and financial prediction. We anticipate that the emerging success of ABC in producing actual added value and …

Jon Wakefield

Jon Wakefield

University of Washington

International Statistical Review

A Spatial Variance‐Smoothing Area Level Model for Small Area Estimation of Demographic Rates

Accurate estimates of subnational health and demographic indicators are critical for informing policy. Many countries collect relevant data using complex household surveys, but when data are limited, direct weighted estimates of small area proportions may be unreliable. Area level models treating these direct estimates as response data can improve precision but often require known sampling variances of the direct estimators for all areas. In practice, the sampling variances are estimated, so standard approaches do not account for a key source of uncertainty. To account for variability in the estimated sampling variances, we propose a hierarchical Bayesian spatial area level model for small area proportions that smooths both the estimated proportions and sampling variances to produce point and interval estimates of rates of interest. We demonstrate the performance of our approach via simulation and application …

Grace Heron

Grace Heron

Queensland University of Technology

International Statistical Review

Increasing trust in new data sources: crowdsourcing image classification for ecology

Crowdsourcing methods facilitate the production of scientific information by non‐experts. This form of citizen science (CS) is becoming a key source of complementary data in many fields to inform data‐driven decisions and study challenging problems. However, concerns about the validity of these data often constrain their utility. In this paper, we focus on the use of citizen science data in addressing complex challenges in environmental conservation. We consider this issue from three perspectives. First, we present a literature scan of papers that have employed Bayesian models with citizen science in ecology. Second, we compare several popular majority vote algorithms and introduce a Bayesian item response model that estimates and accounts for participants' abilities after adjusting for the difficulty of the images they have classified. The model also enables participants to be clustered into groups based on ability …

Marc G Genton

Marc G Genton

King Abdullah University of Science and Technology

International Statistical Review

Are you all normal? It depends!

The assumption of normality has underlain much of the development of statistics, including spatial statistics, and many tests have been proposed. In this work, we focus on the multivariate setting and first review the recent advances in multivariate normality tests for i.i.d. data, with emphasis on the skewness and kurtosis approaches. We show through simulation studies that some of these tests cannot be used directly for testing normality of spatial data. We further review briefly the few existing univariate tests under dependence (time or space), and then propose a new multivariate normality test for spatial data by accounting for the spatial dependence. The new test utilises the union‐intersection principle to decompose the null hypothesis into intersections of univariate normality hypotheses for projection data, and it rejects the multivariate normality if any individual hypothesis is rejected. The individual hypotheses for …