Time series are often subject to abrupt changes in level, which are generally represented by Markov switching (MS) models under the assumption that the level is constant within each state (regime). This framework is not realistic, because the level can also change within a regime, with jumps that are minor relative to a change of state; this is typical of many economic time series, such as the Gross Domestic Product (GDP) or the volatility of financial markets. We propose to make the state flexible by introducing a very general model in which the level of the time series oscillates within each state of the MS model; these movements are driven by a forcing variable. The new model accommodates extreme jumps in a parsimonious way, without requiring a large number of regimes (in our examples, two-state MS models are used). Moreover, the model improves interpretability and, in particular, out-of-sample performance relative to the most widely used alternative models. The approach can be applied in several fields, also using unobservable data. We show its advantages in three distinct applications that extend particular MS models, involving macroeconomic variables, volatilities of financial markets and conditional correlations.
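As a point of reference, a minimal sketch of the baseline two-state MS model with a constant level per regime (the restriction this abstract's model relaxes) can be simulated as follows; all parameter values are illustrative, not taken from the article.

```python
import numpy as np

# Baseline two-state Markov switching model: constant level within each regime,
# Gaussian noise around it. All numbers are illustrative.
rng = np.random.default_rng(0)
P = np.array([[0.95, 0.05],   # transition probabilities; rows sum to 1
              [0.10, 0.90]])
mu = np.array([0.0, 3.0])     # state-specific levels
T = 300
states = np.empty(T, dtype=int)
y = np.empty(T)
s = 0
for t in range(T):
    s = rng.choice(2, p=P[s])           # draw next regime
    states[t] = s
    y[t] = mu[s] + 0.3 * rng.normal()   # constant level + noise
```

In the article's extension, `mu[s]` would itself move within a regime, driven by a forcing variable, rather than staying fixed as here.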
Sequential regression approaches can be used to analyze processes in which covariates are revealed in stages. Such processes occur widely, with examples including medical intervention, sports contests and political campaigns. The naïve sequential approach involves fitting regression models using the covariates revealed by the end of the current stage, but this is only practical if the number of covariates is not too large. An alternative approach is to incorporate the score (linear predictor) from the model developed at the previous stage as a covariate at the current stage. This score takes into account the history of the process prior to the stage under consideration. However, the score is a function of fitted parameter estimates and, therefore, contains measurement error. In this article, we propose a novel technique to account for error in the score. The approach is demonstrated with application to the sprint event in track cycling and is shown to reduce bias in the estimated effect of the score and avoid unrealistically extreme predictions.
Bayesian penalized splines (P-splines) assume an intrinsic Gaussian Markov random field prior on the spline coefficients, conditional on a precision hyper-parameter
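The intrinsic Gaussian Markov random field prior mentioned here has precision matrix proportional to `D'D` for a difference matrix `D`; a minimal numpy sketch (difference order and basis size chosen for illustration) shows why the prior is intrinsic, i.e., improper:

```python
import numpy as np

n = 10                               # number of spline coefficients (illustrative)
D = np.diff(np.eye(n), n=2, axis=0)  # second-order difference matrix, shape (n-2, n)
K = D.T @ D                          # prior precision is tau * K; tau is the hyper-parameter
# K is rank-deficient by the penalty order (here 2), so the prior is flat
# over constant and linear trends in the coefficients: an intrinsic GMRF.
rank = np.linalg.matrix_rank(K)      # equals n - 2
```

Larger values of the precision hyper-parameter shrink the fitted spline towards a polynomial of degree one below the penalty order.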
The evaluation of peritoneal dialysis (PD) programmes requires the use of statistical methods that suit the complexity of such programmes. Multi-state regression models taking competing risks into account are a good example of suitable approaches. In this work, multi-state structured additive regression (STAR) models combined with penalized splines (P-splines) are proposed to evaluate peritoneal dialysis programmes. These models are very flexible since they may consider smooth estimates of baseline transition intensities and the inclusion of time-varying and smooth covariate effects at each transition. A key issue in survival analysis is the quantification of the time-dependent predictive accuracy of a given regression model, which is typically assessed using receiver operating characteristic (ROC)’based methodologies. The main objective of the present study is to adapt the concept of time-dependent ROC curve, and their corresponding area under the curve (AUC), to a multi-state competing risks framework. All statistical methodologies discussed in this work were applied to PD survival data. Using a multi-state competing risks framework, this study explored the effects of major clinical covariates on survival such as age, sex, diabetes and previous renal replacement therapy. Such multi-state model was composed of one transient state (peritonitis) and several absorbing states (death, transfer to haemodialysis and renal transplantation). The application of STAR models combined with time-dependent ROC curves revealed important conclusions not previously reported in the nephrology literature when using standard statistical methodologies. For practical application, all the statistical methods proposed in this article were implemented in
The shared frailty model is a popular tool to analyze correlated right-censored time-to-event data. In the shared frailty model, the latent frailty is assumed to be shared by the members of a cluster and is assigned a parametric distribution, typically a gamma distribution due to its conjugacy. In the case of interval-censored time-to-event data, the inclusion of frailties results in complicated intractable likelihoods. Here, we propose a flexible frailty model for analyzing such data by assuming a smooth semi-parametric form for the conditional time-to-event distribution and a parametric or a flexible form for the frailty distribution. The results of a simulation study suggest that the estimation of regression parameters is robust to misspecification of the frailty distribution (even when the frailty distribution is multimodal or skewed). Given sufficiently large sample sizes and number of clusters, the flexible approach produces smooth and accurate posterior estimates for the baseline survival function and for the frailty density, and it can correctly detect and identify unusual frailty density forms. The methodology is illustrated using dental data from the Signal Tandmobiel
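To make the shared-frailty structure concrete, the following hedged sketch simulates clustered event times with a gamma frailty of mean 1 (the usual identifiability constraint); it is a data-generating illustration only, not the article's semi-parametric estimator, and all parameter values are invented.

```python
import numpy as np

rng = np.random.default_rng(42)
n_clusters, cluster_size = 5000, 2
theta = 0.25                         # frailty variance; frailty mean fixed at 1
# Gamma frailty with mean 1 and variance theta: shape 1/theta, scale theta
w = rng.gamma(shape=1 / theta, scale=theta, size=n_clusters)
base_rate = 0.1                      # baseline exponential hazard (illustrative)
# Conditional on w_i, all members of cluster i share the hazard w_i * base_rate
times = rng.exponential(scale=1 / (w[:, None] * base_rate),
                        size=(n_clusters, cluster_size))
```

Members of a high-frailty cluster all tend to fail early, which is exactly the positive within-cluster dependence the shared frailty model is designed to capture.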
Index measures are commonly used in medical research and clinical practice, primarily for quantification of health risks in individual subjects or patients. The utility of an index measure is ultimately contingent on its ability to predict health outcomes. Construction of medical indices has largely been based on heuristic arguments, although the acceptance of a new index typically requires objective validation, preferably with multiple outcomes. In this article, we propose an analytical tool for index development and validation. We use a multivariate single-index model to ascertain the best functional form for risk index construction. Methodologically, the proposed model represents a multivariate extension of the traditional single-index models. Such an extension is important because it assures that the resultant index simultaneously works for multiple outcomes. The model is developed in the general framework of longitudinal data analysis. We use penalized cubic splines to characterize the index components while leaving the other subject characteristics as additive components. The splines are estimated directly by penalized nonlinear least squares, and we show that the model can be implemented using existing software. To illustrate, we examine the formation of an adiposity index for prediction of systolic and diastolic blood pressure in children. We assess the performance of the method through a simulation study.
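The single-index idea underlying this abstract can be sketched in a deliberately simplified form: one outcome, a crude polynomial stand-in for the unknown link `g`, and a grid profile over the index direction (the article's actual model is multivariate, longitudinal and spline-based; everything below is an illustrative assumption).

```python
import numpy as np

rng = np.random.default_rng(7)
n = 400
X = rng.normal(size=(n, 2))
alpha_true = np.array([0.6, 0.8])            # unit-norm index direction
y = np.sin(X @ alpha_true) + 0.1 * rng.normal(size=n)

def profile_rss(theta):
    """RSS after fitting g(.) to the 1-D index X @ alpha(theta)."""
    alpha = np.array([np.cos(theta), np.sin(theta)])
    B = np.vander(X @ alpha, 6)              # crude degree-5 polynomial basis for g
    coef, *_ = np.linalg.lstsq(B, y, rcond=None)
    return np.sum((y - B @ coef) ** 2)

grid = np.linspace(0.0, np.pi, 181)          # alpha is identified only up to sign
theta_hat = grid[np.argmin([profile_rss(t) for t in grid])]
alpha_hat = np.array([np.cos(theta_hat), np.sin(theta_hat)])
```

The profiled direction `alpha_hat` plays the role of the risk index: a single linear score that drives the outcome through an estimated, possibly nonlinear, function.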
In categorical data analysis, several regression models have been proposed for hierarchically structured responses, such as the nested logit model, the two-step model or the partitioned conditional model for partially ordered sets. The specifications of these models are heterogeneous and they have been formally defined for only two or three levels in the hierarchy. Here, we introduce the class of partitioned conditional generalized linear models (PCGLMs) that encompasses all these models and is defined for any number of levels in the hierarchy. The hierarchical structure of these models is fully specified by a partition tree of categories. Using the genericity of the recently introduced
To represent the complex structure of intensive longitudinal data of multiple individuals, we propose a hierarchical Bayesian Dynamic Model (BDM). This BDM is a generalized linear hierarchical model where the individual parameters do not necessarily follow a normal distribution. The model parameters can be estimated on the basis of relatively small sample sizes and in the presence of missing time points. We present the BDM and discuss model identification, convergence and selection. The use of the BDM is illustrated using data from a randomized clinical trial to study the differential effects of three treatments for panic disorder. The data involve the number of panic attacks experienced weekly (73 individuals, 10–52 time points) during treatment. Presuming that the counts are Poisson distributed, the BDM considered involves a linear trend model with an exponential link function. The final model included a moving average parameter and an external variable (duration of symptoms pre-treatment). Our results show that cognitive behavioural therapy is less effective in reducing panic attacks than selective serotonin re-uptake inhibitors or a combination of both. Post hoc analyses revealed that males show a slightly higher number of panic attacks at the onset of treatment than females.
In rating surveys, people are requested to express preferences on several aspects related to a topic by selecting a category in an ordered scale. For such data, we propose a model defined by a mixture of a uniform distribution and a Sarmanov distribution with CUB (combination of uniform and shifted binomial) marginal distributions (
This study examines the efficacy of tort reforms instituted throughout the country during the last decade, improving upon existing semiparametric density ratio estimation (DRE) methodologies in the process. DRE is a well-known semiparametric modelling technique that has been used for well over two decades. Although the approach has been demonstrated to be extremely useful in statistical modelling, it has suffered from one main limitation—the methodology has thus far not been capable of modelling individual-level heterogeneity. We address this issue by presenting a novel adaptation of DRE to model individual-level heterogeneity. We do so by marginalizing the associated empirical likelihood function involving density ratios to provide an overall distribution of the entire population despite having extremely limited initial information about each individual in the dataset. We apply this approach to medical malpractice loss data from the previous decade to quantify the probability of changes in tort losses. Our results demonstrate the success of a number of recently implemented malpractice reforms. Comparisons to existing DRE methods, as well as standard regression methods, illustrate the efficacy of our approach.
Representing the conditional mean in Poisson regression directly as a sum of smooth components can provide a realistic model of the data generating process. Here, we present an approach that allows such an additive decomposition of the expected values of counts. The model can be formulated as a penalized composite link model and can, therefore, be estimated by a modified iteratively weighted least-squares algorithm. Further shape constraints on the smooth additive components can be enforced by additional penalties, and the model is extended to two dimensions. We present two applications that motivate the model and demonstrate the versatility of the approach.
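When the composition matrix of the penalized composite link model is the identity, the approach reduces to penalized Poisson smoothing fitted by the modified iteratively weighted least-squares algorithm mentioned above; a minimal sketch of that special case (simulated counts, arbitrary penalty weight) follows.

```python
import numpy as np

rng = np.random.default_rng(1)
x = np.arange(50)
y = rng.poisson(np.exp(1.0 + np.sin(x / 8.0)))     # simulated counts

n = len(y)
D = np.diff(np.eye(n), n=2, axis=0)                # second-order differences
P = 2.0 * (D.T @ D)                                # smoothness penalty (lambda = 2, arbitrary)
eta = np.log(y + 1.0)                              # starting linear predictor
for _ in range(50):                                # penalized IWLS iterations
    mu = np.exp(eta)
    W = np.diag(mu)                                # Poisson working weights
    z = eta + (y - mu) / mu                        # working response
    eta = np.linalg.solve(W + P, W @ z)            # penalized weighted least squares
fitted = np.exp(eta)                               # smooth expected counts
```

Shape constraints or a non-identity composition matrix would enter this scheme as additional penalty terms and a redefinition of the working design, respectively.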
In the last two decades, regularization techniques, in particular penalty-based methods, have become very popular in statistical modelling. Driven by technological developments, most approaches have been designed for high-dimensional problems with metric variables, whereas categorical data has largely been neglected. In recent years, however, it has become clear that regularization is also very promising when modelling categorical data. A specific trait of categorical data is that many parameters are typically needed to model the underlying structure. This results in complex estimation problems that call for structured penalties which are tailored to the categorical nature of the data. This article gives a systematic overview of penalty-based methods for categorical data developed so far and highlights some issues where further research is needed. We deal with categorical predictors as well as models for categorical response variables. The primary interest of this article is to give insight into basic properties of and differences between methods that are important with respect to statistical modelling in practice, without going into technical details or extensive discussion of asymptotic properties.
This is a discussion on the article ‘Regularized Regression for Categorical Data’ by Tutz and Gertheiss.
Oracle inequalities provide probability loss bounds for the lasso estimator at a deterministic choice of the regularization parameter and are commonly cited as theoretical justification for the lasso and its ability to handle high-dimensional settings. Unfortunately, in practice, the regularization parameter is not selected to be a deterministic quantity, but is instead chosen using a random, data-dependent procedure, often making these inequalities misleading in their implications. We discuss general results and demonstrate empirically for data using categorical predictors that the amount of deterioration in performance of the lasso as the number of unnecessary predictors increases can be far worse than the oracle inequalities suggest, but imposing structure on the form of the estimates can reduce this deterioration substantially.
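At the core of the lasso estimator discussed above is the soft-thresholding operator; a hedged, plain coordinate-descent sketch (no intercept, standardized columns assumed, names our own) makes the role of the regularization parameter explicit.

```python
import numpy as np

def soft_threshold(z, lam):
    """Proximal operator of lam * |.|: shrink towards zero, clip at zero."""
    return np.sign(z) * np.maximum(np.abs(z) - lam, 0.0)

def lasso_cd(X, y, lam, n_iter=200):
    """Plain coordinate descent for (1/2n)||y - Xb||^2 + lam * ||b||_1."""
    n, p = X.shape
    beta = np.zeros(p)
    for _ in range(n_iter):
        for j in range(p):
            r = y - X @ beta + X[:, j] * beta[j]      # partial residual for column j
            num = X[:, j] @ r / n
            beta[j] = soft_threshold(num, lam) / (X[:, j] @ X[:, j] / n)
    return beta
```

For any `lam` above `max_j |x_j' y| / n` the solution is exactly zero; the data-dependent selection the abstract criticizes corresponds to wrapping a call like this in a cross-validated loop over a `lam` grid, which is what breaks the deterministic premise of the oracle inequalities.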
This is a discussion of the article ‘Regularized Regression for Categorical Data’ by Tutz and Gertheiss. As part of the discussion, I raise some questions that may suggest future research work.
<abstract>
An index for characterizing the separation of two distributions is introduced. It is applied to assessing whether mixture components are clusters. A related property of being a satellite and a partial ordering of the components are defined. A sequence of clustering structures is defined for a finite mixture with a continuum of thresholds that qualify a cluster. The approach is suitable for outcomes with arbitrary univariate or multivariate distributions and their mixtures. The properties of the index are explored through simulations and examples.
</abstract>
<abstract>
In the context of mixture models with random covariates, this article presents the polynomial Gaussian cluster-weighted model (CWM). It extends the linear Gaussian CWM, for bivariate data, in a twofold way. First, it allows for possible nonlinear dependencies in the mixture components by considering a polynomial regression. Second, it is not restricted to model-based clustering, being instead framed within the more general model-based classification framework. Maximum likelihood parameter estimates are derived using the EM algorithm, and model selection is carried out using the Bayesian information criterion (BIC) and the integrated completed likelihood (ICL). The article also investigates the conditions under which the posterior probabilities of component membership from a polynomial Gaussian CWM coincide with those of other well-established mixture models related to it. When applied to artificial and real data, the polynomial Gaussian CWM is shown to outperform the mixture of polynomial Gaussian regressions, its natural competitor in the class of mixture models with fixed covariates.
</abstract>
<abstract>
We present a simple and effective iterative procedure to estimate segmented mixed models in a likelihood-based framework. Random effects and covariates are allowed for each model parameter, including the changepoint. The method is practical and avoids the computational burdens related to estimation of nonlinear mixed effects models. A conventional linear mixed model with proper covariates that account for the changepoints is the key to our estimating algorithm. We illustrate the method via simulations and using data from a randomized clinical trial focused on change in depressive symptoms over time which characteristically show two separate phases of change.
</abstract>
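The key device of the preceding abstract, a linear model with working covariates that account for the changepoint, iterated to convergence, can be sketched in the fixed-effects case (a Muggeo-type update; all data and starting values below are illustrative, and the random effects of the actual model are omitted).

```python
import numpy as np

rng = np.random.default_rng(3)
x = np.linspace(0.0, 10.0, 200)
psi_true = 4.0
# Segmented line: slope changes from 0.5 to 2.0 at the changepoint psi_true
y = 1.0 + 0.5 * x + 1.5 * np.maximum(x - psi_true, 0.0) + 0.2 * rng.normal(size=x.size)

psi = 2.0                                   # starting changepoint guess
for _ in range(30):
    U = np.maximum(x - psi, 0.0)            # slope-change basis at current psi
    V = -(x > psi).astype(float)            # working covariate: d U / d psi
    Z = np.column_stack([np.ones_like(x), x, U, V])
    b = np.linalg.lstsq(Z, y, rcond=None)[0]
    psi = np.clip(psi + b[3] / b[2],        # update: psi + gamma_hat / beta_hat
                  x.min() + 0.5, x.max() - 0.5)
```

At convergence the coefficient of `V` vanishes and `psi` sits at the estimated changepoint; the mixed-model version replaces the least-squares fit with a linear mixed model so that the changepoint itself may carry a random effect.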
This article deals with the analysis of sensitivity to non-ignorability of the dropout process in joint models (JMs). We investigate the behaviour of the maximum likelihood estimates for the longitudinal process in a neighbourhood of ignorability through the Index of Local Sensitivity to Non-Ignorability (ISNI). Some concerns may arise because the ISNI is an absolute measure of the change in parameter estimates induced by departures from the missing at random (MAR) assumption; for this reason, we introduce a relative index based on the ratio between the ISNI and a measure of its variability under the MAR assumption, highlighting the interpretation and potential drawbacks of this approach. The local sensitivity of the JM and the performance of the relative index are discussed in a simulation study, varying the number of repeated measurements per individual and the random effect covariance structure. The approach is also applied to a benchmark dataset on Primary Biliary Cirrhosis (PBC).