nav emailalert searchbtn searchbox tablepage yinyongbenwen piczone journalimg journalInfo journalinfonormal searchdiv searchzone qikanlogo popupnotification paper paperNew
2025, 08, v.42 135-146
几何回归模型中的零膨胀检验及应用
基金项目(Foundation): 新疆维吾尔自治区研究生科研创新项目“零膨胀几何回归的统计推断及应用研究”(XJ2025G219); 北京市自然科学基金项目“函数型分层分位回归建模理论方法及应用”(1242005)
邮箱(Email): mztian@ruc.edu.cn;
DOI: 10.19343/j.cnki.11-1302/c.2025.08.011
摘要:

几何回归模型是处理计数数据的有效工具,但该类数据往往包含大量零值,正确识别零膨胀特征对于选择合适的模型至关重要,过量的零值会导致几何回归模型出现推断偏差,而不必要地采用复杂的零膨胀几何回归模型可能会导致解释和计算上的困难,从而影响模型的实用性和准确性。为此,本文提出一种新的检验方法,用于判断几何回归模型中是否存在零膨胀特征。该方法的核心在于比较实际观察到的零值与几何回归模型预测的零值的差异,进而基于估计方程得到检验统计量的显式表达式及其渐近性质。通过新检验方法与Wald、Likelihood Ratio(LR)和Score检验进行模拟研究对比,结果表明新检验方法在多数情况下具有显著优势,特别是在控制第一类错误方面。最后,本文通过实证分析验证本文方法的有效性和实用性。

Abstract:

Geometric regression models are effective for modeling count data, which often contain an excess of zero values. Correctly identifying zero inflation is crucial for selecting an appropriate model, as excessive zeros can lead to bias in the inference of the geometric model, and the unnecessary adoption of a complex zero-inflated geometric regression model may lead to interpretative and computational difficulties, which may affect the practicality and accuracy of the model. To address this, we propose a novel test for detecting zero-inflation within geometric regression models. The core of our method involves comparing the number of observed zeros with the number predicted by the geometric regression model. Based on estimating equations, we derive the explicit expression of the test statistic and its asymptotic properties. Simulation studies comparing the new test with Wald, Likelihood Ratio(LR), and Score tests demonstrate that the proposed method offers significant advantages in most cases, particularly in controlling the Type I error rate. Finally, an empirical analysis validates the effectiveness and practical utility of our approach.

参考文献

[1]陈钰晓,赵绍阳.助力健康老龄化:长期照护保险的健康价值研究[J].统计研究, 2024, 41(3):140–152.

[2]高雅倩,孟生旺.双参数Tweedie机器学习模型及其精算应用[J].统计研究, 2024, 41(4):126–140.

[3]胡亚南,田茂再.零膨胀计数数据的联合建模及变量选择[J].统计研究, 2019, 36(1):104–114.

[4]解峰昌,韦博成,林金官.零过多数据的统计分析及其应用[M].北京:科学出版社, 2013.

[5]乔新惠.计数模型下零膨胀统计检验研究[D].对外经济贸易大学, 2023.

[6]王平鲜.零膨胀模型及检验方法的比较研究[D].贵州民族大学, 2017.

[7]王芝皓,刘艳霞,田茂再,等.零膨胀计数数据函数型部分变系数模型[J].统计研究, 2021, 38(7):127–139.

[8]肖翔. 0–1膨胀几何分布回归模型及其应用[J].系统科学与数学, 2019, 39(9):1486–1499.

[9]许杰.多层零过多计数数据的参数估计及假设检验[D].南京师范大学, 2017.

[10] Famoye F, Lee C. Exponentiated-exponential Geometric Regression Model[J]. Journal of Applied Statistics, 2017, 44(16):2963–2977.

[11] Hall D B. Zero-inflated Poisson and Binomial Regression with Random Effects:A Case Study[J]. Biometrics, 2000, 56(4):1030–1039.

[12] He H, Tang W, Kelly T, et al. Statistical Tests for Latent Class in Censored Data Due to Detection Limit[J]. Statistical Methods in Medical Research, 2020, 29(8):2179–2197.

[13] He H, Zhang H, Ye P, et al. A Test of Inflated Zeros for Poisson Regression Models[J]. Statistical Methods in Medical Research, 2019, 28(4):1157–1169.

[14] Iwunor C C. Estimation of Parameters of the Inflated Geometric Distribution for Rural Out-Migration[J]. Genus, 1995, 51(3–4):253–260.

[15] Lambert D. Zero-Inflated Poisson Regression, With an Application to Defects in Manufacturing[J]. Technometrics, 1992, 34(1):1–14.

[16] Nagesh S, Nanjundan G, Suresh R, et al. A Characterization of Zero-inflated Geometric Model[J]. International Journal of Mathematics Trends and Technology, 2015, 23(1):71–73.

[17] Sakamoto Y, Kitagawa G. Akaike Information Criterion Statistics[M]. Dordrecht:D.Reidel Publishing Company, 1986.

[18] Sari D N, Purhadi P, Rahayu S P, et al. Estimation and Hypothesis Testing for the Parameters of Multivariate Zero Inflated Generalized Poisson Regression Model[J]. Symmetry, 2021, 13(10):1876.

[19] Satheesh Kumar C, Ramachandran R. On Some Aspects of a Zero-inflated Overdispersed Model and Its Applications[J]. Journal of Applied Statistics, 2020, 47(3):506–523.

[20] Schwarz G. Estimating the Dimension of a Model[J]. The Annals of Statistics, 1978, 6(2):461–464.

[21] Sharma H. A Probability Distribution for Out-migration[J]. Janasamkhya, 1987, 5(2):95–101.

[22] Stewart P, Ning W. Empirical-likelihood-based Hypothesis Tests for the Means of Two Zero-inflated Populations[J]. Communications in Statistics-Simulation and Computation, 2023, 52(10):4933–4961.

[23] Tang Y, Tang W. Testing Modified Zeros for Poisson Regression Models[J]. Statistical Methods in Medical Research, 2019, 28(10–11):3123–3141.

[24] Vuong Q H. Likelihood Ratio Tests for Model Selection and Non-Nested Hypotheses[J]. Econometrica, 1989, 57(2):307–333.

[25] Wilson P. The Misuse of the Vuong Test for Non-nested Models to Test for Zero-inflation[J]. Economics Letters, 2015, 127:51–53.

[26] Xiao X, Tang Y, Xu A, et al. Bayesian Inference for Zero-and-one-inflated Geometric Distribution Regression Model Using Pólya-Gamma Latent Variables[J]. Communications in Statistics-Theory and Methods, 2020, 49(15):3730–3743.

[27] Ye P, Qiao X, Tang W, et al. Testing Latent Class of Subjects with Structural Zeros in Negative Binomial Models with Applications to Gut Microbiome Data[J]. Statistical Methods in Medical Research, 2022, 31(11):2237–2254.

[28] Ye P, Tang Y, Sun L, et al. Testing Inflated Zeros in Binomial Regression Models[J]. Biometrical Journal, 2021, 63(1):59–80.

[29] Young D S, Roemmele E S, Yeh P. Zero-inflated Modeling Part I:Traditional Zero-inflated Count Regression Models, Their Applications, and Computational Tools[J]. WIREs Computational Statistics, 2022, 14(1):1541.

[30] Zou Y, Peng Z, Cornell J, et al. A New Statistical Test for Latent Class in Censored Data Due to Detection Limit[J]. Statistics in Medicine, 2021,40(3):779–798.

(1)因篇幅所限,定理1~3证明以附件1展示,见《统计研究》网站所列附件。下同。

(1)因篇幅所限,几何回归模型下不同样本量时,无协变量的QQ图、协变量服从正态分布的QQ图、协变量服从均匀分布的QQ图分别以附图1~3展示。

(1)因篇幅所限,无协变量时ZIGR模型零膨胀检验功效结果以附图4展示。

(1)因篇幅所限,均值的协变量服从正态分布,零膨胀参数φ无协变量和依赖协变量时ZIGR模型零膨胀检验功效分别以附图5~6展示。

(2)因篇幅所限,均值的协变量服从均匀分布,零膨胀参数φ无协变量和依赖协变量时ZIGR模型零膨胀检验功效分别以附图7~8展示。

(1)数据来源为CHARLS数据库,相关网址为https://charls.charlsdata.com/pages/Data/2020-charls-wave5/zh-CN.html。

基本信息:

DOI:10.19343/j.cnki.11-1302/c.2025.08.011

中图分类号:O212.1

引用信息:

[1]张丽平,田茂再.几何回归模型中的零膨胀检验及应用[J].统计研究,2025,42(08):135-146.DOI:10.19343/j.cnki.11-1302/c.2025.08.011.

基金信息:

新疆维吾尔自治区研究生科研创新项目“零膨胀几何回归的统计推断及应用研究”(XJ2025G219); 北京市自然科学基金项目“函数型分层分位回归建模理论方法及应用”(1242005)

投稿时间:

2024-06-14

投稿日期(年):

2024

终审时间:

2025-07-25

终审日期(年):

2025

审稿周期(年):

2

发布时间:

2025-08-25

出版时间:

2025-08-25

检 索 高级检索

引用

GB/T 7714-2015 格式引文
MLA格式引文
APA格式引文