nav emailalert searchbtn searchbox tablepage yinyongbenwen piczone journalimg journalInfo journalinfonormal searchdiv searchzone qikanlogo popupnotification paper paperNew
2026, 03, v.43 3-24
大数据、人工智能与经济统计:机遇与挑战
基金项目(Foundation): 中国科学院大学数智时代经济管理复杂系统建模教育部哲学社会科学创新团队(E5820801); 教育部人文社会科学重点研究基地重大项目(22JJD790049); 国家自然科学基金“计量建模与经济政策研究”基础科学中心项目(71988101); 内蒙古自治区自然科学基金青年基金“稳健贫困脆弱性方法视角下收入流动的群体划分、影响机理与促进政策优化研究”(2025QN07015)
邮箱(Email): mingzhang1990@imu.edu.cn;
DOI: 10.19343/j.cnki.11-1302/c.2026.03.001
发布时间: 2026-03-25
出版时间: 2026-03-25
移动端阅读
摘要:

随着数智时代的来临,基于互联网、移动互联网产生的大数据与人工智能技术正在深刻重塑经济统计体系。本文在梳理传统经济统计局限性的基础上,系统探讨大数据与人工智能对经济统计的多维度影响。数据获取方式上,商品扫描数据、手机信令数据、卫星遥感数据等多源非结构化数据以及大模型生成数据正成为传统统计调查的重要补充。经济测度范式上,实现经济统计指标高频化实时化,推动基于非结构化数据的主观因素测度,借助大数据与机器学习等方法赋能基于估计与预测的经济测度,以及创新统计监督方式。统计指标体系上,从以生产为核心的传统指标向涵盖社会福利、可持续发展等多维度统计指标体系拓展。实践层面上,经济大数据的非实验性特征可能带来样本选择偏差,凸显了因果推断与循证政策评估在数智时代的挑战性。面对大数据与人工智能带来的机遇与挑战,本文提出数智时代经济统计测度方法、指标体系与理论基础等方面的若干重要发展方向,以此推动经济统计实现理论、方法与应用的协同升级。

Abstract:

With the advent of the digital intelligence era, big data and artificial intelligence technologies are profoundly reshaping the economic statistical system. On the basis of reviewing the limitations of traditional economic statistics, this paper systematically explores the multi-dimensional impacts of big data and artificial intelligence on economic statistics. In terms of data acquisition methods, multi-source unstructured data such as product scanning data, mobile signaling data, satellite remote sensing data, as well as data generated by large models, are becoming important supplements to traditional statistical surveys. In terms of economic measurement paradigms, these technologies enable the high-frequency and real-time updating of economic statistical indicators, facilitate the measurement of subjective factors based on unstructured data, empower economic measurement through estimation and prediction with big data and machine learning methods, and innovate approaches to statistical oversight. Regarding the statistical indicator system, there is a gradual shift from traditional production-centered indicators toward a multi-dimensional statistical framework encompassing social welfare and sustainable development. At the practice level, the non-experimental nature of economic big data may introduce sample selection bias, highlighting the challenges of causal inference and evidence-based policy evaluation in the digital intelligence era. In response to the opportunities and challenges brought by big data and artificial intelligence, this paper proposes several important directions for the development of economic statistical measurement methods, indicator systems, and theoretical foundations in the digital intelligence era, aiming to promote the coordinated advancement of theory, methodology, and application in economic statistics.

参考文献

[1]陈梦根,陈曦,葛建军.数字经济对货币供应量统计的挑战及改进思路[J].统计研究, 2025, 42(12):31–46.

[2]傅小兰.心理学与神经科学的互动:作用、挑战与突破[J].中国科学院院刊, 2025, 40(6):1082–1088.

[3]洪永淼.经济统计学与计量经济学等相关学科的关系及发展前景[J].统计研究, 2016, 33(5):3–12.

[4]洪永淼,邱东,罗良清.经济测度与经济统计学面临的挑战与机遇[J].统计研究, 2025, 42(1):4–17.

[5]洪永淼,史九领.人工智能的政治经济学分析[J].学术月刊, 2024, 56(1):43–59.

[6]洪永淼,史九领.数字经济为经济理论创新带来哪些机遇与挑战?[R].工作论文, 2025.

[7]洪永淼,汪寿阳.大数据,机器学习与统计学.挑战与机遇[J].计量经济学报, 2021, 1(1):17–35.

[8]姜婷凤,汤珂,刘涛雄.基于在线大数据的通货膨胀“现时”预测[J].计量经济学报, 2022, 2(3):597–619.

[9]李国锋,李祚娟,王哲吉.基于多任务深度神经网络的企业纳税行为甄别研究[J].统计研究, 2022, 39(7):137–149.

[10]刘涛雄,戎珂,张亚迪.数据资本估算及对中国经济增长的贡献——基于数据价值链的视角[J].中国社会科学, 2023(10):46–64.

[11]刘涛雄,汤珂,姜婷凤,等.一种基于在线大数据的高频CPI指数的设计及应用[J].数量经济技术经济研究, 2019, 36(9):86–101.

[12]唐晓彬,崔茂生,刘思明.基于多源高维数据的经济景气实时监测研究[J].统计研究, 2024, 41(12):28–41.

[13]许宪春,张钟文,胡亚茹.数据资产统计与核算问题研究[J].管理世界, 2022, 138(2):16–30.

[14]张兴祥,钟威,洪永淼.国民幸福感的指标体系构建与影响因素分析:基于LASSO的筛选方法[J].统计研究, 2018, 35(11):3–13.

[15]Almås I, Attanasio O, Jervis P. Presidential Address:Economics and Measurement:New Measures to Model Decision Making[J]. Econometrica,2024, 92(4):947–978.

[16]Andersen T G, Bollerslev T. Answering the Skeptics:Yes, Standard Volatility Models Do Provide Accurate Forecasts[J]. International Economic Review, 1998, 39(4):885–905.

[17]Antolín-Díaz J, Drechsel T, Petrella I. Advances in Nowcasting Economic Activity:The Role of Heterogeneous Dynamics and Fat Tails[J].Journal of Econometrics, 2024, 238(2):105634.

[18]Athey S. The Impact of Machine Learning on Economics. Economics of Artificial Intelligence:An Agenda[M]. Chicago:University of Chicago Press, 2019:507–547.

[19]Athey S, Imbens G W. Machine Learning Methods That Economists Should Know About[J]. Annual Review of Economics, 2019, 11:685–725.

[20]Baker S, Bloom N, Davis S J. Measuring Economic Policy Uncertainty[J]. Quarterly Journal of Economics, 2016, 131(4):1593–1636.

[21]Barwick J P, Chen S, Fu C, et al. Digital Distractions with Peer Influence:The Impact of Mobile App Usage on Academic and Labor Market Outcomes[R]. NBER Working Paper, 2024.

[22]Benjamin D, Heffetz O, Kimball M, et al. Beyond Happiness and Satisfaction:Toward Well-Being Indices Based on Stated Preference[J].American Economic Review, 2014, 104(9):2698–2735.

[23]Berkman H, Jana J, Soderstrom N. Firm-specific Climate Risk and Market Valuation[J]. Accounting, Organizations and Society, 2024, 112:101547.

[24]Bisbee J, Clinton J D, Dorff C, et al. Synthetic Replacements for Human Survey Data? The Perils of Large Language Models[J]. Political Analysis, 2024, 32(4):401–416.

[25]Brynjolfsson E. The Productivity Paradox of Information Technology[J]. Communications of the ACM, 1993, 36(12):66–77.

[26]Brynjolfsson E, Collis A, Diewert W, et al. GDP-B:Accounting for the Value of New and Free Goods[J]. American Economic Journal:Macroeconomics, 2025, 17(4):312–344.

[27]Cannari L, D’Alessio G. Wealth Inequality in Italy:Reconstruction of 1968-75 Data and Comparison with Recent Estimates[R]. SSRN Working Paper, 2018.

[28]Cavallo A. Inflation with COVID Consumption Baskets[J]. Special Issue on the Global Economy:Looking Back, Moving Forward, Part II. IMF Economic Review, 2024, 72(2):902–917.

[29]Cavallo A, Rigobon R. The Billion Prices Project:Using Online Prices for Measurement and Research[J]. Journal of Economic Perspectives,2016, 30(2):151–178.

[30]Chang C H, Erbaugh J T, Fajardo P, et al. Global Evidence of Human Well-being and Biodiversity Impacts of Natural Climate Solutions[J].Nature Sustainability, 2025, 8:75–85.

[31]Chernozhukov V, Chetverikov D, Demirer M, et al. Double/debiased Machine Learning for Treatment and Structural Parameters[J].Econometrics Journal, 2018, 21:C1–C68.

[32]Chetty R, Jackson M, Kuchler T, et al. Social Capital I:Measurement and Associations with Economic Mobility[J]. Nature, 2022, 608:108–121.

[33]Chipman H, George E, McCulloch R. BART:Bayesian Additive Regression Trees[J]. Annals of Applied Statistics, 2010, 4(1):266–298.

[34]Choi S, Shin J, Yoo S Y. Are Government Spending Shocks Inflationary at the Zero Lower Bound? New Evidence from Daily Data[J]. Journal of Economic Dynamics and Control, 2022, 139:104423.

[35]Colonescu C. The Effects of Donald Trump’s Tweets on US Financial and Foreign Exchange Markets[J]. Athens Journal of Business&Economics, 2018, 4(4):375–388.

[36]Dubois P, Griffith R, O’Connell M. The Use of Scanner Data for Economics Research[J]. Annual Review of Economics, 2022, 14:723–745.

[37]Elshendy M, Colladon A F. Big Data Analysis of Economic News:Hints to Forecast Macroeconomic Indicators[J]. International Journal of Engineering Business Management, 2017, 9:1–12.

[38]Fang H, Li M, Lu G. Decoding China’s Industrial Policies[R]. NBER Working Paper, 2025.

[39]Fang J, Wu Si, Tong J. From Data Acquisition to Lag Modeling:Quantitative Exploration of A-Share Market with Low-Coupling System Design[R]. arXiv:2506.19255, 2025.

[40]Farbmacher H, Huber M, Lafférs L, et al. Causal Mediation Analysis with Double Machine Learning[J]. Econometrics Journal, 2022, 25:277–300.

[41]Gelman A, Vehtari A. What Are the Most Important Statistical Ideas of the Past 50 Years?[J]. Journal of American Statistical Association, 2021,116(536):2087–2097.

[42]Ginglinger E, Moreau Q. Climate Risk and Capital Structure[J]. Management Science, 2023, 69(12):7492–7516.

[43]Glaeser E L, Kominers S D, Luca M, et al. Big Data and Big Cities:The Promises and Limitations of Improved Measures of Urban Life[J].Economic Inquiry, 2016, 56(1):114–137.

[44]Granger C W J. Investigating Causal Relations by Econometric Models and Cross-spectral Methods[J]. Econometrica, 1969, 37(3):424–438.

[45]Granger C W J. Testing for Causality:A Personal Viewpoint[J]. Journal of Economic Dynamics and Control, 1980, 2:329–352.

[46]Gulen H, Ion M. Policy Uncertainty and Corporate Investment[J]. Review of Financial Studies, 2016, 29(3):523–564.

[47]Haavelmo T. The Role of the Econometrician in the Advancement of Economic Theory[J]. Econometrica, 1958, 26(3):351–357.

[48]Han S. Mining Causality:AI-Assisted Search for Instrumental Variables[R]. arXiv:2409.14202v1, 2024.

[49]Heckman J J. Shadow Prices, Market Wages, and Labour Supply[J]. Econometrica, 1974, 42(4):679–694.

[50]Heckman J J. Sample Selection Bias as a Specification Error[J]. Econometrica, 1979, 47(1):153–161.

[51]Hill J, Linero A, Murray J. Bayesian Additive Regression Trees:A Review and Look Forward[J]. Annual Review of Statistics and Its Application,2020, 7(1):251–278.

[52]Hsiao C, Ching H S, Wan S K. A Panel Data Approach for Program Evaluation:Measuring the Benefits of Political and Economic Integration of Hong Kong with the Mainland[J]. Journal of Applied Econometrics, 2012, 27(5):705–740.

[53]Hu D, Jone C M, Zhang X, et al. When Do Short Sellers Trade? Evidence from Intraday Data and Implications for Informed Trading Models[J].Journal of Financial Economics, 2025, 172:104148.

[54]Hu Y, Yao J. Illuminating Economic Growth[J]. Journal of Econometrics, 2022, 228(2):359–378.

[55]Huang D, Jiang F, Tu J, et al. Investor Sentiment Aligned:A Powerful Predictor of Stock Returns[J]. Review of Financial Studies, 2015, 28(3):791–837.

[56]Huang Y, Luk P. Measuring Economic Policy Uncertainty in China[J]. China Economic Review, 2020, 59(2):101367.

[57]Imbens G, Xu Y. Comparing Experimental and Nonexperimental Methods:What Lessons Have We Learned Four Decades after LaLonde(1986)?[J]. Journal of Economic Perspectives, 2025, 39(4):173–202.

[58]Islayem R, Gebreab S, AlKhader W, et al. Using Large Language Models for Enhanced Fraud Analysis and Detection in Blockchain Based Health Insurance Claims[J]. Scientific Reports, 2025, 15:29763.

[59]Jaravel X, O’Connell M. Real-time Price Indices:Inflation Spike and Falling Product Variety During the Great Lockdown[J]. Journal of Public Economics, 2020, 191:104270.

[60]Jha M, Qian J, Weber M, et al. Harnessing Generative AI for Economic Insights[J]. arXiv:2410.03897, 2024.

[61]Jiang F, Lee J, Martin J, et al. Manager Sentiment and Stock Returns[J]. Journal of Financial Economics, 2019, 132(1):126–149.

[62]Jiang S, Wei L, Zhang C. Donald Trumps in the Virtual Polls:Simulating and Predicting Public Opinions in Surveys Using Large Language Models[R]. arXiv:2411.01582, 2025.

[63]Juma’h A H, Alnsour Y. Using Social Media Analytics:The Effect of President Trump’s Tweets on Companies’ Performance[J]. Journal of Accounting and Management Information Systems, 2018(1):100–121.

[64]Jurafsky D, Martin J H. Speech and Language Processing(3rd ed.)[M]. Pearson, 2023.

[65]Lane G. Adapting to Climate Risk with Guaranteed Credit:Evidence from Bangladesh[J]. Econometrica, 2024, 92(2):355–386.

[66]Lazer D, Kennedy R, King G, et al. Big Data. The Parable of Google Flu:Traps in Big Data Analysis[J]. Science, 2014, 343:1203–1205.

[67]Leamer E E. Let’s Take the Con Out of Econometrics[J]. American Economic Review, 1983, 73(1):31–43.

[68]Li P, Castelo N, Katona Z, et al. Frontiers:Determining the Validity of Large Language Models for Automated Perceptual Analysis[J]. Marketing Science, 2024, 43(2):254–266.

[69]Liang A, Lu J, Mu X, et al. Algorithm Design:A Fairness-Accuracy Frontier[J]. Journal of Political Economy, Forthcoming, 2025.

[70]Lin J, Sun L, Yan Y. Simulating Macroeconomic Expectations Using LLM Agents[R]. arXiv:2505.17648, 2025.

[71]Ludwig J, Mullainathan S. Machine Learning as a Tool for Hypothesis Generation[J]. Quarterly Journal of Economics, 2024, 139(2):751–827.

[72]Lythreatis S, Singh S K, El-Kassar A N. The Digital Divide:A Review and Future Research Agenda[J]. Technological Forecasting and Social Change, 2022, 175:121359.

[73]Meng L, Liu P, Zhou Y, et al. Blaming the Wind? The Impact of Wind Turbine on Bird Biodiversity[J]. Journal of Development Economics,2025, 172:103402.

[74]Momtaz P P. CEO Emotions and Firm Valuation in Initial Coin Offerings:An Artificial Emotional Intelligence Approach[J]. Strategic Management Journal, 2021, 42(3):558–578.

[75]Oparina E, Kaiser C, Gentile N, et al. Machine Learning in the Prediction of Human Wellbeing[J]. Scientific Reports, 2025, 15(1):1632.

[76]Pankratz N, Schiller C. Climate Change and Adaptation in Global Supply-Chain Networks[J]. Review of Financial Studies, 2024, 37(6):1729–1777.

[77]Pataranutaporn P, Powdthavee N, Archiwaranguprok C, et al. Simulating Human Well-being with Large Language Models:Systematic Validation and Misestimation across 64000 Individuals from 64 Countries[J]. Proceedings of the National Academy of Sciences, 2025, 122(48):e2519394122.

[78]Piketty T, Saez E, Zucman G. Distributional National Accounts:Methods and Estimates for the United States[J]. Quarterly Journal of Economics,2018, 133(2):553–609.

[79]Rambachan A, Roth J. A More Credible Approach to Parallel Trends[J]. Review of Economic Studies, 2023, 90(5):2555–2591.

[80]Rathje J, Katila R, Reineke P. Making the Most of AI and Machine Learning in Organizations and Strategy Research:Supervised Machine Learning, Causal Inference, and Matching Models[J]. Strategic Management Journal, 2024, 45(10):1926–1953.

[81]Sargent T, Selvakumar Y, Yang Z. Aggregate Shocks and Cross-Section Dynamics:Quantifying Redistribution and Insurance in US Household Data[R]. Working Paper, 2025.

[82]Shapira E, Madmon O, Reichart R, et al. Can LLMs Replace Economic Choice Prediction Labs? The Case of Language-based Persuasion Games[R]. arXiv:2401.17435, 2024.

[83]Solow R. We’d Better Watch Out[J]. New York Times Book Review, 1987, 12.

[84]Voukelatou V, Gabrielli L, Miliou I, et al. Measuring Objective and Subjective Well-being:Dimensions and Data Sources[J]. International Journal of Data Science and Analytics, 2021, 11:279–309.

[85]Wager S, Athey S. Estimation and Inference of Heterogeneous Treatment Effects Using Random Forests[J]. Journal of American Statistical Association, 2018, 113(523):1228–1242.

[86]Wei H, Liu Y. The Impact of Climate Risk on Rural Land Transfer:Evidence from 360 Chinese Villages[J]. Land Use Policy, 2025, 159:107801.

[87]Yang J W, Jiang L M, Luojus K, et al. Snow Depth Estimation and Historical Data Reconstruction over China Based on a Random Forest Machine Learning Approach[J]. Cryosphere, 2020, 14(6):1763–1778.

[88]Yao X, Fang S. Exploring the Coupling of Ecosystem Services and Human Well-being:Evidence from Chinese Cities through Interpretable Machine Learning[J]. Ecological Indicators, 2025, 114315.

[89]Zhou Y, Xue L, Shi Z, et al. Measuring Housing Vitality from Multi-Source Big Data and Machine Learning[J]. Journal of American Statistical Association, 2022, 117(539):1045–1059.

(1)《新中国政府统计的发展历程是怎样的》,参见https://www.stats.gov.cn/zs/tjws/tjzn/202301/t20230101_1903758.html。

(1)Penn World Table最开始由美国宾夕法尼亚大学的经济学者创建,随后管理权与后续开发移交到加州大学戴维斯分校,现在由荷兰格罗宁根大学管理。

(1)The World’s Most Valuable Resource,参见https://www.economist.com/leaders/2017/05/06/the-worlds-most-valuable-resource-is-no-longeroil-but-data。

(1)联合国统计大数据和数据科学全球中心:《手机信令数据在统计中的应用》,详见https://unbdc.stats.gov.cn/cxycg/rktj/202210/t20221012_991.html。

(1)例如,江苏省苏州市高新区检察院研发专利恶意诉讼监督模型,该模型利用人工智能对苏州地区民事裁判文书进行智能筛选,通过提取案涉专利号、诉讼时间节点等数据要素,与专利信息数据库进行比较,进而识别专利权已经失效的信息,有效加强专利恶意诉讼领域的检察监督。具体参见《当大数据模型遇上知识产权检察——来自三地检察机关的办案实践》,https://www.spp.gov.cn/zdgz/202506/t20250611_697982.shtml。

(2)详见https://www.eurofound.europa.eu/en/surveys/european-quality-life-surveys-eqls。

(1)自下而上方法是指通过详细分析各个碳排放源(如农业、工业、自然生态系统等)的活动数据和排放因子,估算碳排放量。自下而上方法能够提供详细的碳排放源和碳汇的分布信息,适合分析特定区域或部门的排放特征,但该方法严重依赖基础数据和模型假设,可能存在不确定性。

(2)Ceres是一家成立于1989年的美国非营利倡导组织,总部位于波士顿。CookESG数据库是由Impact Cubed(前身为Arabesque)的联合创始人Robert G. Eccles和Svetlana Klimenko等人领导开发的一套商业化的企业ESG数据与评分产品。

(1)周晓光:《稳就业需释放中小微企业潜力》,《经济日报》,2025-3-11。

(2)详见https://news.cnr.cn/native/gd/20240826/t20240826_526872023.shtml。

(1)《2024-2025 Accomplishments Report:From Commitment to Continuity 》,详见https://www.statcan.gc.ca/en/about/transparency/disaggregated-data-accomplishments/disaggregated-data-accomplishments-2024-2025.pdf。

(2)[法]托马斯·皮凯蒂:《21世纪资本论》,巴曙松等译,中信出版社2014年版,第270页。

基本信息:

DOI:10.19343/j.cnki.11-1302/c.2026.03.001

中图分类号:F222;TP311.13;TP18

引用信息:

[1]洪永淼,罗良清,张明.大数据、人工智能与经济统计:机遇与挑战[J].统计研究,2026,43(03):3-24.DOI:10.19343/j.cnki.11-1302/c.2026.03.001.

基金信息:

中国科学院大学数智时代经济管理复杂系统建模教育部哲学社会科学创新团队(E5820801); 教育部人文社会科学重点研究基地重大项目(22JJD790049); 国家自然科学基金“计量建模与经济政策研究”基础科学中心项目(71988101); 内蒙古自治区自然科学基金青年基金“稳健贫困脆弱性方法视角下收入流动的群体划分、影响机理与促进政策优化研究”(2025QN07015)

发布时间:

2026-03-25

出版时间:

2026-03-25

检 索 高级检索

引用

GB/T 7714-2015 格式引文
MLA格式引文
APA格式引文