Research on Statistical Problems in the Era of Big Data
Download as PDF
DOI: 10.25236/icamfss.2024.019
Corresponding Author
Wenjia Deng
Abstract
With the advent of the big data era, statistics is facing unprecedented challenges and opportunities. This paper aims to discuss the statistical problems in the era of big data and propose solutions. First, we outline the definition, characteristics and impact of big data on statistics, and point out the challenges posed by big data to traditional statistical methods. Then, we classify and analyze the big data statistics problems, including data acquisition and preprocessing, data quality and accuracy, data analysis and modelling, data visualization and interpretation, etc. Then, we discussed the methods to solve the problem of big data statistics, including the application and limitations of traditional statistical methods, the application of machine learning and deep learning in big data statistics, and the impact of big data technology on statistics. Finally, we summarize the research, point out the existing problems and deficiencies, and put forward the future development direction and suggestions. This paper aims to provide a reference for understanding statistical problems in the era of big data and guidance for research and practice in related fields.
Keywords
Big Data; Statistical Problems; Data Mining; Data Analysis; Machine Learning; Deep Learning; Statistical Methods