Journal of Engineering and Applied Sciences

ISSN: Online 1818-7803
ISSN: Print 1816-949x

A Statistical Method for Big Data with Excessive Zero-Inflated Problem

Sunghae Jun
Page: 2465-2469 | Received 21 Sep 2022, Published online: 21 Sep 2022

Abstract

In big data analysis, we often encounter the zero-inflated problem: the data table produced by preprocessing the collected big data contains an excessive number of zeros. If such data are analyzed as they are, the estimation and prediction performance of statistical models deteriorates. To build valid models for big data analysis, the zero-inflated problem must therefore be solved. We propose a statistical modeling approach to overcome the zero-inflated problem in big data analysis. In this study, we combine a data division method with count data models such as Poisson, hurdle, and negative binomial regression. To verify the validity of the proposed approach, we carry out a case study using simulated and patent big data.
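
The zero-inflated problem described in the abstract can be illustrated with a small simulation. The sketch below is not the article's method and does not reproduce its data division step. It only shows, under assumed settings, that a plain Poisson model predicts far fewer zeros than zero-inflated count data actually contain. The function names, the mixing probability `zero_prob`, and the rate `lam` are hypothetical choices made for illustration.

```python
import math
import random


def poisson_draw(rng, lam):
    """Draw one Poisson(lam) variate with Knuth's multiplication method."""
    threshold = math.exp(-lam)
    k = 0
    p = 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def simulate_zero_inflated_poisson(n, zero_prob, lam, seed=0):
    """Simulate counts that are 0 with probability zero_prob, else Poisson(lam).

    All parameter values used here are illustrative assumptions.
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        if rng.random() < zero_prob:
            samples.append(0)  # structural (excess) zero
        else:
            samples.append(poisson_draw(rng, lam))
    return samples


def zero_fractions(samples):
    """Return (observed zero share, zero share implied by a fitted Poisson).

    The Poisson maximum likelihood estimate of the rate is the sample mean,
    and a Poisson with that rate assigns probability exp(-mean) to zero.
    """
    n = len(samples)
    observed = sum(1 for x in samples if x == 0) / n
    mean = sum(samples) / n
    expected_poisson = math.exp(-mean)
    return observed, expected_poisson


counts = simulate_zero_inflated_poisson(n=10_000, zero_prob=0.7, lam=3.0)
observed, expected = zero_fractions(counts)
print(f"observed zero share: {observed:.3f}")
print(f"Poisson-implied zero share: {expected:.3f}")
```

A large gap between the two shares is one simple sign that a plain Poisson regression is a poor fit, which is the situation in which hurdle or zero-inflated count models are usually considered.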


How to cite this article:

Sunghae Jun. A Statistical Method for Big Data with Excessive Zero-Inflated Problem.
DOI: https://doi.org/10.36478/jeasci.2019.2465.2469
URL: https://www.makhillpublications.co/view-article/1816-949x/jeasci.2019.2465.2469