On generating multivariate Poisson data in management science applications
Abstract
Generating multivariate Poisson random variables is essential in many applications, such as multi echelon supply chain systems, multi‐item/multi‐period pricing models, accident monitoring systems, etc. Current simulation methods suffer from limitations ranging from computational complexity to restrictions on the structure of the correlation matrix, and therefore are rarely used in management science. Instead, multivariate Poisson data are commonly approximated by either univariate Poisson or multivariate Normal data. However, these approximations are often not adequate in practice.
In this paper, we propose a conceptually appealing correction for NORTA (NORmal To Anything) for generating multivariate Poisson data with a flexible correlation structure and rates. NORTA is based on simulating data from a multivariate Normal distribution and converting it into an arbitrary continuous distribution with a specific correlation matrix. We show that our method is both highly accurate and computationally efficient. We also show the managerial advantages of generating multivariate Poisson data over univariate Poisson or multivariate Normal data. Copyright © 2011 John Wiley & Sons, Ltd.
Citing Literature
Number of times cited according to CrossRef: 39
- Lynette M. Smith, Walter W. Stroup, David B. Marx, Poisson cokriging as a generalized linear mixed model, Spatial Statistics, 10.1016/j.spasta.2019.100399, 35, (100399), (2020).
- Hassan S. Bakouch, Y. Sunecher, N. Mamode Khan, V. Jowaheer, A non‐stationary bivariate INAR(1) process with a simple cross‐dependence: Estimation with some properties, Australian & New Zealand Journal of Statistics, 10.1111/anzs.12285, 62, 1, (25-48), (2020).
- Euijin Lim, Won-Chan Lee, Subscore Equating and Profile Reporting, Applied Measurement in Education, 10.1080/08957347.2020.1732381, (1-18), (2020).
- H. Demirtas, R. Gao, Mixed data generation packages and related computational tools in R, Communications in Statistics - Simulation and Computation, 10.1080/03610918.2020.1745841, (1-44), (2020).
- Erfan Ghasemi, Alireza Akbarzadeh Baghban, Farid Zayeri, Asma Pourhoseingholi, Seyed Mohammadreza Safavi, A doubly-inflated Poisson regression for correlated count data, Journal of Applied Statistics, 10.1080/02664763.2020.1757049, (1-17), (2020).
- Shuguang He, Lisha Song, Yanfen Shang, Zhiqiong Wang, Change-point detection in Phase I for autocorrelated Poisson profiles with random or unbalanced designs, International Journal of Production Research, 10.1080/00207543.2020.1762017, (1-18), (2020).
- Yisu Jia, Robert Lund, James Livsey, SUPERPOSITIONED STATIONARY COUNT TIME SERIES, Probability in the Engineering and Informational Sciences, 10.1017/S0269964819000433, (1-19), (2019).
- Robert Mislavsky, Berkeley J. Dietvorst, Uri Simonsohn, The minimum mean paradox: A mechanical explanation for apparent experiment aversion, Proceedings of the National Academy of Sciences, 10.1073/pnas.1912413116, (201912413), (2019).
- N. Mamode Khan, Hatice Oncel Cekim, Gamze Ozel, The family of the bivariate integer-valued autoregressive process (BINAR(1)) with Poisson–Lindley (PL) innovations, Journal of Statistical Computation and Simulation, 10.1080/00949655.2019.1694929, (1-14), (2019).
- N. Mamode Khan, V. Jowaheer, Y. Sunecher, Communication in Statistics-Theory and methods improved GQL estimation method for the generalised BINMA(1) model, Communications in Statistics - Theory and Methods, 10.1080/03610926.2018.1435807, 48, 3, (709-725), (2018).
- Maiju Pesonen, Jaakko Nevalainen, Steven Potter, Somnath Datta, Susmita Datta, A Combined PLS and Negative Binomial Regression Model for Inferring Association Networks from Next-Generation Sequencing Count Data, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 10.1109/TCBB.2017.2665495, 15, 3, (760-773), (2018).
- Simone Silvestri, Rahul Urgaonkar, Murtaza Zafer, Bong Jun Ko, A Framework for the Inference of Sensing Measurements Based on Correlation, ACM Transactions on Sensor Networks, 10.1145/3272035, 15, 1, (1-28), (2018).
- Bochao Jia, Suwa Xu, Guanghua Xiao, Vishal Lamba, Faming Liang, Learning gene regulatory networks from next generation sequencing data, Biometrics, 10.1111/biom.12682, 73, 4, (1221-1230), (2017).
- David I. Inouye, Eunho Yang, Genevera I. Allen, Pradeep Ravikumar, A review of multivariate distributions for count data derived from the Poisson distribution, Wiley Interdisciplinary Reviews: Computational Statistics, 10.1002/wics.1398, 9, 3, (2017).
- Ehsan Yadollahi, El‐Houssaine Aghezzaf, Birger Raa, Managing inventory and service levels in a safety stock‐based inventory routing system with stochastic retailer demands, Applied Stochastic Models in Business and Industry, 10.1002/asmb.2241, 33, 4, (369-381), (2017).
- Hakan Demirtas, Rawan Allozi, Yiran Hu, Gul Inan, Levent Ozbek, Joint Generation of Binary, Ordinal, Count, and Normal Data with Specified Marginal and Association Structures in Monte-Carlo Simulations, Monte-Carlo Simulation-Based Statistical Modeling, 10.1007/978-981-10-3307-0_1, (3-15), (2017).
- Hakan Demirtas, A Multiple Imputation Framework for Massive Multivariate Data of Different Variable Types: A Monte-Carlo Technique, Monte-Carlo Simulation-Based Statistical Modeling, 10.1007/978-981-10-3307-0_8, (143-162), (2017).
- Yuvraj Sunecher, Naushad Mamode Khan, Vandna Jowaheer, A GQL estimation approach for analysing non-stationary over-dispersed BINAR(1) time series, Journal of Statistical Computation and Simulation, 10.1080/00949655.2017.1296152, 87, 10, (1911-1924), (2017).
- Vandna Jowaheer, Yuvraj Sunecher, Naushad Mamode Khan, A non-stationary BINAR(1) process with negative binomial innovations for modeling the number of goals in the first and second half: The case of Arsenal Football Club, Communications in Statistics: Case Studies, Data Analysis and Applications, 10.1080/23737484.2017.1284577, 2, 1-2, (21-33), (2017).
- Sergei Leonov, Bahjat Qaqish, Correlated endpoints: simulation, modeling, and extreme correlations, Statistical Papers, 10.1007/s00362-017-0960-2, (2017).
- N. Mamode Khan, Y. Sunecher, V. Jowaheer, Inferential methods for an unconstrained nonstationary BINMA time series process with Poisson innovations, Journal of Statistical Theory and Practice, 10.1080/15598608.2016.1258600, 11, 1, (76-106), (2016).
- Yuvraj Sunecher, Naushad Mamode Khan, Vandna Jowaheer, Estimating the parameters of a BINMA Poisson model for a non-stationary bivariate time series, Communications in Statistics - Simulation and Computation, 10.1080/03610918.2016.1212068, 46, 9, (6803-6827), (2016).
- Alexander Kreinin, Correlated Poisson Processes and Their Applications in Financial Modeling, Financial Signal Processing and Machine Learning, 10.1002/9781118745540, (191-232), (2016).
- Sobom M. Somé, Célestin C. Kokonendji, Effects of associated kernels in nonparametric multiple regressions, Journal of Statistical Theory and Practice, 10.1080/15598608.2016.1160010, 10, 2, (456-471), (2016).
- Reza Modarres, Multivariate Poisson interpoint distances, Statistics & Probability Letters, 10.1016/j.spl.2016.01.025, 112, (113-123), (2016).
- Anup Amatya, Hakan Demirtas, Concurrent generation of multivariate mixed data with variables of dissimilar types, Journal of Statistical Computation and Simulation, 10.1080/00949655.2016.1177530, 86, 18, (3595-3607), (2016).
- Alexander Kolovos, Lynette M. Smith, Aimee Schwab-McCoy, Sarah Gengler, Hwa-Lung Yu, Emerging patterns in multi-sourced data modeling uncertainty, Spatial Statistics, 10.1016/j.spasta.2016.05.005, 18, (300-317), (2016).
- Roger S. Zoh, Bani Mallick, Ivan Ivanov, Veera Baladandayuthapani, Ganiraju Manyam, Robert S. Chapkin, Johanna W. Lampe, Raymond J. Carroll, PCAN: Probabilistic correlation analysis of two non‐normal data sets, Biometrics, 10.1111/biom.12516, 72, 4, (1358-1368), (2016).
- Jochen Kruppa, Frank Kramer, Tim Beißbarth, Klaus Jung, A simulation framework for correlated count data of features subsets in high-throughput sequencing or proteomics experiments, Statistical Applications in Genetics and Molecular Biology, 10.1515/sagmb-2015-0082, 15, 5, (2016).
- Naushad Mamode Khan, Yuvraj Sunecher, Vandna Jowaheer, Modelling a non-stationary BINAR(1) Poisson process, Journal of Statistical Computation and Simulation, 10.1080/00949655.2016.1150482, 86, 15, (3106-3126), (2016).
- Jing Lv, Hu Yang, Chaohui Guo, An efficient and robust variable selection method for longitudinal generalized linear models, Computational Statistics & Data Analysis, 10.1016/j.csda.2014.08.006, 82, (74-88), (2015).
- Simone Silvestri, Rahul Urgaonkar, Murtaza Zafer, Bong Jun Ko, undefined, 2015 IEEE 35th International Conference on Distributed Computing Systems, 10.1109/ICDCS.2015.35, (268-277), (2015).
- George Kalema, Samuel Iddi, Geert Molenberghs, The Combined Model: A Tool for Simulating Correlated Counts with Overdispersion, Communications in Statistics - Simulation and Computation, 10.1080/03610918.2014.906610, 45, 7, (2491-2510), (2014).
- Alessandro Barbiero, Pier Alda Ferrari, Simulation of correlated Poisson variables, Applied Stochastic Models in Business and Industry, 10.1002/asmb.2072, 31, 5, (669-680), (2014).
- Anup Amatya, Hakan Demirtas, Simultaneous generation of multivariate mixed data with Poisson and normal marginals, Journal of Statistical Computation and Simulation, 10.1080/00949655.2014.953534, 85, 15, (3129-3139), (2014).
- Inbal Yahav, Galit Shmueli, Directionally Sensitive Multivariate Control Charts in Practice: Application to Biosurveillance, Quality and Reliability Engineering International, 10.1002/qre.1491, 30, 2, (159-179), (2013).
- Seyed Taghi Akhavan Niaki, Majid Khedmati, Monotonic change-point estimation of multivariate Poisson processes using a multi-attribute control chart and MLE, International Journal of Production Research, 10.1080/00207543.2013.857797, 52, 10, (2954-2982), (2013).
- Carlos León, Estimating Financial Institutions’ Intraday Liquidity Risk: A Monte Carlo Simulation Approach, SSRN Electronic Journal, 10.2139/ssrn.2101239, (2012).
- Peirong Xu, Lixing Zhu, Estimation for a marginal generalized single-index longitudinal model, Journal of Multivariate Analysis, 10.1016/j.jmva.2011.10.004, 105, 1, (285-299), (2012).




