This article presents a parametric bootstrap approach to inference on the regression coefficients in panel data models. We aim to propose a method that is easily applicable for implement hypothesis testing and construct confidence interval of the regression coefficients vector of balanced and unbalanced panel data models. We show the results of our simulation study to compare of our parametric bootstrap approach with other approaches and approximated methods based on a Monte Carlo simulation study.
Panel data are the combination of observations on a cross section of individuals, cities, factories, etc., over many time periods. Panel data have been applied in economics extensively. The Panel Study of Income Dynamics (PSID) from the Survey Research Center at the University of Michigan and the Gasoline demand panel of annual observations across 18 Organisation for Economic Co-operation and Development (OECD) countries, covering the period 1960–1978, are two famous examples of panel data. Analysis of panel data models has worked in statistics and econometrics by many researchers ([ 1 , 5 , 14 ], and references therein). Baltagi [ 2 ] presents an overview for panel data models excellently. Precious inferences in panel data are difficult when these models have nuisance parameters. Zhao [ 23 ] suggested generalized p value inferences in panel data models of generalized p values. When nuisance parameters are present in models, generalized p values are effective to solve the testing problems [ 8 , 11 , 16 , 17 , 18 – 19 , 24 ]. Parametric bootstrap approaches are used as another method for inferences in panel data models, when unknown parameters are present. Xu et al. [ 21 ] and [ 22 ] provided parametric bootstrap inferences for parameters of the linear combination of regression coefficients in balanced and unbalanced panel data models, respectively.
In this article, we aim to propose a method that is easily applicable for implement hypothesis testing and construct confidence interval of the regression coefficient vector of balanced and unbalanced panel data models. Our procedure is based on a new parametric bootstrap pivot variable . The performance of our PB method is compared with generalized p value approaches introduced by [ 23 ]. The numerical results in section “Simulation study” show that in terms of the type I error rate and power, the performance of our method is better than generalized p value (GPV) inferences and approximate (AP) method.
The rest of this paper is organized as follows. Our PB approaches for hypothesis testing and constructing confidence region about the regression coefficients vector are presented for the balanced and unbalanced panel data models in section “PB inferences for the regression coefficients”. In section “Simulation study”, the proposed PB methods are evaluated in terms of type I error rates and powers. The suggested PB approaches are illustrated with a real data example in section “Example”. The some conclusions are assumed in section “Conclusions”.
Panel data regression models show the behaviour of several explanatory variables on the response variable between
N
individuals over
T
time periods. A panel data model is
Equation (
2.1
) can also be expressed as matrix notations,
Let
The values of
We can construct a approximated (AP) confidence region as
Let
The unbalanced panel data model is given by:
In this section, we present the results of our simulation study to compare the size and powers of our PB approach with generalized p values by [ 23 ] and approximated methods based on a Monte Carlo simulation study. we use the abbreviation PB, GPV and AP to refer these three methods. At first, we briefly review the GPV method.
[
23
] only proposed a generalized
p
value method for testing
Algorithm : We use the following steps to estimate powers of the PB and GPV methods.
1.
For a given (
N
,
T
) and
2.
Generate
3.
Repeat step 2 many times (
4.
Repeat steps (
1
) to (
3
) for
For power estimation of the AP method, we compute the fraction of times that the value of
The results of simulation for the different values of
Simulated powers of the GPV, PB and AP tests at 5% nominal level ( Tests (2,3,1,5) (2.1,3.1,1.1,5.1) (4,3,1,5) (2,3.1,1,5.1) (2,3,1.5,5.1) (10, 6) (0.01, 1) GPV 1.0000 1.0000 1.0000 1.0000 PB 0.0458 1.0000 1.0000 1.0000 1.0000 AP 1.0000 1.0000 1.0000 1.0000 (1, 1) GPV 0.8582 0.9570 0.8784 0.8194 PB 0.0446 0.8700 0.9754 0.8956 0.7896 AP 0.9530 0.9942 0.9654 0.9114 (10, 1) GPV 0.2612 0.2622 0.2578 0.3680 PB 0.0508 0.2388 0.2870 0.2416 0.3006 AP 0.3654 0.4068 0.3550 0.4456 (100, 1) GPV 0.1322 0.0814 0.1336 0.2556 PB 0.0466 0.1074 0.0756 0.1078 0.2483 AP 0.1788 0.1280 0.1782 0.3312 (12, 5) (0.01, 1) GPV 0.0518 1.0000 1.0000 1.0000 1.0000 PB 0.0486 1.0000 1.0000 1.0000 1.0000 AP 1.0000 1.0000 1.0000 1.0000 (1, 1) GPV 0.0494 0.8594 0.9796 0.8850 0.6903 PB 0.0496 0.8484 0.9760 0.8730 0.6812 AP 0.9620 0.9964 0.9732 0.8734 (10, 1) GPV 0.1826 0.2724 0.1888 0.1386 PB 0.0508 0.1632 0.2362 0.1596 0.1200 AP 0.3320 0.4498 0.3354 0.2662 (100, 1) GPV 0.0880 0.0654 0.0630 0.0710 PB 0.0547 0.0644 0.0484 0.0446 0.0528 AP 0.1466 0.1106 0.1138 0.1286 (20, 3) (0.01, 1) GPV 1.0000 1.0000 1.0000 1.0000 PB 0.0492 1.0000 1.0000 1.0000 1.0000 AP 1.0000 1.0000 1.0000 1.0000 (1, 1) GPV 0.9948 1.0000 0.9968 0.9726 PB 0.0528 0.9960 1.0000 0.9968 0.9698 AP 0.9988 1.0000 0.9988 0.9840 (10, 1) GPV 0.3576 0.5180 0.3628 0.3818 PB 0.0516 0.3456 0.5166 0.3558 0.3548 AP 0.4302 0.6010 0.4446 0.4446 (100, 1) GPV 0.0510 0.0968 0.0950 0.0906 0.2002 PB 0.0456 0.0912 0.0900 0.0836 0.1906 AP 0.1284 0.1266 0.1190 0.2452 Data of motor gasoline consumption Country Year Austria 1960 4.1732 1961 4.1010 1962 4.0732 1963 4.0595 1964 4.0377 Belgium 1960 4.1640 1961 4.1244 1962 4.0760 1963 4.0013 1964 3.9944 Canada 1960 4.8552 1961 4.8266 1962 4.8505 1963 4.8381 1964 4.8398 Denmark 1960 4.5020 1961 4.4828 1962 4.3854 1963 4.3540 1964 4.3264 France 1960 3.9077 1961 3.8856 1962 3.8237 1963 3.7890 1964 3.7671 Germany 1960 3.9170 1961 3.8853 1962 3.8715 1963 3.8488 1964 3.8690 Spain 1960 4.7494 1.1253 1961 4.5892 1.1096 1962 4.4291 1.0570 1963 4.3465 0.9768 1964 4.3006 0.9153 Sweden 1960 4.0630 1961 4.0619 1962 4.0064 1963 4.0028 1964 4.0249 Switzer 1960 4.3976 1961 4.4413 1962 4.2871 1963 4.3125 1964 4.3134 Turkey 1960 6.1296 1961 6.1062 1962 6.0846 1963 6.0751 1964 6.0646 U.K. 1960 4.1002 1961 4.0886 1962 4.0481 1963 3.9853 1964 3.9768 U.S.A. 1960 4.8240 1961 4.7963 1962 4.7989 1963 4.7879 1964 4.8083Table 1
Table 2
Note that the estimated powers vary slightly from one simulation to another [
9
]. Therefore, we used the well-known
z
test to compare powers of two methods. One can conclude that the powers of two test procedures are statistically significant at 100
In all cases that we considered here, the estimated sizes of our PB test vary between 0.0446 and 0.0547 which shows that our proposed test behaves like the exact test.
The simulated size probabilities in the GPV and AP often exceed the upper limit of this range, and then, these methods are assumed to be liberal. Therefore, in this paper, the powers of these test methods cannot be comparable with our parametric bootstrap approach.
To compare the estimated power, in the cases that the estimated size of GPV is close to 0.05, the PB test and GPV have not significantly different powers.
Overall, it seems that the proposed PB method has better performance than two other methods in terms of both controlling the type I error rates and powers.
To illustrate our suggested approach to inference on the regression coefficients of a panel data, we consider the following gasoline demand equation like [
3
] as
For testing
In this example, for obtaining PB confidence region for the regression coefficient vector
In this article, we propose a parametric bootstrap method for testing hypothesis as well as constructing confidence region on the regression coefficients vector (
We are grateful to referees for their valuable comments and suggestions.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.