DOI: 10.32725/978-80-7694-053-6.63



Inproforum 2023, 17:5-10 | DOI: 10.32725/978-80-7694-053-6.63

Statistical Method Selection Matters: Vanilla Methods in Regression May Yield Misleading Results

Jan Kalina

The primary aim of this work is to illustrate the importance of the choice of the appropriate methods for the statistical analysis of economic data. Typically, there exist several alternative versions of common statistical methods for every statistical modeling task and the most habitually used (“vanilla”) versions may yield rather misleading results in non-standard situations. Linear regression is considered here as the most fundamental econometric model. First, the analysis of a world tourism dataset is presented, where the number of international arrivals is modeled for 140 countries of the world as a response of 14 pillars (indicators) of the Travel and Tourism Competitiveness Index. Heteroscedasticity is clearly recognized in the dataset. However, the Aitken estimator, which would be the standard remedy in such a situation, is revealed here to be very inappropriate; regression quantiles represent a much more suitable solution here. The second illustration with artificial data reveals standard regression quantiles to be unsuitable for data contaminated by outlying values; their recently proposed robust version turns out to be much more appropriate. Both illustrations reveal that choosing suitable methods represent an important (and often difficult) part of the analysis of economic data.

Keywords: linear regression, assumptions, non-standard situations, robustness, diagnostics

Published: January 20, 2024  Show citation

ACS AIP APA ASA Harvard Chicago IEEE ISO690 MLA NLM Turabian Vancouver
Kalina, J. (2023). Statistical Method Selection Matters: Vanilla Methods in Regression May Yield Misleading Results. Inproforum17(1), 5-10. doi: 10.32725/978-80-7694-053-6.63
Download citation

References

  1. Assaf, A.G., & Tsionas, M. (2020). Correcting for Endogeneity in Hospitality and Tourism Research. International Journal of Contemporary Hospitality Management, 32, 2657-2675. Go to original source...
  2. Calderwood, L.U., & Soshkin, M. (2019). The Travel & Tourism Competitiveness Report 2019: Travel and Tourism at a Tipping Point. World Economic Forum, Geneva, https://www3.weforum.org/docs/WEF_TTCR_2019.pdf.
  3. Fox, J. (2019). Regression Diagnostics: An Introduction. 2. ed. Thousand Oaks: SAGE Publications.
  4. Gneiting, T., Wolffram, D., Resin, J., Kraus, K., Bracher, J., et al. (2023). Model Diagnostics and Forecast Evaluation for Quantiles. Annual Review of Statistics and Its Application, 10, 597-621. Go to original source...
  5. Greene, W.H. (2012). Econometric Analysis. 7. ed. New York: Pearson.
  6. Hlubinka, D., & Šiman M. (2013). On Elliptical Quantiles in the Quantile Regression Setup. Journal of Multivariate Analysis, 116, 163-171. Go to original source...
  7. Jurečková, J., Picek, J., & Schindler, M. (2019). Robust Statistical Methods with R. 2. ed. Boca Raton: CRC Press. Go to original source...
  8. Kalina, J., & Rensová, D. (2015). How to Reduce Dimensionality of Data: Robustness Point of View. Serbian Journal of Management, 10, 131-140. Go to original source...
  9. Kalina, J., & Tichavský, J. (2020). On Robust Estimation of Error Variance in (Highly) Robust Regression. Measurement Science Review, 20, 6-14. Go to original source...
  10. Kalina, J., Vašaničová, P., & Litavcová, P. (2019). Regression Quantiles under Heteroscedasticity and Multi-collinearity: Analysis of Travel and Tourism Competitiveness. Ekonomický časopis/Journal of Economics, 67, 69-85.
  11. Kalina, J., & Vidnerová P. (2022). Least Weighted Squares Quantiles Reveal how Competitiveness Contributes to Tourism Performance. Czech Journal of Economics and Finance, 72, 150-171.
  12. Koenker, R. (2017). Quantile Regression: 40 Years on. Annual Review of Economics, 9, 155-176. Go to original source...
  13. Li, Z., & Yao, J. (2019). Testing for Heteroscedasticity in High-Dimensional Regressions. Econometrics and Statistics, 9, 122-139. Go to original source...
  14. Liu, X., Long, W., Peng, L., & Yang, B. (2023). A Unified Inference for Predictive Quantile Regression. Journal of the American Statistical Association. In press. Go to original source...
  15. Long, J.D., & Teetor, P. (2019). R Cookbook. Proven Reciples for Data Analysis, Statistics & Graphics. 2. ed. Sebastopol: O'Reilly.
  16. Roser, M. (2020). https://ourworldindata.org/tourism.
  17. Saleh, A., Picek, J., & Kalina, J. (2012). R-estimation of the Parameters of a Multiple Regression Model with Measurement Errors. Metrika, 75, 311-328. Go to original source...
  18. Sasaki, Y., & Wang, Y. (2023). Diagnostic Testing of Finite Moment Conditions for the Consistency and Root-n Asymptotic Normality of the GMM and M Estimators. Journal of Business & Economic Statistics, 41, 339-348. Go to original source...
  19. Víšek, J.Á. (2011). Consistency of the Least Weighted Squares under Heteroscedasticity. Kybernetika, 47, 179-206.
  20. White, H. (1980). A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroscedasticity. Econometrica, 48, 817-838. Go to original source...