Title Evaluating the impact of a non-probability sample-based estimator in a linear combination with an estimator from a probability sample
Authors Čiginas, Andrius ; Krapavickaitė, Danutė ; Nekrašaitė-Liegė, Vilma
DOI 10.1177/0282423X251331346
Full Text Download
Is Part of Journal of official statistics.. Thousand Oaks, CA : SAGE Publications. 2025, vol. 41, iss. 2, p. 649-674.. ISSN 0282-423X. eISSN 2001-7367
Keywords [eng] propensity score ; variance estimation ; Poisson pseudo-sampling design ; super-population ; composite estimator
Abstract [eng] In this article, the estimators based on data from independent non-probability and probability samples are combined to estimate finite population parameters. Assuming that the values of the study variable are available in both samples, the integration of the non-probability and probability samples through a composite estimator of the population total is studied. The integration is done using a linear combination of the inverse probability weighted (IPW) estimator and a design-based estimator. By evaluating the variance of the former estimator, the randomness of the underlying non-probability sample is taken into account through the distribution of the estimated propensity scores. This approach is then compared with a variance estimator based on the asymptotic variance and with a bootstrap variance estimator. The proposed linear combination is not sensitive to the misspecification of the model for the propensity scores due to the incorporated estimator of the bias of the IPW estimator. The number of Lithuanian companies possessing websites is estimated in a simulation study. By combining the sample survey data and big voluntary sample data, the properties of the introduced estimators are demonstrated numerically.
Published Thousand Oaks, CA : SAGE Publications
Type Journal article
Language English
Publication date 2025
CC license CC license description