journal article Aug 16, 2022

Investigating the Association of a Sensitive Attribute with a Random Variable Using the Christofides Generalised Randomised Response Design and Bayesian Methods

DOI: 10.1111/rssc.12585
Abstract
In empirical studies involving sensitive topics, beyond estimating the population proportion with a sensitive characteristic, a question arises as to whether the distribution of an auxiliary random variable differs between subjects from the sensitive group and subjects from the non-sensitive group. That is, it is of interest to investigate the influence of the sensitive attribute on the auxiliary random variable. Finite mixture models are utilised to evaluate this association. A proposed Bayesian method based on data augmentation and Markov chain Monte Carlo is applied to estimate the unknown parameters of interest. The deviance information criterion and the marginal likelihood are employed to select a suitable model for describing the association of the sensitive characteristic with the auxiliary random variable. Simulation and real-data studies assess the performance of the proposed methodology and illustrate its applications.
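The data-augmentation idea in the abstract can be illustrated with a minimal sketch. This is not the authors' sampler. It assumes a Christofides-type response rule (a respondent draws a card k from 1..L with known probabilities p and reports L + 1 - k if they hold the sensitive attribute, k otherwise), a two-component normal mixture with unit variance for the auxiliary variable, a Beta(1, 1) prior on the sensitive proportion and N(0, 100) priors on the component means. The function names `simulate` and `gibbs` and all of these modelling choices are hypothetical and chosen only for illustration.

```python
import numpy as np

def simulate(n, pi, p, mu, rng):
    """Simulate reports d and auxiliary values y under the assumed design."""
    L = len(p)
    z = rng.random(n) < pi                            # latent sensitive status
    k = rng.choice(np.arange(1, L + 1), size=n, p=p)  # card drawn by each respondent
    d = np.where(z, L + 1 - k, k)                     # randomised report
    y = rng.normal(np.where(z, mu[1], mu[0]), 1.0)    # assumed unit-variance normals
    return d, y

def gibbs(d, y, p, n_iter=2000, burn=500, seed=1):
    """Gibbs sampler with data augmentation for (pi, mu0, mu1)."""
    rng = np.random.default_rng(seed)
    p = np.asarray(p, dtype=float)
    L, n = len(p), len(y)
    like1 = p[L - d]   # P(report d | sensitive) = p_{L+1-d}
    like0 = p[d - 1]   # P(report d | not sensitive) = p_d
    pi = 0.5
    mu = np.array([y.mean(), y.mean()])
    draws = []
    for it in range(n_iter):
        # Augmentation step: sample each latent status given current parameters.
        w1 = pi * like1 * np.exp(-0.5 * (y - mu[1]) ** 2)
        w0 = (1 - pi) * like0 * np.exp(-0.5 * (y - mu[0]) ** 2)
        z = rng.random(n) < w1 / (w0 + w1)
        # pi | z ~ Beta(1 + n1, 1 + n0) under the assumed Beta(1, 1) prior.
        n1 = int(z.sum())
        pi = rng.beta(1 + n1, 1 + n - n1)
        # mu_g | z, y is normal under the assumed N(0, 100) prior.
        for g, mask in enumerate((~z, z)):
            prec = mask.sum() + 1 / 100
            mu[g] = rng.normal(y[mask].sum() / prec, np.sqrt(1 / prec))
        if it >= burn:
            draws.append((pi, mu[0], mu[1]))
    return np.array(draws)
```

Calling `gibbs(d, y, p)` on data from `simulate` returns posterior draws whose columns are the sensitive proportion and the two component means. A clear gap between the two mean columns would suggest that the auxiliary variable is associated with the sensitive attribute; formal model choice, as the abstract notes, would rely on criteria such as DIC or the marginal likelihood, which this sketch does not compute.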

Details
Published: Aug 16, 2022
Vol/Issue: 71(5)
Pages: 1471-1502
Funding: Ministry of Science and Technology, Taiwan (Award: MOST-107-2118-M-035-004-MY2)
Cite This Article
Shen-Ming Lee, Truong-Nhat Le, Phuoc-Loc Tran, et al. (2022). Investigating the Association of a Sensitive Attribute with a Random Variable Using the Christofides Generalised Randomised Response Design and Bayesian Methods. Journal of the Royal Statistical Society Series C: Applied Statistics, 71(5), 1471-1502. https://doi.org/10.1111/rssc.12585