Analysis of missing data

Main page (in Portuguese)

“The most pressing task, in my opinion, is placing further emphasis on the general recognition and understanding, at a conceptual level, of the necessity of properly dealing with the missing-data mechanism, as part of our ongoing emphasis on the importance of the data collection process in any meaningful statistical analysis. The missing-data mechanism is in the blood of statistics, and it is the nastiest and the most deceptive cell, especially for nonstatisticians - why on earth should anyone be concerned with data that one does not even have?” Meng (2000)

“(Colonel Ross) Is there any other point to which you would wish to draw my attention?
(Holmes) To the curious incident of the dog in the night-time.
(Ross) The dog did nothing in the night-time.
  That was the curious incident!, remarked Sherlock Holmes.” Dawid e Dickey (1977)

 

Poleto, F.Z., Molenberghs, G., Paulino, C.D. and Singer, J.M. (2010). Inferential implications of over-parameterization: a case study in incomplete categorical data. Technical report RT-MAE-2010-04. Instituto de Matemática e Estatística, Universidade de São Paulo, Brazil.

Poleto, F.Z., Singer, J.M. and Paulino, C.D. (2010). A product-multinomial framework for categorical data analysis with missing responses. Submitted for publication. R code to reproduce the analyses of the manuscript.

Poleto, F.Z., Singer, J.M. and Paulino, C.D. (2010). Comparing diagnostic tests with missing data. To appear in Journal of Applied Statistics. R code to reproduce the analyses of the manuscript.

Poleto, F.Z., Singer, J.M. and Paulino, C.D. (2010). Missing data mechanisms and their implications on the analysis of categorical data. To appear in Statistics and Computing. doi: 10.1007/s11222-009-9143-x.

Singer, J.M., Poleto, F.Z. and Paulino, C.D. (2007). Catdata: software for analysis of categorical data with complete or missing responses. Actas de la XII Reunión Científica del Grupo Argentino de Biometría y I Encuentro Argentino-Chileno de Biometría.

Poleto, F.Z. (2007). Comandos (em R) para reproduzir as análises de exemplos do livro Análise de Dados Categorizados de Paulino e Singer (2006) [Commands (in R) to reprocuce the analyses of the examples of the book Analysis of Categorical Data by Paulino and Singer (2006), in Portuguese]. Manuscrito não publicado (Unpublished manuscript). Código R para reproduzir as análises do manuscrito (R code to reproduce the analyses of the manuscript).

Poleto, F.Z., Singer, J.M. and Paulino, C.D. (2007). Analyzing categorical data with complete or missing responses using the Catdata package. Unpublished vignette for the R package. Source code of the Catdata library of functions. R code to reproduce the analyses of the manuscript.

Poleto, F.Z., Singer, J.M. and Paulino, C.D. (2007). A product-multinomial framework for categorical data analysis with missing responses. Technical report RT-MAE-2007-07. Instituto de Matemática e Estatística, Universidade de São Paulo, Brazil.

Poleto, F.Z. (2006). Análise de dados categorizados com omissão (Analysis of categorical data with missingness, in Portuguese). Dissertação de mestrado (M.Sc. dissertation). Versão corrigida (corrected version). Instituto de Matemática e Estatística, Universidade de São Paulo, Brazil. Código R para reproduzir as análises do manuscrito (R code to reproduce the analyses of the manuscript): Ex. 1, Ex. 2 - parte 1, Ex. 2 - parte 2, Ex. 3, Ex. 4 e Ex. 5.

 

References

Dawid, A.P. and Dickey, J.M. (1977). Likelihood and bayesian inference from selectively reported data. Journal of the American Statistical Association 72, 845-850.

Meng, X.-L. (2000). Missing data: dial M for ???. Journal of the American Statistical Association 95, 1325-1330.

Paulino, C.D. and Singer, J.M. (2006). Análise de dados categorizados (Analysis of categorical data, in Portuguese). São Paulo: Edgard Blücher.

Main page (in Portuguese)


Frederico Zanqueta Poleto <frederico@poleto.com> 's home page. Last modified: Jul 04th, 2010.