Missing values

In statistics, missing values are a common occurrence. A value is missing when no data value is stored for a variable in a given observation. Several statistical methods have been developed to deal with this problem, and modern statistical packages have made handling missing values much easier. These packages often use maximum likelihood estimation to compute summary statistics, confidence intervals, and similar quantities.
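
As a minimal sketch of what "no data value is stored" can look like in practice, the following Python example marks missing entries with `None` (or a floating-point NaN) and counts them per variable. The data set, the variable names `age` and `income`, and the helper functions `is_missing` and `missing_counts` are all hypothetical and chosen only for illustration; real statistical packages use their own missing-value codes.

```python
import math

# Hypothetical data set: each dict is one observation.
# None marks a value that was never recorded.
observations = [
    {"age": 34, "income": 52000.0},
    {"age": None, "income": 61000.0},
    {"age": 45, "income": None},
    {"age": 29, "income": 48000.0},
]


def is_missing(value):
    """Treat None and floating-point NaN as missing (an assumed convention)."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def missing_counts(rows):
    """Return the number of missing values for each variable."""
    counts = {}
    for row in rows:
        for name, value in row.items():
            counts[name] = counts.get(name, 0) + (1 if is_missing(value) else 0)
    return counts


print(missing_counts(observations))
```

Counting missing values per variable is usually a first step before choosing one of the techniques listed below.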

Techniques for dealing with missing values

*Imputation (statistics)
*EM imputation, i.e. expectation-maximization imputation (see Expectation-maximization algorithm)
*Full information maximum likelihood estimation
*Indicator variable
*Listwise deletion/casewise deletion
*Pairwise deletion
*Mean substitution
*Mplus (statistical modeling software)
*MCAR (missing completely at random)
*Censoring (statistics)
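
Three of the simpler techniques in the list above (listwise deletion, pairwise deletion, and mean substitution combined with an indicator variable) can be sketched in a few lines of Python. This is an illustrative sketch under stated assumptions, not the procedure used by any particular statistical package: the data, the variable names `x`, `y`, `z`, and the function names are hypothetical, and `None` is assumed to mark a missing value.

```python
from statistics import mean

# Hypothetical data set: None marks a missing value.
rows = [
    {"x": 1.0, "y": 2.0, "z": 3.0},
    {"x": 2.0, "y": None, "z": 5.0},
    {"x": None, "y": 6.0, "z": 7.0},
    {"x": 4.0, "y": 10.0, "z": 9.0},
]


def listwise_deletion(rows):
    """Keep only observations in which every variable is observed."""
    return [r for r in rows if all(v is not None for v in r.values())]


def pairwise_complete(rows, a, b):
    """Keep the (a, b) pairs in which both variables are observed.

    Pairwise deletion drops an observation only from analyses that
    involve a variable it is missing, not from all analyses.
    """
    return [(r[a], r[b]) for r in rows if r[a] is not None and r[b] is not None]


def mean_substitution(rows, name):
    """Fill missing values of one variable with the mean of its observed values.

    An indicator variable "<name>_missing" records which values were filled in.
    The input rows are left unchanged.
    """
    observed_mean = mean(r[name] for r in rows if r[name] is not None)
    filled_rows = []
    for r in rows:
        filled = dict(r)
        filled[name + "_missing"] = 1 if r[name] is None else 0
        if r[name] is None:
            filled[name] = observed_mean
        filled_rows.append(filled)
    return filled_rows


complete_cases = listwise_deletion(rows)      # uses 2 of the 4 observations
xz_pairs = pairwise_complete(rows, "x", "z")  # uses 3 of the 4 observations
filled = mean_substitution(rows, "y")         # keeps all 4 observations
```

In this small example, listwise deletion discards half of the observations, while pairwise deletion retains more data for each individual pair of variables. Mean substitution keeps every observation but tends to understate the variability of the filled-in variable, which is one reason the model-based techniques in the list are often preferred.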

Further reading

* Little, R. J. A. & Rubin, D. B. "Statistical Analysis with Missing Data". John Wiley and Sons, New York, 2002.
* Acock, A. C. "Working With Missing Values". Journal of Marriage and Family, 2005, vol. 67, no. 4, pages 1012-1028.
* Van den Broeck, J., Argeseanu Cunningham, S., Eeckels, R. & Herbst, K. "Data Cleaning: Detecting, Diagnosing, and Editing Data Abnormalities". PLoS Med., October 2005, 2(10): e267. [http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=1198040]


Wikimedia Foundation. 2010.
