Kakoj Programmoj Mozhno Razdetj Devushku Na Foto
Latinskaya cypochka glubokii minet za dengi potomu chto ona ne hochet rabotat v mestnom magazine • published on.
Data transformation is a powerful tool when the data don't look like forming a normal distribution. The idea of data transformation is that you convert your data so that you can assume the normality and use parametric tests. To determine whether we need any data transformation, we need to check the normality of the data.
Although there are several statistical methods for checking the normality, what you should do is to look at a histogram and QQ-plot, and then run a test for checking the normality. You also should read the section for the differences of the two statistical methods explained in this page.
One important point of data transformation is that you must defend that your data transformation is legitimate. You cannot do arbitrary data transformation so that you can get results you want to get. Make sure you clarify why you do data transformation and why it is appropriate. We prepare data by using a random function. To be able to reproduce the results quickly, we set the seed for the random functions.
Set.seed(111) data_normal. Another way to visually investigate whether data forms the normal distribution is to draw a Q-Q plot. A Q-Q plot shows the mapping between the distribution of the data and the ideal distribution (the normal distribution in this case). Let's take a look at it.
Layout(matrix(c(1,2), nrow=1)) qqnorm(data_normal) qqline(data_normal) qqnorm(data_exp) qqline(data_exp) layout(1) If your data are close to the normal distribution, most of the data points should be close to the line. So obviously, some of the data points in data_exp are far from the line, which means that it is less likely that data_exp were taken from the normal distribution. Statistical tests for normality.
One common test for checking the normality is Shapiro-Wilk test. This test works well even for a small sample size, so generally you just need to use this. The null hypothesis of Shapiro-Wilk test is that the samples are taken from a normal distribution. So, if the p value is less than 0.05, you reject the hypothesis, and thinks that the samples are not taken from a normal distribution.
In R, you just need to use shapiro.test() function to do Shapiro-Wilk test. Shapiro.test(data_normal) And you get the result.
Shapiro-Wilk normality test data: data_normal W = 0.9737, p-value = 0.8302 In this case, you can still assume the normality. Let's try the same test with data_exp.
Shapiro.test(data_exp) Shapiro-Wilk normality test data: data_exp W = 0.6804, p-value = 2.269e-05 So, we reject the null hypothesis, and the samples are not considered to be taken from a normal distribution. Thus, you need to do data transformation or use a non-parametric test. Kolmogorov-Smirnov test. Another test you can use for checking the normality is Kolmogorov-Smirnov test.