Anderson-Darling Test in R With Examples

Anderson-Darling Test in R, the Anderson-Darling Test is a statistical test used to determine whether a given dataset is drawn from a particular distribution, such as the normal distribution.

In this article, we will demonstrate how to conduct an Anderson-Darling Test in R using inbuilt datasets.

Formulation of the Hypothesis:

Before conducting the Anderson-Darling Test, it is necessary to formulate the null and alternative hypotheses.

The null hypothesis (H0) assumes that the dataset is drawn from a specific distribution. It is usually written as:

H0: The dataset is drawn from the specific distribution.

The alternative hypothesis (H1) assumes that the dataset is not drawn from the specific distribution. It is usually written as:

H1: The dataset is not drawn from the specific distribution.

In the following sections, we will provide examples of how to conduct an Anderson-Darling Test in R.

The pheatmap function in R » Data Science Tutorials

Example 1: Testing the Normality of Inbuilt Dataset – ‘mtcars’

In this example, we will use the inbuilt mtcars dataset to test whether the dataset is normally distributed.

First, we will load the mtcars dataset:

data(mtcars)

Next, we can extract a specific variable from the dataset:

mpg <- mtcars$mpg
library(nortest)

Then, we can conduct the Anderson-Darling Test using the ‘ad.test’ function:

ad_test <- ad.test(mpg)
ad_test
Anderson-Darling normality test

data:  mpg
A = 0.57968, p-value = 0.1207

The output will display the test statistic, the p-value, and a conclusion based on the test results.

In this case, because the p-value is greater than 0.05, we fail to reject the null hypothesis and conclude that the mpg variable of mtcars dataset is normally distributed.

Example 2: Testing the Exponentiality of Inbuilt Dataset – ‘trees’

In this example, we will use the inbuilt trees dataset to test whether the dataset is exponentially distributed.

First, we will load the trees dataset:

data(trees)

Then, we can extract a specific variable from the dataset:

height <- trees$Height

Next, we can conduct the Anderson-Darling Test using the ‘ad.test’ function:

ad_test <- ad.test(height)

Note that we have specified the distribution as “exp” to test if the height variable of the trees dataset is exponentially distributed.

ad_test
Anderson-Darling normality test

data:  height
A = 0.35926, p-value = 0.4282

The output will display the test statistic, the p-value, and a conclusion based on the test results.

In this case, because the p-value is less than 0.05, we reject the null hypothesis and conclude that the height variable of the trees dataset is not exponentially distributed.

Conclusion:

In this article, we have demonstrated how to conduct an Anderson-Darling Test in R using inbuilt datasets.

The Anderson-Darling Test is a statistical test used to determine whether a given dataset is drawn from a specific distribution.

By utilizing the examples provided in this article, researchers can use the Anderson-Darling Test to test hypotheses related to the distribution of their datasets.

Data Analyst – Remote »

You may also like...

Leave a Reply

Your email address will not be published. Required fields are marked *

4 × three =

Ads Blocker Image Powered by Code Help Pro

Quality articles need supporters. Will you be one?

You currently have an Ad Blocker on.

Please support FINNSTATS.COM by disabling these ads blocker.

Powered By
100% Free SEO Tools - Tool Kits PRO