This section shows how you can use the Chi-squared test to compare two parts of the dunes. Information about other statistical tests can be found here.

## Simpson’s Diversity Index

Simpson’s Diversity Index is a measure both of species richness (i.e. the number of different species present) and species evenness (i.e. how evenly distributed each species is).

- \(D\) = Simpson’s Diversity Index
- \(n\) = the number of individuals of each species
- \(N\) = the total number of individuals

### Worked example

A biologist is comparing species diversity at two sites within Traeth-y-goes sand dunes. Raw data from point quadrats is used. The number means the total hits per species per site.

Species | Mobile dune | Fixed dune |
---|---|---|

Marram grass | 10 | 4 |

Sea holly | 3 | 0 |

Sand fescue | 1 | 11 |

Saltwort | 2 | 0 |

Dandelion | 0 | 8 |

Calculate \(n\), \(n\times(n-1)\), \(N\) and \(D\) for each site

Mobile dune | Mobile dune | Fixed dune | Fixed dune | |
---|---|---|---|---|

Species | \(n\) | \(n\times(n-1)\) | \(n\) | \(n\times(n-1)\) |

Marram grass | 10 | 90 | 4 | 12 |

Sea holly | 3 | 6 | 0 | 0 |

Sand fescue | 1 | 2 | 11 | 110 |

Saltwort | 2 | 2 | 0 | 0 |

Dandelion | 0 | 0 | 8 | 56 |

TOTAL | 17 | 100 | 23 | 178 |

D = 17(16) / 100 | D = 23(22) / 178 | |||

D = 2.72 | D = 2.84 |

The larger the value of D, the higher the species diversity. A low value of D could be due to low overall species richness (like at the strand line) or to the dominance of one species (as in dune scrub). You could present your results as a graph

## Mann Whitney U test

Mann Whitney U is a statistical test that is used either to test whether there is a significant difference between the medians of two sets of data.

The Mann Whitney U test can only be used if there are at least 6 pairs of data. It does not require a normal distribution.

There are 3 steps to take when using the Mann Whitney U test

### Step 1. State the null hypothesis

There is no significant difference between _______ and _______

### Step 2. Calculate the Mann Whitney U statistic

\(U_1= n_1 \times n_2 + 0.5 n_2 (n_2 + 1)\;- ∑ R_2\) \(U_2 = n_1 \times n_2 + 0.5 n_1 (n_1 + 1)\;- ∑ R_1\)- \(n_1\) is the number of values of \(x_1\)
- \(n_2\) is the number of values of \(x_2\)
- \(R_1\) is the ranks given to \(x_1\)
- \(R_2\) is the ranks given to \(x_2\)

### Step 3. Test the significance of the result

Compare the value of U against the critical value for U at a confidence level of 95% / significance value of P = 0.05.

If U is equal to or smaller than the critical value (p=0.05) the REJECT the null hypothesis. There is a SIGNIFICANT difference between the 2 data sets.

If U is greater than the critical value, then ACCEPT the null hypothesis. There is NOT a significant difference between the 2 data sets.

### Worked example

A biologist is investigating whether managed areas of the sand dunes have greater species richness than unmanaged areas. A frame quadrat was used to record number of species at 10 randomly chosen points in each of two sites: one in a grazed section of the fixed dunes and one in an ungrazed section of the fixed dunes.

Here are the results.

Number of times out of 10 the pin hits vegetation | |
---|---|

Site 1 (Grazed fixed dunes) | Site 2 (Ungrazed fixed dunes) |

2 | 2 |

3 | 3 |

0 | 6 |

1 | 4 |

2 | 5 |

1 | 3 |

2 | 3 |

2 | 2 |

1 | 4 |

0 | 3 |

### Step 1. State the null hypothesis

There is no significant difference in species richness between the grazed and the ungrazed parts of the fixed dunes.

### Step 2. Calculate Mann Whitney U statistic

(a) Give each result a rank. Calculate the sum of the ranks for the two columns.

Number of times out of 10 the pin hits vegetation | |||
---|---|---|---|

Site 1 (Grazed fixed dunes) | Site 1 (Grazed fixed dunes) | Site 2 (Ungrazed fixed dunes) | Site 2 (Ungrazed fixed dunes) |

Number | Rank | Number | Rank |

2 | 8.5 | 2 | 8.5 |

3 | 14 | 3 | 14 |

0 | 1.5 | 6 | 20 |

1 | 4 | 4 | 17.5 |

2 | 8.5 | 5 | 19 |

1 | 4 | 3 | 14 |

2 | 8.5 | 3 | 14 |

2 | 8.5 | 2 | 8.5 |

1 | 4 | 4 | 17.5 |

0 | 1.5 | 3 | 14 |

(b) Calculate \(∑R_1\) and \(∑R_2\)

\(∑R_1\) is the sum of the ranks in the first column (deciduous woodland) = 63

\(∑R_2\) is the sum of the ranks in the first column (evergreen woodland) = 147

\(n_1\) = 10 and \(n_2 = 10\)

(c) Calculate \(U_1\) and \(U_2\)

\(U_1= 10 \times 10 + 0.5 \times 10 \times (10 + 1)\;- 63 = 92\) \(U_2= 10 \times 10 + 0.5 \times 10 \times (10 + 1)\;- 147 = 8\)### Step 3. Test the significance of the result

In this example, \(U_1= 92\) and \(U_2 = 8\)

U is the smaller of the two values, so U=8

The critical value at p=0.05 significance level for \(n_1\)=10 and \(n_2=10\) is 23. Since our calculated value of 8 < 23, the null hypothesis can be rejected.

In conclusion, there is a significant difference in species richness between the grazed and the ungrazed parts of the fixed dunes.