Measurement, precision, accuracy
2024-09-11
Science is fundamentally about observation
We reason about the processes that could have lead to the observations we saw
Often “observation” means taking measurements and comparing measurements from different
Statistics is one of the main tools scientists use to make sense of observations
We use statistics because our observations are
As scientists or someone using the tools of science we must be aware of the sources of error (uncertainty) and variation in our observations
With the fish heart measurements:
We use randomisation to try to handle 🤷♂️
Because we see many sources of variation in the measurements we observe we replicate
Independent (loosely) means they provide new information
We take measurements from more than one (1) fish heart
To the extent possible we want to minimise the
Eliminate unneccessary sources of variation
Use a protocol and stick to it
Be as careful and consistent
Source: Ismay & Kim (2024) Modern Dive CC-BY-NC-SA
Source: xkcd CC-BY-NC
One of the fundamental estimates we make using statistics is of the typical value — what is the weight of the typical fish heart?
We can’t possibly sample all possible fish of relevance so we will estimate the value of this typical weight
One of the most important estimators is the arithmetic mean — AKA the average
\[ \overline{\text{weight}} = \frac{\sum_{i=1}^n \text{weight}_i}{n} \]
We add up the individual weights for each of the \(n\) fish hearts; \(i\) indicates which fish heart we are adding; \(n\) is the number of fish hearts in out sample
Statisticians refer to accuracy of estimators using the term bias
An accurate estimator is an unbiased estimator
bias is the difference between an estimator’s expected value and the true value of the parameter being estimated
We can measure the variation in our data using the sample variance or standard deviation
\[ s^2 = \widehat{\sigma}^2 = \frac{1}{n-1} \sum_{i = 1}^n \left( \text{weight}_i - \overline{\text{weight}} \right) \]
\[ s = \widehat{\sigma} = \sqrt{\frac{1}{n-1} \sum_{i = 1}^n \left( \text{weight}_i - \overline{\text{weight}} \right)} \]
Both measure the spread of the data around the typical value (the mean)
We can also assess the precision of our estimates
For the mean we can compute its standard error
The standard error of the mean weight of a fish heart is
\[ \widehat{\sigma}_{\overline{\text{weight}}} = \frac{\widehat{\sigma}}{\sqrt{n}} \]
where \(\widehat{\sigma}_{\overline{\text{weight}}}\) is the standard error of the mean fish heart; \(\widehat{\sigma}\) is the standard deviation of our sample of data; \(n\) is the number of indpendent observations
The points covered in these slides are things to be thinking about while