open_content.gifUntitled-1.pngTitle.png

 

 

 

This site gives a number of tools for teaching introductory statistics at university level. These tools support a particular philosophy on teaching statistics which has been built up over some sixty combined years of statistics teaching experience! Over the past five years this underlying philosophy in tandem with the associated spreadsheet modules has been introduced at UCT in one of the largest first year courses, STA1000F/S, involving some 2000 students per year.

 

Our observations over the years have led us to conclude that teaching statistics via theory followed by apparent application (in the form of a paper exercise involving manipulation of theory) can give misleadingly good students grades. In fact, students are often seen to achieve remarkably good grades at relatively advanced levels of statistics but, as some gentle probing at postgraduate levels has revealed, in many instances fail to understand the foundational concepts of their discipline! In addition, our current teaching methods need to take cognisance of the fact that, for a variety of reasons, a large portion of our students are mathematically poorly prepared for first year university; and many students have to cope with learning in a language other than their mother tongue.

 

Adobe SystemsThe Modules

 

Note that these are written for MS Excel 2007 (or later versions). The modules roughly follow chapters in the first year statistics textbook, Introstat (LG Underhill) and essentially support and supplement that book. They are to a significant extent self explanatory for those with some knowledge of statistics and simulation. In cases where the modules require additional explanation, the spreadsheets themselves have accompanying notes. These modules are essentially crafted as teaching tools and the experience of first year students would be of the lecturer leading the students through the simulations at an appropriate pace, allowing plenty of opportunity for discussion and clarification. Lab based tutorials also support this process.

 

Adobe SystemsDescription of Available Modules

 

1.    Module 1: We discuss the question: - What are random numbers and what is a statistical distribution? We introduce the Uniform distribution, the most simple of statistical distributions.

 

2.    Module 2: In order to test a claim that a set of 5 mice have been taught how to navigate a maze, we explore the chances of different numbers of successful mice, under the assumption that the mice are making purely random choices. This supports a discussion of how the Binomial distribution arises.

 

3.    Module 3: We sketch the following scenario: a stretch of road is surveyed to determine the number of potholes. Unfortunately information on the individual positions of the potholes is lost but the total number of potholes is correctly recorded. We manage to salvage the situation from embarrassment by employing the Poisson distribution to good effect!

 

4.    Module 4: The same situation pertains as in module 3; however we focus our efforts on the chances of finding stretches of road without potholes, and discover the exponential distribution.

 

5.    Module 5: We explore the magical effects of averaging and find a surprising commonality across the distributions of averages arising from a multitude of different situations (give or take a few assumptions they all seem to converge to that bell shaped curve?).

 

6.    Module 6: We consider hypothesis testing and attempt to pin down the chances that we’re wrong when we think we’re right…or is it right when we think we’re wrong? Oh yes, we also look at statistical power…do we have enough information to attempt to adjudicate between these two hypotheses anyway?

 

7.    Module 7: We find a relationship between two variables and express this as a mathematical straight line formula. But the actual line we get depends on the sample we have. We explore how certain we can be that we know anything about the relationship between our two variables at all.

 

Adobe SystemsMore about our teaching:

 

Our approach has been to, as far as possible, make the teaching of statistics visual and experiential. Keeping it visual is of particular relevance where there are language challenges and the experiential approach aims to address the undesirable ‘black box’ of theory effect. A key part of this approach is to provide students with tools whereby they can interrogate theory using a ‘what if’ approach.

 

The basis of our teaching philosophy is to shift from a pen-and-paper/chalk-and-board paradigm to a “virtual statistics laboratory”, in the form of a spreadsheet platform.

Teaching mathematical and statistical principles through a spreadsheet platform offers significant advantages, particularly for those with weak grounding in basic algebra. The structuring of a spreadsheet develops a general algebraic way of thinking as the process requires skill in expressing numerical relationships using algebraic notation. However, the richest feature that a spreadsheet offers to the teacher of statistics is its ability to show how one can mimic the process of repeated statistical experiments. By simulating statistical sampling one can elucidate a range of subtle and often misunderstood ideas which are central to basic statistical knowledge, such as those of randomness and statistical distributions.

 

We argue for a two-stage approach in which statistical understanding is built by initially empowering students to use simple spreadsheet operations in MS Excel, followed up by the use of more sophisticated statistical simulation tools in a Visual Basic for Applications (VBA) programming environment. Teaching sessions are built around a set of structured spreadsheet exercises as well as a suite of VBA-based simulations which demonstrate the concept of random variation, as well as show how statistical tools can be used to explore the concept of uncertainty.

 

At this stage we will list and discuss the currently developed suite of VBA simulation programmes used at first year level and offered on this site. Note that these are written for MS Excel 2007 (or later versions). The modules roughly follow chapters in the first year statistics textbook, Introstat (LG Underhill) and essentially support and supplement that book. They are to a significant extent self explanatory for those with some knowledge of statistics and simulation. In cases where the modules require additional explanation, the spreadsheets themselves have accompanying notes. These modules are essentially crafted as teaching tools and the experience of first year students would be of the lecturer leading the students through the simulations at an appropriate pace, allowing plenty of opportunity for discussion and clarification. Lab based tutorials also support this process.