Students must complete four units as follows:
Choose one unit from:
This unit will be replaced by 300700 - Statistical Decision Making from 2010. This Level 1 unit introduces the basic concepts and techniques of statistics that are particularly relevant to problem solving in science and technology. It also provides a sound base for more advanced study in statistics in subsequent sessions. Topics include: presentation of data; descriptive statistics; the role of uncertainty in decision making; hypothesis testing; and simple linear regression.
This unit introduces students to various statistical techniques necessary in scientific endeavours. Presentation of the content will emphasize the correct principles and procedures for collecting and analysing scientific data, using a ‘hands-on’ approach. Topics include effective methods of gathering data, statistical principles of designing experiments, error analysis, describing different sets of data, probability distributions, statistical inference, non-parametric methods, and simple linear regression and correlation.
This level 100 unit introduces the basic concepts and techniques of statistics that are particularly relevant to problem solving in business. It also provides a sound base for more advanced study in statistics and forecasting in subsequent sessions. Topics include: presentation of data; descriptive statistics; the role of uncertainty in business decision making; hypothesis testing; and basic forecasting.
This Level 1 unit introduces students to various statistical techniques supporting the study of computing. Presentation of the content will emphasize the correct principles and procedures for collecting and analysing scientific data, using information and communication technologies. Topics include counting techniques, describing different sets of data, probability distributions, statistical inference, and simple linear regression and correlation.
And choose at least one of:
The unit builds on the basic statistical concepts introduced in first year, and also prepares students majoring in science or business for broader application of statistics. Topics include some common probability distributions; revision of hypothesis testing; analysis of categorical data; analysis of variance; simple and multiple linear regression analysis and correlation; some nonparametric methods; and fundamentals of time-series analysis.
Foundations of Statistical Modelling and Decision Making
This level 200 unit completes an introduction to the basic principles and concepts of statistics. There are two strands to the subject: distribution theory and statistical inference. The aim of the unit is to present a solid foundation in statistical theory and to provide an understanding of the relevance and importance of the theory in solving practical problems in the real world. The theoretical basis of the dual arms of classical statistical inference (estimation and hypothesis testing) is discussed, relating the probabilistic half of the course to the final objective: inference.
Database Design and Development
The main purpose of this unit is to provide students with an opportunity to gain a basic knowledge of database design and development, including data modelling methods and techniques, and database implementation using a database management system.
And choose at least one of:
Regression Analysis & Experimental Design
This unit covers linear regression analysis and experimental design, with analysis of variance being the primary analytical tool. Topics in linear regression are: the statistical model, the method of least squares, sampling distributions of least squares estimators, statistical inferences and testing hypotheses, methods for model building, detecting violations of the regression assumptions and their remedies, logistic regression, and Poisson regression. Topics in designed experiments are: completely randomised experiment, factorial experiment, randomised block, Latin square, random model, and mixed model. For each design the following aspects are covered: the statistical model, the normal equations and their solutions, sums of squares and basic algebraic identity, the ANOVA table and relevant tests, and treatment comparisons.
This Level 3 unit presents the basic techniques of time series analysis with emphasis on model identification, parameter estimation and diagnostic checking. The use of time series models for the process of forecasting future behaviour is discussed. In addition, alternative forecasting approaches, in particular econometric methods, are introduced and some guidelines for choosing an appropriate forecasting method are outlined.
This unit presents data mining as a well-structured standard process, namely the Cross Industry Standard Process for Data Mining (CRISP-DM). The unit emphasizes (1) the presentation of data mining as a process, (2) the “white box” approach, emphasizing an understanding of the underlying algorithmic structures, (3) the graphical approach, emphasizing exploratory data analysis, and (4) the logical presentation, flowing naturally from the CRISP-DM standard process and the set of data mining tasks. The unit gives insight into the data mining algorithms by first using small data sets, and then provides examples of applying the various algorithms to actual large data sets. Finally, it provides hands-on analysis problems, which offer an opportunity to apply acquired data mining expertise to solving real problems using large data sets.
Surveys and Multivariate Analysis
In the first half of this unit students gain an appreciation of survey methodology, including questionnaire design, as well as the application of sampling techniques. These include simple random sampling, stratification, the use of supplementary information, and cluster sampling. The second half of the unit covers the principal methods of multivariate data analysis: principal components, factor analysis, discriminant analysis, and cluster analysis.