Student research opportunities

Spatio-temporal forecast using non-Gaussian processes

Project Code: CECS_855

This project is available at the following levels:
PhD
Please note that this project is only for higher degree (postgraduate) applicants.

Keywords:

non-Gaussian processes, forecast

Supervisor:

Dr Warren Jin

Outline:

Gaussian processes are well understood and widely used by statistics, machine learning and scientific communities because of their stability and relatively computational and theoretical tractability (see e.g., Cressie and Wikle 2011, Rasmussen and Williams 2006). However, for a wide range of environmental data, such as daily precipitation, monthly rainfall, pollutant concentrations, pollen, and soil moisture, Gaussian spatio-temporal models cannot reasonably be fitted to the observations, not mention the forecast (see e.g., Kokic et al (2014, 2015)).

This project will develop non-Gaussian spatio-temporal models for data that may be zero inflated, skewed, and/or long-tailed. One direction is to transform a Gaussian process in a way that fits observations, with the potential use of some kind of link functions, like those in generalised linear regression. Care must be taken in the spatial prediction and/or temporal projection step as the covariance function in the transformed space is different, actually biased, from the one in the original space. Computation efficiency will become another issue for normally very large environmental data sets. Another direction is to assume that the spatio-temporal environmental data follow specific processes such as t-process or Gamma processes. Challenges here will be around theoretical development of the models, appropriate covariance functions, and efficient computation (such as reduced rank approximation, tapering, Gaussian predictive processes).

Goals of this project

The project will develop sophisticated models based on non-Gaussian processes or asymptotic Gaussian processes, and implement associated software. These developed techniques are applicable to various environmental problems such as daily precipitation projection, extreme weather modelling, remote sensed data, climate change attribution, and so on. It will also impact these important areas by combining sophisticated statistical modelling techniques with modern computation techniques.

Requirements/Prerequisites


  • Applicants are expected to have a major in statistics/mathematics, or computer science.

  • Strong interest in environmental problems

  • Preferably with strong background in statistical machine learning or statistical computation.

  • Preferably with excellent programming skills (R, MatLab or C/C++)

Student Gain

A student working in this project can expect

  • to learn state-of-art of statistical modelling and machine learning techniques

  • to be involved in developing cutting-edge techniques to handle real-world environmental challenges with great impact;

  • Supplementary PhD scholarship available from CSIRO $15000 per year for three years, subject to a separate application to CSIRO


Background Literature


  • Gaussian Processes for Machine Learning. Carl Edward Rasmussen and Christopher K. I. Williams. MIT Press, 2006. ISBN-10 0-262-18253-X.


  • Porcu et al. (eds.), Advances and Challenges in Space-time Modelling of Natural Events. Springer, 2012.

  • Cressie, N., T. Shi, and E. L. Kang (2010), Fixed Rank Filtering for Spatio-Temporal Data, Journal of Computational and Graphical Statistics, 19(3), 724-745, DOI 10.1198/jcgs.2010.09051;

  • Khandoker Shuvo Bakar, Philip Kokic and Huidong Jin (2015). “A spatio-dynamic model for assessing frost risk in south-eastern Australia.” Journal of the Royal Statistical Society: Series C (Applied Statistics). In press

  • P. Kokic, H. Jin, K.S. Bakar, R. Shah and S. Crimp (2014). “Statistical forecasting of monsoon rainfall for several case study sites in India and Sri Lanka.” Technical report

  • P. Kokic et al (2015). "Modelling Rainfall Using a Censored Generalised Hyperbolic Model." Under review.

  • Reinhard Furrer and Stephan R. Sain. Spatial model fitting for large datasets with applications to climate and microarray problems. Statistics and Computing. Volume 19, Number 2 (2009), 113-128, DOI: 10.1007/s11222-008-9075-x

Links

co-supervisor: Dr. Phil Kokic

Contact:



Updated:  4 September 2015 / Responsible Officer:  JavaScript must be enabled to display this email address. / Page Contact:  JavaScript must be enabled to display this email address. / Powered by: Snorkel 1.4