Modern Statistical Methods for Big Data and Big Questions in Genomics


Sunday January 10th, 2PM-5:30PM

  Course Description
Current research ambitions in genomics are growing, alongside the size of genomic datasets and the quantitative scientists’ big-data analysis capabilities. More and more big projects are being launched, aiming to answer various genomic and biomedical questions that could be as challenging as "how the human genome is transforming medicine". To ensure the success of such projects, the feasibility of the project goals must be evaluated given the resources available, including manpower, data, subject-matter knowledge, and statistical and computational methods.

In this short course, we will concentrate on introducing essential high-dimensional statistical tools, as well as data handling and critical thinking skills, in carrying out a large-scale genomic project. Two specific topics will be covered: 1) RNA-Seq data analysis - from normalization to higher-level analysis such as differential expression and cluster analysis and 2) gene networks - from network edge inference to locating gene functional sub-communities or modules in the network. In introducing these topics, we will additionally discuss steps for bringing qualitative biological and medical knowledge into the formulation of an appropriate statistical inference question, upon which all computation and statistical methodology work will be built.


  Instructors
  • Sandrine Dudoit, Professor of Biostatistics and Statistics, University of California, Berkeley
  • Haiyan Huang, Associate Professor of Statistics, University of California, Berkeley
  • Elizabeth Purdom, Assistant Professor of Statistics, University of California, Berkeley


  Who should attend
To be announced


  Short Course Agenda
To be announced


  Registration
Please refer to the Registration section for pricing and registration link.