Data analysis and modeling of biological systems

lboe2112  2024-2025  Louvain-la-Neuve

Data analysis and modeling of biological systems
5.00 credits
24.0 h + 36.0 h
Q1
Language
English
Prerequisites
In order to successfully follow this course, you should be acquainted with the concept of Probability and the
rules of Probability calculus, the bases of statistical inference, the principles and practice of the classical methods for statistical analysis of continuous data (Regression, Analysis of Variance) and of discrete data (Contingency tables, Goodness of fit tests), and the use of a statistical software for applying the above.
Main themes
Taking into account the most frequently encountered needs of researchers in Biology, as well as the time
constraints, the course offers of two main modules : Linear Modeling, and Methods of Multivariate Analysis.
The examples presented are mainly drawn from researches in Ecology.
Learning outcomes

At the end of this learning unit, the student is able to :

1 The objectives are that, as a result of successfully attending this course, the students :

- Are aware of the necessity of planning any scientific experiment before it is started.
- Have practiced, in the frame of a personal scientific question, the principles of experimental design.
- Are able to review, choose, and apply knowingly the best adapted methods for modeling and analysing
data from their domain of expertise in Biology.
- Are able to set up a scientific experiment, to manage the data generated by this experiment, to analyse
them (usually with the help of a computer software), and to interprete critically the results.
- Have shown their ability to report a scientific experiment in a written document and through an oral
communication. These reports may be elaborated in groups of two or three students.
 
Content
Module 1 : Linear statistical modeling
Theoretical introduction into mixed and generalized models (6h); Practical sessions on R (14h);  Two case studies on mixed and generalized models (4h+2h).

Module 2 : Multivariate data exploration
This module details how to visualize and check multivariate data, how to summarize and combine a set of continuous variables into a lower number of variables through PCA (Principal Component Analysis), how to perform the PCA equivalent for categorical data(FCA, Factorial Correspondence Analysis), and how to unravel the links between two sets of continuous variables (Canonical Correlation Analysis). The teaching philosophy insists on the fact that statistics are tools and that the key skills the student should acquire is the expertise to choose the right tool for the job, how to parameterize it and interpret its results critically. Real examples from ecology will be used to illustrate clean but also more difficult cases, closer to real life.
Teaching methods
Lectures, seminars, and exercise sessions in a computer room. The student is encouraged to interactivity for all these activities.
Exercises : learn to solve a statistical problem. Find the appropriate analysis when faced with a problem, check the application conditions relating to the use of this analysis, perform the statistical test on the R software, interpret the results obtained and illustrate them.
Evaluation methods
The two modules will be evaluated separately, each module contributing 10/20 to the final score. As the final score must be an integer number, the sum of the two notes will be rounded up if both modules are passed (at least 5/10) and down if it is not the case.
Module 1 (Linear statistical modelling) :
Open book exam, including two exercises on LMM and GLM(M) on R (based on practical sessions and first seminar)  and one case study (based on second seminar).
Module 2 (Multivariate data analysis) :
Open book written exam consisting of multiple choice questions, open questions and practical solution of exercises with R software on a computer. The exam is carried out on Moodle, in a computer room on campus, unless health regulations require that the exam be taken at a distance.
Other information
A basic knowledge of the R software is required: the student is expected to be able to create and modify R-data sets independently and perform basic data management and statistical analysis procedures. If such knowledge is not acquired, the student must be trained autonomously in these skills, e.g. by means of the many resources available online for free.
Online resources
All resources are available on the Moodle website: visuals of the lectures and practical sessions, data sets and R scripts, links to additional resources and supporting books.
 
Teaching materials
  • Les visuels du cours et de support aux travaux pratiques / Course visuals and materials supporting the practical work
Faculty or entity


Programmes / formations proposant cette unité d'enseignement (UE)

Title of the programme
Sigle
Credits
Prerequisites
Learning outcomes
Master [120] in Biology of Organisms and Ecology

Master [60] in Biology