Skip to main navigation Skip to search Skip to main content

Improving the analysis of designed studies by combining statistical modelling with study design information

  • Uwe Thissen*
  • , Suzan Wopereis
  • , Sjoerd A.A. van den Berg
  • , Ivana Bobeldijk
  • , Robert Kleemann
  • , Teake Kooistra
  • , Ko Willems van Dijk
  • , Ben van Ommen
  • , Age K. Smilde
  • *Corresponding author for this work
  • TI Food and Nutrition
  • Unit Healthy Living
  • Leiden University Medical Centre
  • University of Amsterdam
  • Dutch nutrigenomics consortium of the Top Institute Food and Nutrition (TIFN)

Research output: Contribution to journalArticleAcademicpeer-review

30 Citations (Scopus)
7 Downloads (Pure)

Abstract

Background: In the fields of life sciences, so-called designed studies are used for studying complex biological systems. The data derived from these studies comply with a study design aimed at generating relevant information while diminishing unwanted variation (noise). Knowledge about the study design can be used to decompose the total data into data blocks that are associated with specific effects. Subsequent statistical analysis can be improved by this decomposition if these are applied on selected combinations of effects. Results: The benefit of this approach was demonstrated with an analysis that combines multivariate PLS (Partial Least Squares) regression with data decomposition from ANOVA (Analysis of Variance): ANOVA-PLS. As a case, a nutritional intervention study is used on Apoliprotein E3-Leiden (APOE3Leiden) transgenic mice to study the relation between liver lipidomics and a plasma inflammation marker, Serum Amyloid A. The ANOVA-PLS performance was compared to PLS regression on the non-decomposed data with respect to the quality of the modelled relation, model reliability, and interpretability. Conclusion: It was shown that ANOVA-PLS leads to a better statistical model that is more reliable and better interpretable compared to standard PLS analysis. From a following biological interpretation, more relevant metabolites were derived from the model. The concept of combining data composition with a subsequent statistical analysis, as in ANOVA-PLS, is however not limited to PLS regression in metabolomics but can be applied for many statistical methods and many different types of data.

Original languageEnglish
Article number52
JournalBMC Bioinformatics
Volume10
DOIs
Publication statusPublished - 7 Feb 2009
Externally publishedYes

Fingerprint

Dive into the research topics of 'Improving the analysis of designed studies by combining statistical modelling with study design information'. Together they form a unique fingerprint.

Cite this