Analysis covariates
This page has been last updated for R13.
Sandbox directory
Analysis covariates are available in the following Sandbox directory:
/finngen/library-red/finngen_R[RELEASE]/analysis_covariates
Data files
The analysis covariate file is a tab-separated, gzip-compressed text file that contains covariate and endpoint data for each sample. The file contains three sets of columns:
column 1: Sample ID
columns 2 to N: covariates including principal components, ~200 columns for R13
columns N+1 to N+1+number of endpoints: individual's phenotype status for each FinnGen endpoint
The covariate file does not contain FinnGen genotypes for individuals with non-Finnish ancestry. For more complete phenotype data see the phenotype files.
Often users will subset this file in R to run their own analyses and/or add additional analysis columns.
Some column descriptions:
Column name
Description
FINNGENID
Sample ID
AGE_AT_DEATH_OR_END_OF_FOLLOWUP
Age of sample at death or end of followup
batch
batch
n_var
Number of genotyped variants
chip
Chip used for genotyping
IS_AFFY
Whether the sample was genotyped using Affymetrix chip
IS_FINNGEN1_CHIP
Whether the sample was genotyped using Finngen v1 chip
IS_FINNGEN2_CHIP
Whether the sample was genotyped using Finngen v2 chip
IS_AFFY_*
Whether the chip genotypes were called using the specified version of the calling algorithm
AGE_AT_DEATH_OR_END_OF_FOLLOWUP2
AGE_AT_DEATH_OR_FOLLOWUP*AGE_AT_DEATH_OR_FOLLOWUP
BATCH*
Whether the sample was part of that genotyping batch. Can be used to control for batch-specific effects in analysis.
PC*
Individual's PCA value for that component
*_IRN
Inverse rank-normalized quantitative endpoints
For other columns, refer to the minimum extended phenotype and the endpoint data pages.
Further information
The covariate file is used for GWAS and other analyses. The following covariates are used in FinnGen's core GWAS analyses:
Age
Sex
First 10 principal components
Genotyping batch (Finngen 1 or 2 chip and legacy genotyping batch)
Note: This file is usually released a little later than the phenotype files as it needs the PCA results to be created.
Last updated
Was this helpful?