# Whole exome sequencing (WES) data

## Datasets

#### **THL COHORTS:**

This data release contains **25,201 samples** in total from the FINRISK (n=12,203), Health2000 (n=4,618), and SUPER (n=8,380) collections. The sample is highly enriched for psychosis patients (the entire SUPER cohort), and subsets of FINRISK were selected as part of Alzheimer's and IBD sequencing projects

The FINRISK cohorts contain the following sub-projects:

* **FINRISK AD** (N=324) sequenced at Broad
* **FINRISK IBD** (N=945) sequenced at Broad
* **FINRISK controls** for SUPER (FR02, N=849) sequenced at Broad
* **FINRISK FinMetSeq** (N=10,085) sequenced at WashU

These data were extracted from the gnomAD v4 exome callset generated at the Broad Institute and have not undergone additional QC after gnomAD calling.

#### Data location and readme file

* Data in Sandbox: `/finngen/library-red/wes_gnomad_v4_min_QC`
* Readme file in Sandox: `/finngen/library-red/wes_gnomad_v4_min_QC/readme.txt`

####

#### **ADPKD KIDNEY TASK FORCE:**

This data is whole-exome sequencing on ADPKD patients selected by the kidney disease task force (n=629). The data has undergone full QC, with specific details found in the readme file (v1\_release\_readme\_18.03.2026.md) and accompanying QC report. The data has also been annotated with external data, including gnomAD v2 allele frequencies and predicted damaging scores, and in-silico potential damaging variants have been predicted.

**Important note:** Users of this dataset are kindly requested to provide appropriate acknowledgment of the Principal Investigator, Daniel Gordin, who led and financially supported its generation, as well as all relevant funding sources. For manuscripts or other outputs that make substantive use of the dataset, inclusion of the PI as a co-author is expected, in line with established scholarly norms and authorship guidelines. Please contact Daniel in such cases at **Daniel.Gordin (at) hus.fi.**

#### Data location and readme file

* Data in Sandbox: `/finngen/red/task_force_data/Kidney/ADPKD_WES/`
* Readme file in Sandox: `/finngen/red/task_force_data/Kidney/ADPKD_WES/ADPKD_WES/release_v1_18.03.2026.tar.gz/v1_release_readme_18.03.2026.md`


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.finngen.fi/finngen-data-specifics/red-library-data-individual-level-data/whole-exome-sequencing-wes-data.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
