# Kanta lab association results v1

The first set of Kanta lab value association results, including GWAS, finemap, autoreporting, and ldsc (heritability and genetic correlations) results for all unique lab measurements (OMOPID) with at least 1,000 individuals with at least one measurement (n = 383 lab measurements) can be found from green library: `/finngen/library-green/lab_values/gwas_release_2024_12_1/`

These results are in pheweb browser: <https://kanta.finngen.fi>

### QC and data preparation

(see `data_documentation/Kanta_labs_over1000_individuals_summary_table.txt` for further details on each lab measurement)

* Clear outliers were removed, if any (see Kanta\_labs\_over1000\_individuals\_summary\_table.txt for detailed outlier filters)
* The distributions of the measurements were looked individually, and analyzed as raw values as possible, with the following numbers:
  * 70 were analyzed as raw median values,
  * 120 were logarithm transformed,
  * 157 were normalized (inverse-rank), and
  * 36 were analyzed as binary (ever having measurement above a certain threshold or test outcome as ‘abnormal’ or ‘high’ vs never having measurement above a certain threshold or test outcome as ‘abnormal’ or ‘high’)
* Sample-wise median was calculated for all lab measurements that were analyzed as quantitative

### GWAS

* GWAS was performed for all 383 lab measurements using [REGENIE pipeline](/working-in-the-sandbox/running-analyses-in-sandbox/how-to-run-genome-wide-association-studies-gwas/how-to-run-gwas-using-regenie.md)
* Same covariates (age at end of follow-up or death, sex, genotyping batch and PC1-10) were used as in core endpoint GWASes
* Linear mixed model was used for lab measurements that were analyzed as quantitative, and logistic mixed model was used for binary phenotypes
* Summary statistics were filtered for variants with MAC >100 (for quantitative only) and imputation info score > 0.6

### FINEMAP

* All 383 summary statistics were finemapped using [finemap pipeline](/working-in-the-sandbox/running-analyses-in-sandbox/how-to-run-finemapping-pipeline.md)
  * GWS (p < 5E-8) regions were finemapped
  * MHC region was excluded

### Autoreporting

* For all finemapped (GWS regions, MHC region excluded) regions, autoreporting was performed using pipeline described here: <https://github.com/FINNGEN/autoreporting>

### LDSC (heritability estimates and genetic correlations)

* For all 383 lab measurements
  * Heritability estimates and
  * Genetic correlations with all R12 endpoints (2,469) were calculated using [LDSC pipeline](#ldsc-heritability-estimates-and-genetic-correlations)


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.finngen.fi/finngen-data-specifics/green-library-data-aggregate-data/core-analysis-results-files/kanta-lab-association-results-v1.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
