# Data Releases 2021

**27 December 2021**

* Finngen R8 service sector detailed data
* These data files contain:
  * Information about service sector, specialty and contact type from the Hilmo inpatient and outpatient register, and primary health care register
  * Information about drug reimbursement costs from the kela drug purchase register.
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/service\_sector\_detailed\_1.0/
* Detailed paths to library red data from catalog:
  * /finngen/library-red/finngen\_R8/catalog/catalog.txt

**20 December 2021**

* FinnGen R8 vaccination register data (version 2.0)
  * The data contains vaccination data of 316 551 FinnGen participants.
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/vaccination\_register\_2.0/
* Detailed paths to library red data from catalog:
  * /finngen/library-red/finngen\_R8/catalog/catalog.txt

**20 December 2021**

* infectious disease register data (corona, version 3.0)
  * The data contains 7058 (cumulative number) corona virus positive FinnGen participants.
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/infectious\_disease\_register\_corona\_3.0/
* Detailed paths to library red data from catalog:
  * /finngen/library-red/finngen\_R8/catalog/catalog.txt

**20 December 2021**

* Lists of significant coding variant associations for all analyzed endpoints in FinnGen R8
* Data in the green library:
  * Data location in Google cloud: gs\://finngen-production-library-green/finngen\_R8/finngen\_R8\_analysis\_data/summary\_stats/coding\_variants/
  * Data location in Sabox: /finngen/library-green/finngen\_R8/finngen\_R8\_analysis\_data/summary\_stats/coding\_variants/
* Documentation in the green library:
  * Data location in Google cloud: gs\://finngen-production-library-green/finngen\_R8/finngen\_R8\_analysis\_data/summary\_stats/coding\_variants/README
  * Data location in Sabox: /finngen/library-green/finngen\_R8/finngen\_R8\_analysis\_data/summary\_stats/coding\_variants/README

**17 December 2021**

* FinnGen R8 chip data (version 1.0)
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/chipd\_1.0/
* Detailed paths to library red data from catalog:
  * /finngen/library-red/finngen\_R8/catalog/catalog.txt

**29 November 2021**

* FinnGen release 8 and UKBB meta-analysis results
* Data in green library:
  * Data location in Google cloud: gs\://finngen-production-library-green/finngen\_R8/finngen\_R8\_analysis\_data/ukbb\_meta/
  * Data location in Sabox: /finngen/library-green/finngen\_R8/finngen\_R8\_analysis\_data/ukbb\_meta/
* Documentation in green library:
  * Data location in Google cloud: gs\://finngen-production-library-green/finngen\_R8/finngen\_R8\_analysis\_data/ukbb\_meta/readme
  * Data location in Sabox: /finngen/library-green/finngen\_R8/finngen\_R8\_analysis\_data/ukbb\_meta/readme

**26 November 2021**

* SISuv4 LDstore correlation (BCOR) files
* Data in green library:
  * Data location in Google cloud: gs\://finngen-production-library-green/imputation\_panel/v4/LD/
  * Data location in Sabox: /finngen/library-green/imputation\_panel/v4/LD/
* Documentation in green library:
  * Data location in Google cloud: gs\://finngen-production-library-green/imputation\_panel/v4/LD/README.md
  * Data location in Sabox: /finngen/library-green/imputation\_panel/v4/LD/README.md

**24 November 2021**

* FinnGen R8 parental causes of death data (version 1.0)
* A description for this data can be found in [Other registry data files](/finngen-data-specifics/red-library-data-individual-level-data/what-phenotype-files-are-available-in-sandbox-1/other-registers.md).
* Number of unique FINNGENIDs: 262 248
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/parental\_causes\_of\_death\_1.0/
* Detailed paths to library red data from catalog:
  * /finngen/library-red/finngen\_R8/catalog/catalog.txt

**23 November 2021**

* FinnGen R8 visual impairment data (version 1.0)
* This data contains information from the Finnish Register of Visual impairment. Data description can be found in [Other registry data files](/finngen-data-specifics/red-library-data-individual-level-data/what-phenotype-files-are-available-in-sandbox-1/other-registers.md).
* Number of unique FINNGENIDs: 2401
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/visual\_impairment\_register\_1.0/
* Detailed paths to library red data from catalog:
  * /finngen/library-red/finngen\_R8/catalog/catalog.txt

**15 November 2021**

* R8 core analyses:
* FinnGen R8 GWAS analysis data
  * Updated version of regenie
  * These files contain the summary statistics for R8 core endpoints as well as manhattan plot images
  * Data in green library:
    * /finngen/library-green/finngen\_R8/finngen\_R8\_analysis\_data/summary\_stats
* FinnGen R8 Fine-mapping analysis data
  * These files contain the finemapping results for R8 core endpoints, performed using SUSIE and FINEMAP
  * Data in green library:
    * /finngen/library-green/finngen\_R8/finngen\_R8\_analysis\_data/finemap
  * Documentation in green library:
    * /finngen/library-green/finngen\_R8/finngen\_R8\_analysis\_documentation/
* FinnGen R8 Autoreporting data
  * These files contain the autoreporting summaries created from finemapping & GWAS summary statistic data
  * Data in green library:
    * /finngen/library-green/finngen\_R8/finngen\_R8\_analysis\_data/autoreporting
  * Documentation in green library:
    * /finngen/library-green/finngen\_R8/finngen\_R8\_analysis\_documentation/
* FinnGen R8 Colocalization data
  * Colocalization between FinnGen SUSIE fine-mapping and multiple other resources
  * Data and documentation in green library:
    * /finngen/library-green/finngen\_R8/finngen\_R8\_analysis\_data/colocalization
* This data is also available for browsing in Pheweb: <https://results.finngen.fi>

**15 November 2021**

* The covariates and endpoints used in R8 core analyses.
* R8\_COV\_PHENO\_V4\_1.txt.gz and R8\_COV\_PHENO\_V4\_1.FID.txt.gz contain the same data with different IDs
* Data location in Sandbox:
  * /finngen/library-red/finngen\_R8/analysis\_covariates/finngen\_R8\_cov\_1.0.txt.gz
  * /finngen/library-red/finngen\_R8/analysis\_covariates/finngen\_R8\_COV\_PHENO\_V4\_1.txt.gz
  * /finngen/library-red/finngen\_R8/analysis\_covariates/finngen\_R8\_COV\_PHENO\_V4\_1.FID.txt.gz

**15 November 2021**

* Finngen R8 cluster plot data
* These data files contain genotype intensities per variant for a subset of samples that are used in cluster plots (released in the green library).
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/cluster\_plot\_1.0/
* See the detailed path of this data from manifest:
  * /finngen/library-red/finngen\_R8/cluster\_plot\_1.0/manifest.txt

**4 November 2021**

* FinnGen R8 PRS data
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/prs\_1.0/
* Detailed paths to library red data from catalog:
  * /finngen/library-red/finngen\_R8/catalog/catalog.txt

**25 October 2021**

* FinnGen R8 kidney disease register data
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/kidney\_disease\_register\_1.0/
* Detailed paths to library red data from catalog:
  * /finngen/library-red/finngen\_R8/catalog/catalog.txt

**20 October 2021**

* GRM, bgen chunks, and analysis covariates for custom analyses
* Data and documentation locations in Sandbox:
  * library-red/finngen\_R8/grm\_1.0/
  * library-red/finngen\_R8/bgen\_1.0\_20k\_chunks
  * library-red/finngen\_R8/analysis\_covariates/

**13 October 2021**

* FinnGen R8 phenotype data (version 4.0)
* This data has been created with an updated version of Endpointter and new definition files. These changes corrected the controls of some endpoints.
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/phenotype\_4.0
* Detailed paths to library red data from catalog:
  * /finngen/library-red/finngen\_R8/catalog/catalog.txt

**7 October 2021**

* tsv file mapping alleles between SISuv3 and SISuv4 panel released to green library
* The file is available in both v3 and v4 panel subfolders (the file is identical in both locations)
* Location in v3 panel subfolders
  * in Google cloud: gs\://finngen-production-library-green/imputation\_panel/v3/annotation/map\_v3\_v4\_alleles\_v2.tsv
  * in Sandbox: /finngen/library-green/imputation\_panel/v3/annotation/map\_v3\_v4\_alleles\_v2.tsv
* Location in v4 panel subfolders
  * in Google cloud: gs\://finngen-production-library-green/imputation\_panel/v4/annotation/map\_v3\_v4\_alleles\_v2.tsv
  * in Sandbox: /finngen/library-green/imputation\_panel/v4/annotation/map\_v3\_v4\_alleles\_v2.tsv

**7 October 2021 (edit: Aug 2024 - meta-analysis results with Estonian biobank are no longer available)**

* The meta-analysis results between FinnGen (release 7), UKBB and Estonia Biobank for 50 phenotypes have been released in the green library:
  * Data location in Google cloud: gs\://finngen-production-library-green/finngen\_R7/finngen\_R7\_analysis\_data/ukbb\_estbb\_meta/
  * Data location in Sandbox: /finngen/library-green/finngen\_R7/finngen\_R7\_analysis\_data/ukbb\_estbb\_meta/
* The results include summary statistics from meta-analysis (including leave-one-out results) and autoreporting results. Please see the readme for more information:
  * Readme location in Google cloud: gs\://finngen-production-library-green/finngen\_R7/finngen\_R7\_analysis\_data/ukbb\_estbb\_meta/README.md
  * Readme location in Sandbox: /finngen/library-green/finngen\_R7/finngen\_R7\_analysis\_data/ukbb\_estbb\_meta/README.md

**6 October 2021**

* Finnish nationwide breast and cervical cancer screening data (cancer\_screening\_1.0) and
* Detailed version of Finnish cancer register data (cancer\_detailed\_1.0)
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/cancer\_screening\_1.0
  * /finngen/library-red/finngen\_R8/cancer\_detailed\_1.0
* Detailed paths to library red data from catalog:
  * /finngen/library-red/finngen\_R8/catalog/catalog.txt

**4 October 2021**

* FinnGen R8 imputed STR data (version 1.0)
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/imputed\_str\_1.0/
* Detailed paths to library red data from catalog:
  * /finngen/library-red/finngen\_R8/catalog/catalog.txt

**30 September 2021**

* FinnGen R8 infectious disease register corona data (version 2.0)
* Data contains 5372 (cumulative number) corona virus positive FinnGen participants
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/infectious\_disease\_register\_corona\_2.0/
* Detailed paths to library red data from catalog:
  * /finngen/library-red/finngen\_R8/catalog/catalog.txt

**29 September 2021**

* FinnGen R8 vaccination register data (version 1.0)
* Vaccination data of 249 066 FinnGen participants
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/vaccination\_register\_1.0/
* Detailed paths to library red data from catalog:
  * /finngen/library-red/finngen\_R8/catalog/catalog.txt

**23 September 2021**

* FinnGen R8 reproductive history data (version 1.0)
* Reproductive history of 151 109 FinnGen mother participants
* The data combines information from population register (DVV) (since 1953) and medical birth register (since 1987).
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/birth\_and\_dvv\_register\_1.0/
* Detailed paths to library red data from catalog:
  * /finngen/library-red/finngen\_R8/catalog/catalog.txt

**22 September 2021**

* FinnGen R8 phenotype 3.0 data
* Created using an updated version of Endpointter.
* Years 2020 and 2021 added to Hilmo registers (preliminary information).
* Data and documentation location in Sandbox:
  * /finngen/library-red/finngen\_R8/phenotype\_3.0/

**6 September 2021**

* Finngen R8 genotype plink conversion data (version 1.0) and
* PCA-kinship data (version 1.0)
* Data and documentations location in Sandbox:
  * R8 genotype plink data and documentation: /finngen/library-red/finngen\_R8/genotype\_plink\_1.0/
  * R8 PCA data and documentation:/finngen/library-red/finngen\_R8/pca\_1.0/
  * R8 prune data and documentation:/finngen/library-red/finngen\_R8/prune\_1.0/
  * R8 kinship data and documentation: /finngen/library-red/finngen\_R8/kinship\_1.0

**6 September 2021**

* FinnGen R8 detailed longitudinal phenotype data
* Data location in Sandbox: /finngen/library-red/finngen\_R8/phenotype\_2.0
* New: codes with <5 cases have not been removed from this data

**30 August 2021**

* FinnGen R8 infectious disease register corona (version 1.0) data
* This data contains 4808 (cumulative number) corona virus positive FinnGen participant
* Data location in Sandbox: /finngen/library-red/finngen\_R8/infectious\_disease\_register\_corona\_1.0/

**27 August 2021**

* FinnGen release 8 genotype and phenotype data
* Finngen R8 data was imputed using SISuV4 reference panel that contains 8,554 high coverage \[25x] WGS Finnish individuals.
* The current data statistics are:\
  Number of individuals with genotypes = 356 213

  Number of individuals with endpoints = 356 077\
  Number of imputed variants = 20 175 454\
  Number of endpoints = 4228
* Genotypes Data & Documentation location in Sandbox: /finngen/library-red/finngen\_R8/genotype\_1.0/
* Phenotypes Data & Documentation location in Sandbox: /finngen/library-red/finngen\_R8/phenotype\_2.0/ (note that phenotype\_1.0 was an internal release only)
* See the detailed paths to library red data from catalog: /finngen/library-red/finngen\_R8/catalog/catalog.txt

**29 June 2021**

* FinnGen R6 mosaic chromosomal alteration (version 1.0)
* Data location in Sandbox: /finngen/library-red/finngen\_R6/mca\_1.0/

**22 June 2021**

* FinnGen R7 Colocalization
* Data location in Google cloud: gs\://finngen-production-library-green/finngen\_R7/finngen\_R7\_analysis\_data/colocalization/
* Data location in Sabox: /finngen/library-green/finngen\_R7/finngen\_R7\_analysis\_data/colocalization/

**22 June 2021**

* FinnGen R7 Autoreporting
* Data location in Google cloud: gs\://finngen-production-library-green/finngen\_R7/finngen\_R7\_analysis\_data/autoreporting/
* Data location in Sabox: /finngen/library-green/finngen\_R7/finngen\_R7\_analysis\_data/autoreporting/

**22 June 2021**

* FinnGen R7 finemapping results
* Data location in Google cloud: gs\://finngen-production-library-green/finngen\_R7/finngen\_R7\_analysis\_data/finemap/
* Data location in Sandbox: /finngen/library-green/finngen\_R7/finngen\_R7\_analysis\_data/finemap/

**3 June 2021**

* FinnGen R7 infectious disease register corona (version 4.0)
* This data contains 3496 (cumulative number) corona virus positive FinnGen participants
* Data location in Sandbox: /finngen/library-red/finngen\_R7/infectious\_disease\_register\_corona\_4.0/

**10 May 2021**

* FinnGen R7 infectious disease register corona (version 3.0)
* This data contains 3356 (cumulative number) corona virus positive FinnGen participants.
* Data location in Sandbox: /finngen/library-red/finngen\_R7/infectious\_disease\_register\_corona\_3.0/

**3 May 2021**

* FinnGen R7 kidney disease register data (version 1.0)
* Data location in Sandbox: /finngen/library-red/finngen\_R7/kidney\_disease\_register\_1.0

**30 April 2021**

* FinnGen R7 detailed cancer data (version 1.0)
* Data location in Sandbox: /finngen/library-red/finngen\_R7/cancer\_detailed\_1.0/

**29 April 2021**

* FinnGen R7 parental endpoint data (version 2.0)
* Data locationin Sandbox: /finngen/library-red/finngen\_R7/parental\_endpoint\_2.0

**27 April 2021**

* FinnGen R7 prs data (version 1.0)
* Data location in Sandbox: /finngen/library-red/finngen\_R7/prs\_1.0/

**27 April 2021**

* FinnGen R7 parental endpoint data (version 1.0)
* Data locationin Sandbox: /finngen/library-red/finngen\_R7/parental\_endpoint\_1.0

**26 April 2021**

* FinnGen R7 GRM data (version 1.0)
* Data location in Sandbox: /finngen/library-red/finngen\_R7/grm\_1.0

**20 April 2021**

* FinnGen R7 cov\_pheno and cov data (version 1.0)
* Data location in Sandbox: /finngen/library-red/finngen\_R7/phenotype\_4.0/

**20 April 2021**

* Finngen R7 Chip data
* In addition to the normal release dataset, it also includes high-quality filtered merged data.
* Data location in Sandbox: /finngen/library-red/finngen\_R7/chipd\_1.0/

**9 April 2021**

* FinnGen R7 corona data (version 2.0)
* This data contains 2983 (cumulative number) corona virus positive FinnGen participants.
* Data location in Sandbox: /finngen/library-red/finngen\_R7/infectious\_disease\_register\_corona\_2.0/

**30 March 2021**

* FinnGen R7 vaccination register data (version 1.0)
* The data contains vaccination data of 201 707 FinnGen participants
* Data location in Sandbox: /finngen/library-red/finngen\_R7/vaccination\_register\_1.0/

**21 March 2021**

* FinnGen R7 phenotype data (version 4.0) to Sandbox
* Number of endpoints: 4137 Number of FinnGen IDs: 321 302. Updated control definition file and made & AND rule and NEVT bug fixes to Endpointter
* Data location in Sandbox: /finngen/library-red/finngen\_R7/phenotype\_4.0

**12 March 2021**

* FinnGen R7 reproductive history data (version 1.0)
* The data contains reproductive history of 137 713 FinnGen mother participants. The data combines information from population register (DVV) (since 1953) and Medical birth register (since 1987).
* Data location in Sandbox: /finngen/library-red/finngen\_R7/birth\_and\_dvv\_register\_1.0

**12 March 2021**

* FinnGen R7 phenotype data (version 3.0)
* We have used updated endpoint and control definition files and made a few bug fixes to Endpointter. We have also filtered out some negative ages that were present in the previous first event file. Number of endpoints: 4137 Number of FinnGen IDs: 321 302
* Data location in Sandbox: /finngen/library-red/finngen\_R7/phenotype\_3.0

**9 March 2021**

* FinnGen infectious disease register data (corona, version 1.0) released to Sandbox.
* This data consists of 2560 (cumulative number) corona virus positive FinnGen participants.
* Data location in Sandbox: /finngen/library-red/finngen\_R7/infectious\_disease\_register\_corona\_1.0/

**5 March 2021**

* FinnGen visual impairment register data released to Sandbox.
* Data location in Sandbox: /finngen/library-red/finngen\_R7/visual\_impairment\_register\_1.0

**2 March 2021**

* FinnGen parental causes of death data released to Sandbox
* Data location in Sandbox: /finngen/library-red/finngen\_R7/parental\_causes\_of\_death\_1.0/

**2 March 2021**

* Finngen R7 bgen data
* Data location in Sandbox: /finngen/library-red/finngen\_R7/bgen\_2.0/

**26 February 2021**

* Finnish nationwide breast and cervical cancer screening
* Data location in Sandbox: /finngen/library-red/finngen\_R7/cancer\_screening\_1.0

**24 February 2021**

* Finngen R7 bgen data
* Data location in Sandbox: /finngen/library-red/finngen\_R7/bgen\_1.0/

**23 February 2021**

* Plink converted DF7 genotypes
* Data location in Sandbox: /finngen/library-red/finngen\_R7/genotype\_plink\_2.0

**19 February 2021**

* DF7 phenotypes
* Statistics for genotype\_2.0 and phenotype\_2.0 are: Number of endpoints: 4 145 Number of FinnGen IDs in phenotype data: 321 302 Number of FinnGen IDs in genotype data: 321 464
* Data location in Sandbox: /finngen/library-red/finngen\_R7/phenotype\_2.0

**5 February 2021**

* FinnGen R6 corona data (version 7.0)
* This data contains 1515 (cumulative number) corona virus positive FinnGen participants.
* Data location in Sandbox: /finngen/library-red/finngen\_R6/corona\_7.0/

**12 January 2021**

* FinnGen R6 corona data (version 6.0)
* Data location in Sandbox: /finngen/library-red/finngen\_R6/corona\_6.0/


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.finngen.fi/release-notes/data-releases-2021.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
