LogoLogo
FinnGen Handbook
  • Introduction
  • Where to begin
    • Quick guides
      • New to FinnGen
      • Green data users
      • Red data users
    • I'm new to FinnGen, where is the best place for me to start?
    • What kind of questions can I ask of FinnGen data?
    • How do I make a custom endpoint?
    • How do I run a GWAS of a phenotype I created myself?
    • I'm interested in FinnGen rare variant phenotypes
  • Background Concepts
    • Basics of Genetics
    • Linkage Disequilibrium (LD)
    • Genotype Imputation
    • Genotype Data Processing and Quality Control (QC)
    • GWAS Analysis
    • P Values
    • Heritability and genetic correlations
    • Finemapping
    • Conditional analysis
    • Colocalization
    • Using Polygenic Risk Scores
    • PheWAS analysis
    • Survival analysis
    • Longitudinal Data Analysis
    • GWAS Association to Biological Function
    • Genetic Data Resources outside FinnGen
    • Getting Started with Unix
    • Getting Started with R
    • Structure of the FinnGen project
    • Finnish gene pool and health register data
  • FinnGen Data Specifics
    • FinnGen Data Freezes and Releases
    • Analysis proposals
      • What is a FinnGen analysis proposal and when do I need to submit one?
      • How do I submit an analysis proposal?
      • How are analysis proposals handled?
      • What is a FinnGen bespoke analysis proposal and when do I need to submit one?
      • How do I submit a bespoke analysis proposal?
      • How are bespoke analysis proposals handled?
      • What is the difference between FinnGen analysis proposals and FinnGen bespoke analyses?
      • Existing analysis proposals
    • Finnish Health Registries and Medical Coding
      • Finnish health registries
      • Register data pre-processing
      • Data Masking/Blurring of Visit Dates
      • International and Finnish Health Code Sets
      • More information on health code sets
      • VNR code mapping to RxNorm
      • Register code translation files
    • Endpoints
      • FinnGen clinical endpoints
      • History of creating the FinnGen endpoints
      • Location of FinnGen Endpoint and Control Description Files
        • What's new in DF13 endpoints
        • What’s new in DF12 endpoints
        • What’s new in DF11 endpoints
        • What’s new in the DF10 endpoints
        • What’s new in DF9 endpoints
        • What’s new in DF8 endpoints
      • Interpretation of Endpoint Definition file
      • Location of Endpoint Quality Control Report
      • Creating a User-defined Endpoint(s)
      • Requesting a User-defined Endpoint to be included in Core Analysis
      • Complete follow-up time of the FinnGen registries – primary endpoint data
        • Survival analysis using the truncated endpoint file – secondary endpoint data
    • Biobanks in Finland
    • Publishing FinnGen results
      • Preparing manuscripts or conference abstracts
      • The 1-year “Exclusivity Period” Policy
      • List of Publications using FinnGen Data
      • How to share GWAS summary statistics with FinnGen community
      • How to publish GWAS summary statistics
      • Public Result Releases
    • Red Library Data (individual level data)
      • Genotype data
        • Genotype Arrays Used
          • Legacy cohorts and chips
        • Imputation Panel
          • Sisu v4 reference panel
          • Sisu v3 reference panel
          • Sisu v4.2 reference panel
            • Variant-wise QC metrics file
        • Genome build used in FinnGen
        • Genotype Data Processing Flow
        • Genotype Files in Sandbox
          • Imputed genotypes in VCF format
          • Imputed genotypes in BGEN format
          • Imputed genotypes in PLINK format
          • Chip data
          • Imputed HLA alleles
          • Principal components analysis (PCA) data
          • Kinship data
          • Analysis covariates
          • Polygenic risk scores (PRS)
          • Genetic Ancestry
          • Genetic relationships (GRM)
          • Mosaic chromosomal alterations (mCA)
          • Prune data (R9)
          • Imputed STR genotypes (R8)
      • Phenotype data
        • Register data
        • Detailed longitudinal data
          • Splitting combination codes in detailed longitudinal data
        • Service sector data
          • Service sector data code translations
        • Endpoint and endpoint longitudinal data
        • Kanta lab values
          • Data
          • FAQ
          • How-to guides
        • Kanta prescriptions
        • Minimum extended phenotype data
          • Extracting minimum phenotype data per biobank
          • DNA isolation protocols per biobank
        • Minimum longitudinal data
        • Minimum phenotype data (before R11)
        • Cohort data (before R11)
        • Other register data files in Sandbox
          • Register of Congenital Malformations
          • Finnish Registry for Kidney Diseases
          • Reproductive history data
          • Finnish Cancer Registry: Cervical cancer screening
          • Finnish Cancer Registry: Breast cancer screening
          • Finnish Cancer Registry: Detailed cancer data
          • Finnish Register of Visual Impairment
          • Parental cause of death data
          • Ejection fraction data
          • Finnish National Infectious Disease Register
          • Finnish National Vaccination Register
          • Covid-19 primary care data
          • Blood donor data from the Finnish Red Cross Blood Service (FRCBS)
          • Dental data
          • Socioeconomic data
          • Hilmo and avohilmo extended data
      • Omics data
        • Proteomics
          • Expansion Area 5 proteomics data
          • FinnGen 3 proteomics data
        • Metabolomics
        • Single-cell transcriptomics and immune profiling
        • High-content cell imaging
        • Full blood counts and clinical chemistry
      • Hospital administered medications
      • Whole exome sequencing (WES) data
    • Green Library Data (aggregate data)
      • What is "Green" Data?
      • Accessing Green Data
      • Other analyses available
        • Colocalizations in FinnGen
        • Autoreporting – information on overlaps
          • Index of Autoreporting variables
        • HLA
        • LoF burden test
        • Meta-analyses
      • Core analysis results files
        • Recessive GWAS results format
        • Variant annotation file format
        • Genotype cluster plots format
        • GWAS results format
        • Finemapping results format
        • Colocalization results format
          • Results format in colocalization before DF13
        • Autoreporting results format
        • Sex-specific GWAS results format
        • UKBB-FinnGen meta-analysis file formats
        • Pairwise endpoint genetic correlation format
        • Heritabilities
        • Coding variant associations format
        • HLA association results
        • Proteomics results
        • Coding variant results including CHIP EWAS (Exome-Wide Association Scan)
        • Kanta lab association results v1
    • Disease specific Task Force data
      • Inflammatory bowel disease (IBD) SNOMED codes data
    • Expansion Area 3 (EA3) studies
      • EA3 study: Fatty liver disease study and data in Sandbox
      • EA3 study: Age-related macular degeneration study and data in Sandbox
      • EA3 study: Women's health studies
        • EA3 study: Women’s health – Endometriosis and data in Sandbox
        • EA3 study: Human papilloma virus-related gynecological lesions, and data in Sandbox
        • EA3 study: Women’s health – PCOS and infertility study, and data in Sandbox
      • EA3 study: Diabetic Kidney Disease and Rare Kidney Disease study and data in Sandbox
      • EA3 study: Oncology studies
        • EA3 study: Oncology – Breast cancer study and data in Sandbox
        • EA3 study: Oncology –Prostate cancer study and data in Sandbox
        • EA3 study: Oncology – Ovarian cancer study and data in Sandbox
      • EA3 study: Pulmonary diseases (IPF, asthma and COPD) study and data in Sandbox
      • EA3 study: Immune-mediated diseases
      • EA3 study: Heart Failure study and data in Sandbox
      • FinnGen EA3 leads
  • Disease Specific Task Forces
    • Inflammatory bowel disease (IBD)
    • Kidney Diseases
    • Eye Diseases
    • Rheumatic Diseases
    • Atopic Dermatitis
    • Pulmonary Diseases
    • Neurological Diseases
    • Heart Failure
    • Fibrotic Diseases
    • Metabolic diseases
    • Parkinson's diseases
  • Working in the Sandbox
    • How to get started with Sandbox
    • What is Sandbox and what can you do there
    • What do we mean by "red" and "green" data?
    • General workflows for the most common analyses
    • Quirks and Features
      • Managing your files in Sandbox
      • Navigating the Sandbox
      • How to save Sandbox window configuration
      • Copying and pasting in and out of your IVM
      • How to report issues from within the Sandbox
      • Sharing individual-level data within the Sandbox
      • How to download results from your IVM
        • Sandbox download requests – rules and examples for minimum N
      • Keyboard combinations
      • Running analyses in your IVM vs. Pipelines
      • Timeouts and saving your work (backups, github)
      • How to install a R package into Sandbox?
        • How to install R packages with many dependencies
      • Install R and Python packages from the local Sandbox repository
      • How to install a Python package into Sandbox
      • How to install GNU Debian package
      • How to upload your own files to IVM via /finngen/green
      • How to remove files from /finngen/green
      • Using Sandbox as a Chrome application (full screen mode)
      • How to reset your finngen.fi account password
      • Sandbox IVM tool request handling policy
      • Docker images
        • How to get a new Docker image to Sandbox
        • How to mount data into Docker container image
        • Containers available to Sandbox
        • Containers with user customized tool sets
        • How to write a Docker file
        • Anaconda Python environment in the Sandbox
      • Python Virtual Environment in Sandbox
      • How to shut down your IVM
    • Which tools are available?
      • FinnGen exome query tool
      • Custom GWAS tools
        • Custom GWAS GUI tool
        • Custom GWAS command line (CLI) tool
          • Custom GWAS CLI Binary mode
          • Custom GWAS CLI Quantitative mode
        • How to make your summary stats viewable in a PheWeb-style?
        • Finemapping of Custom GWAS analyses
        • PheWeb Users Input Validator tool
        • Conditional analysis of Custom GWAS analyses
      • Pipelines
      • Pre-installed Linux tools
      • PGS Browser
      • Lmod Linux tools
      • Anaconda Python module with ready set of scientific packages
      • Python packages
      • R packages
      • Atlas
        • Quick guide
          • Introduction to OHDSI, OMOP CDM and Atlas
          • From research question to concepts and cohort building
          • Using Atlas in Sandbox
          • Examples on cohort building with Atlas
        • Detailed guide
          • Atlas data model
          • Standard and non-standard codes
          • How to define a cohort in Atlas
            • Select FinnGen data release in Atlas for Search
            • How to define a simple ICD case-control cohort in Atlas
              • Define a simple ICD Concept Set in Atlas
              • Define a simple ICD case cohort in Atlas
              • Define a simple ICD control cohort in Atlas
            • Concept Sets
              • Create Concept Sets using descendants
              • Exclude and Remove codes from Concept Set
              • Simplify Concept Sets that use standard code descendants
              • Create Concept Sets using equivalent standard and non-standard codes
              • View standard code hierarchy in Atlas
            • Cohort Definitions
              • Using the Death register in Atlas
              • Filtering by clinical registries in Atlas
              • Filtering by demographic criteria in Atlas
              • Defining exit rules for a cohort in Atlas
              • Selecting the correct box in Atlas for events and medical codes
            • How to export FinnGen IDs from Atlas
          • Downstream analyses after the Atlas cohorts are created
          • Data Release Summary Statistics in Atlas
          • Cohort Summary Statistics in Atlas
            • Time-dependent Cohort Summary Statistics in Atlas
            • Event inclusion in Cohort Summary Statistics in Atlas
          • Cohort Pathways
      • BigQuery (relational database)
      • Atlas vs BigQuery cohorts
      • Genotype Browser
      • Cohort Operations tool (CO)
        • Upload cohorts to CO
        • Combine cohorts with CO
        • Operate on Atlas cohorts and data with entries and exit events
        • Explore code and endpoint enrichments with CO (CodeWAS)
        • Explore endpoint overlaps with CO
        • Compare custom endpoint to FinnGen endpoint with CO
        • Launch custom GWAS with CO
        • Export FinnGen IDs using CO
        • Understanding phenotypic overlaps using CO
      • Trajectory Visualization Tool (TVT)
        • Running TVT
          • Filtering timelines with TVT
          • Reordering timelines with TVT
          • Clustering timelines with TVT
          • Viewing TVT results
        • Viewing Atlas, CO, and Genotype cohorts in TVT
        • Exporting cohorts from TVT
        • TVT help page
      • LifeTrack
      • Miscellaneous helper scripts/tools
        • Tool to annotate variants with RSIDs
        • Proper translations of medical, service sector and provider codes
        • BigQuery Connection – R
          • Case study – All register data for a person
          • Case study – UpSet plot
          • Case study – Tornado plot
          • Case study – defining simple cohorts using medical codes for running case-control GWAS
        • BigQuery Connection - Python
          • BigQuery Python - Downstream analysis - Active Ingredient - Bar plot
          • BigQuery Python - Case Study - Sex different - Tornado plot
          • BigQuery Python - Case Study - Comorbidity - Upset plot
          • BigQuery Python - Case Study - Patient Timeline - Scatter plot
      • Sandbox internal API for software developers
    • Working with Phenotype Data
      • Variant PheWas
      • How to select controls for your cases
      • Using the R libraries to look at Phenotype data
      • How to check case counts from the data
      • Creating your own user-defined endpoint
    • Working with Genotype Data
      • Genotype Browser how to
      • Cluster Plots
      • ClusterPlot viewer V3C
      • Rare Variant Calling in V3C
      • Create map of allele
      • Genotypes from VCF files
      • Variant PheWas
      • Interpreting rare-variant analysis results
      • Tools for geno-pheno explorations
        • Example: transferring data from Genotype Browser to LifeTrack
        • Example: Visualizing Genotype Browser output data with TVT
    • Running analyses in Sandbox
      • How to run survival analyses
      • How to create custom endpoint using bigquery: example
      • How to use the Pipelines tool
      • How to submit a pipeline from the command line (finngen-cli)
      • How to run genome-wide association studies (GWAS)
        • How to run GWAS using REGENIE
        • Running quantitative GWAS with REGENIE
        • Conditional analysis
        • Conditional Analysis with custom regions and loci
        • How to run GWAS using SAIGE
        • Adding new covariates in GWAS using REGENIE and SAIGE
        • How to run GWAS using plink2 (for unrelated individuals only)
        • How to run GWAS using GATE (survival models)
        • How to run trajGWAS
        • How to run GWAS using the Regenie unmodifiable pipeline
        • How to run an interaction GWAS using the Regenie unmodifiable pipeline
        • How to run survival analysis using GATE unmodifiable pipeline
        • How to run GWAS on imputed HLA alleles using Regenie
      • How to run finemapping pipeline
        • Finemapping with custom regions in DF12
        • Unmodifiable Finemapping pipeline
      • How to run colocalization pipeline
      • How to run the LDSC pipeline
      • How to run PRS pipeline
      • How to calculate PRS weights for FinnGen data
      • Sandbox path and pipeline mappings
      • If your pipeline job fails
      • Tips on how to find a pipeline job ID
      • Managing memory in Sandbox and data filtering tips
      • Using Google Life Sciences API in Sandbox
      • Pipelines is based on Cromwell and WDL
    • Billing information and where to find more details
      • Monitoring Sandbox costs by Sandbox billing report
      • Monitoring Sandbox costs directly from your Google billing account
  • Working outside the Sandbox
    • Risteys
    • Endpoint Browser
    • PheWeb
      • Volcano plots with LAVAA
    • Meta-analysis PheWeb(s)
    • Coding variant browser
    • Multiple Manhattan Plot (MMP)
      • How to prepare an input file for MMP
      • How to use MMP
    • LD browser
    • Green library data
  • FAQ
    • FinnGen Spin Offs
    • FinnGen access and accounts
      • How do I apply for data access?
      • What is "red" or "green" data?
      • I already have green data access, how do I apply for red data access?
      • I cannot access the /finngen/red?
      • How do I enable two-factor authentication (2FA)?
      • I cannot access my FinnGen account?
      • How to reset account credentials
      • What to do if you suspect your account has been compromised
      • Can't access your smartphone for 2FA?
      • How do I access the FinnGen members' area?
      • How do I access FinnGen All Sharepoint?
      • How can I view existing analysis proposals?
      • How can I join the FinnGen Slack?
      • How do I join the FinnGen Teams group?
      • How to apply SES sandbox access
      • How to request a FinnGen account?
    • FinnGen data
      • What to do if I think I found a mistake in the data?
      • What are the field/column names in FinnGen?
      • What covariates are used in FinnGen's core GWAS analyses?
      • Does FinnGen have lab results available?
      • Does FinnGen have family and relatedness information available?
      • Where can I find a list of unrelated individuals in FinnGen?
      • When moving from BCOR to .txt files, what does the column called "correlation" mean?
      • Is there really no participant birth year data?
      • How do I calculate time between events?
      • Can I select only the columns needed for my analysis to import into RStudio?
      • What is the difference is between LD-clumping and the Saige conditional analysis?
      • Can I download all pairwise LD data across the genome at once?
      • How to find latest data releases?
      • Why are there differences in the GWAS results between Data Freezes/Releases?
    • Where can I find
      • COVID association results?
      • Users' Meeting materials?
      • A list of what coding variants are enriched in Finland?
      • A comprehensive list of key file locations in FinnGen?
      • Medical code translations?
    • PheWeb
      • What are QQ and Manhattan plots?
      • How can I access PheWeb?
      • Are fine-mapping results that available in PheWeb also available as flat files?
      • Do the autoreports report the 95% or 99% credible set?
    • Registries
      • What do KELA reimbursement codes map to?
      • What's the cutoff date for FinnGen data?
    • Sandbox
      • What is the FinnGen Sandbox?
      • Why does my IVM freeze while loading data into R/Rstudio
      • Where can I find tutorials and documentation on Sandbox?
      • How do I get my own analysis code into Sandbox?
      • Where to ask for software you'd like to see in Sandbox
      • Can I share individual level data between different Sandbox users?
      • Is there a sun grid engine for running long scripts?
      • How to clear browser cache after sandbox update
      • How do I increase the window resolution on my IVM?
      • How can I view pdf, jpg and HTML files?
      • My Sandbox job was killed - why?
      • How to unzip files in the command line
      • Why aren't my keyboard/shortcuts working in Sandbox like they do in my local computer?
      • How to know if my pipeline job was failed due preemption of worker VM
    • Risteys
      • Why is the case number dropping after the "Check pre-conditions, main-only, mode, ICD version" step?
    • Endpoints
      • Where do I find the most recent list of FinnGen endpoints?
      • What does it mean when an endpoint has “mode” at the end?
      • What scenario would cause an NA (missing data) entry rather than a zero?
      • Does it mean anything when a value is written as $!$ instead of NA?
      • Why is there an inconsistency between ICD10 code J84.1 (IPF) and J84.112?
      • How are control endpoints calculated?
      • Can I get a list of FinnGen IDs by control group for my endpoint?
      • What does Level C mean in the endpoints data table?
      • What does the SUBSET_COV field show?
      • Why is there a "K." prefix on some endpoints?
      • Why there are fewer endpoints going from R5 (N = 2,925) to R8 (N = 2,202)?
      • Should I include primary care registry (PRIM_OUT) codes in my cohort definitions?
      • I found BL_AGE after FU_END_AGE in the endpoint data, how is it possible?
      • Why do individuals who are not dead have death age in endpoint data?
      • I found EVENT_AGE after FU_END_AGE in endpoint data, how is it possible?
    • Pipelines
      • Are there example SAIGE pipelines?
      • How do I apply finemapping to my SAIGE results?
      • Why Pipelines is claiming that my files or folders are not in /finngen/red?
    • Citing
      • How do I cite analysis using publicly available FinnGen results?
      • How do I cite FinnGen results that use individual level data?
    • For biobanks
      • How to apply for data return
    • Data Security and Protection
      • How do I report a data breach?
  • Release Notes
    • Data Releases 2025
    • Data Releases 2024
    • Data Releases 2023
    • Data Releases 2022
    • Data Releases 2021
  • Tool Catalog
  • Glossary
  • User Support
  • Data Protection & Security
Powered by GitBook
On this page
  • Summary
  • Hilmo
  • Hospital data (inpatient and outpatient) - Hilmo (Health-Hilmo) register
  • Primary care data - Avohilmo register
  • Causes of death register
  • Cancer register
  • Drug purchase register
  • Drug reimbursement register

Was this helpful?

  1. FinnGen Data Specifics
  2. Red Library Data (individual level data)
  3. Phenotype data

Register data

PreviousPhenotype dataNextDetailed longitudinal data

Last updated 1 month ago

Was this helpful?

FinnGen contains data from a large number of Finnish registers. This data is only available in the Sandbox.

Depending on the register, the data is accessible from:

  • and

  • Other data file (e.g. )

Summary

The following table summarises the FinnGen registers and clinical data sets available in FinGen and where the data is available:

Phenotype data set

Start

year or time period

Standardized codes used

and

and

Other data file

1953

ICD-O-3

Yes (CANC)

Yes

Yes (Click first column link to see data location)

1964

ICD for indication, Finnish specific reimbursement codes

Yes (REIMB)

Yes

1995

ICD for indication, Finnish specific reimbursement code

Yes (PURCH)

Yes

1969

ICD-10,9,8

Yes (DEATH)

Yes

1969

ICD-10,9,8 and ATC

Yes (INPAT)

Yes

1986

Finnish specific NOMESCO and Finnish Hospital League operation codes

Yes (OPER_IN)

Yes

1998

NOMESCO and Finnish Hospital League operation codes

Yes (OUTPAT/OPER_OUT)

Yes

2011

ICD10, nurse procedures( SPAT codes) , nurse provided cause of visit (ICPC)

Yes (PRIM_OUT)

Yes

1964

Birth data as of DF12

1964

Click first column link to see data description

Yes

Click first column link to see data file location

1983

Click first column link to see data description

Yes

Click first column link to see data file location

1987

Click first column link to see data description

Yes

Click first column link to see data file location

1980

Click first column link to see data description

Click first column link to see data file location

1970

Click first column link to see data description

Click first column link to see data file location

1989

Click first column link to see data description

Click first column link to see data file location

1991

Click first column link to see data description

Click first column link to see data file location)

1992

Click first column link to see data description

Click first column link to see data file location

2011

Click first column link to see data description

Click first column link to see data file location

31.5.2012-31.5.2022

Click first column link to see data description

Click first column link to see data file location

Click first column link to see data description

Click first column link to see data file location

1.1.1990-1.2.2023

Click first column link to see data description

Click first column link to see data file location

until 23.3.2023

Click first column link to see data description

Click first column link to see data file location

until 22.2.2023

Click first column link to see data description

Click first column link to see data file location

1.1.2000-23.11.2022

Click first column link to see data description

Click first column link to see data file location

until 31.10.2022

Click first column link to see data description

Click first column link to see data file location

Click first column link to see data description

Click first column link to see data file location

14.3.2013-14.3.2023

Click first column link to see data description

Click first column link to see data file location

Table abbreviations:

  • KELA: Social Insurance Institution of Finland

  • STAT: Statistics Finland

  • THL: Finnish Institute for Health and Welfare

  • DVV: Digital and Population Data Services Agency

  • EA3: Expansion area 3

Hilmo

Hospital data (inpatient and outpatient) - Hilmo (Health-Hilmo) register

  • Patients discharged from inpatient care (since 1969)

  • Surgical operations (since 1986)

  • Specialised outpatient care (since 1998)

Accuracy and reliability

Health-Hilmo data is reviewed for any errors and omissions as soon as it arrives at the Finnish Institute for Health and Welfare (THL). Since 2016, the data has been reviewed using an automatic process that checks, among other things, mandatory data and whether the codes contained in the data correspond to the codes defined for Health-Hilmo. If errors or omissions are found in the review, the data provider is informed and responsible for correcting, supplementing or resubmitting the material.

The compiled statistics are compared with the corresponding statistics of the previous year. Unclear cases will be checked with the data providers. If, despite revisions and corrections, the data is left with deficiencies or errors, they are described in the statistical report.

Primary care data - Avohilmo register

  • Outpatient care visits, procedures, home care, and occupational health care

The data in Avohilmo is not as reliable as in Hilmo (Health-Hilmo). Finnish doctors are legally responsible for entering codes into Health-Hilmo. This is not the case in Avohilmo. Some codes are also provided by nurses (ICPC2) in Avohilmo.

Accuracy and reliability

THL reaches out to the data providers to request corrections if any shortcomings in the quality are identified. The data correction is always done by the data producers. If necessary, the Avohilmo register incorporates updated and corrected data, mainly from the last year. During 2020, the coverage of Avohilmo data was reviewed on a weekly basis for each patient information system. The information system vendors and service providers were contacted to correct any missing information.

Causes of death register

  • the year and date of death

  • the basic cause of death

  • the immediate cause of death

  • 1-4 contributing causes of death

The data is derived from death certificates and complemented by data on deaths from the Population Information System of the Population Register. The statistics on causes of death include all deaths in Finland or abroad of persons permanently resident in Finland at the time of their death.

Accuracy and reliability

Consistency

  • Between 1987 and 1995, the data was classified using the national classification of diseases 1987, ICD9, where comparability to the international version is maintained.

Cancer register

Healthcare organizations in Finland have a statutory obligation to provide information on every cancer case or strong suspicion of cancer to the Finnish Cancer Registry.

The Finnish Cancer Registry is also a statistical and epidemiological research institute that does active collaboration both nationally and internationally. It has three main focus areas: cancer statistics, research and screening.

  • Cancer research focuses on identifying the causes of cancer in the population, prevention and early detection of cancer, together with factors affecting the survival of cancer patients.

Accuracy and reliability

The data is very reliable. The Finnish Cancer Registry quality controls the data and uses it for its own research activities.

Drug purchase register

  • persons who during a specific time period have purchased medicines that are reimbursed

  • reimbursements for medicines purchased during a specific time period

  • costs for medicines purchased during a specific time period

  • prescription medicines purchased during a specific time period

The data is based on purchases of dispensed medicines reimbursable under the Finnish National Health Insurance (NHI) Scheme by the date of purchase. The purchase of medicines is accompanied by an on-the-spot NHI reimbursement to the customer according to the prescription record.

All permanent residents of Finland are covered by the Finnish National Health Insurance (NHI) Scheme and are eligible to reimburse the cost of reimbursable medicines prescribed by a doctor or dentist. Only outpatient medication costs are covered by the National Health Insurance Scheme. Medication administered in public hospitals is not reimbursable. Annual statistics are completed by the end of March following the end of the reference year.

Accuracy and reliability

These statistics are compiled from an individual-level registry on medicine reimbursements. They are produced from the benefits system for pharmacy payment reporting (prescription file). Any errors detected are corrected immediately in accordance with the release guidelines issued by the Advisory Board of the Official Statistics of Finland. Errors discovered in the online service are corrected and the erroneous information is removed.

Consistency

The effect of legislative amendments on eligibility conditions and rates should be taken into account when making year-to-year comparisons. Data on reimbursements for prescription medicines have been collected since 1995. The reimbursement scheme for medicines was changed at the beginning of 2006 with the right to reimburse smaller purchases than before. Since the beginning of 2007, the data also includes medicine purchases reimbursed by the workplace funds.

Other things to keep in mind include:

  • The buyer of reimbursed prescription medicines may not use all the medicines purchased

  • The data only includes outpatient medication and not inpatient (hospital) medication

  • The data does not include prescription medicines not approved for reimbursement by the Pharmaceuticals Pricing Board

  • The data does not include most of the over-the-counter (OTC) medicines

Drug reimbursement register

The medicine reimbursement system was created in Finland in 1964. The Health Insurance Scheme is administered by the Social Insurance Institution of Finland (Kela) and covers all residents of Finland. The latest version of the Health Insurance Act (Sairausvakuutuslaki) is 1224/2004.

The Health Insurance Scheme reimburses some of the costs for prescription medicines which are used for the treatment of an illness. An over-the-counter product may also be granted reimbursement status if the product is prescribed by a physician and considered, on medical grounds, to be an indispensable medicinal product. Basic topical ointments prescribed for the treatment of chronic skin ailments are also reimbursable, as are clinical nutritional preparations used in the treatment of a serious illness. Reimbursement is also paid to a limited group of patients for the dosage service fee charged by pharmacies.

The payment of reimbursement is possible only after the Pharmaceuticals Pricing Board has approved the reimbursement status of the medicine, basic topical ointment or clinical nutritional preparation and confirmed its reasonable wholesale price. The Pharmaceuticals Pricing Board operates under the auspices of the Ministry of Social Affairs and Health.

In the

(abbreviation)

availability

(limited access to Finnish researchers)

THL Hospital Inpatient and Outpatient (Hilmo): THL

THL Primary Care and Health Centers (Avohilmo): THL register

An example of how the Finnish register data has been used can be found in .

The Finnish Social and Healthcare Notification System () is a nationwide social and healthcare data collection and reporting system maintained by the Finnish Institute for Health and Welfare (THL). FinnGen has data from

Hilmo (Health-Hilmo): information on inpatient care and specialized outpatient care ()

Avohilmo: information on outpatient care and occupational health care ()

The Hilmo () register from THL has information on:

The quality of the data collected in Health-Hilmo has been assessed since its inception in 1969 in more than 30 scientific studies (a list can be found ). The majority of these studies have focused on cardiovascular disease, mental disorders and injuries. The general conclusion is that the data on treatment periods is comprehensive and that the main diagnoses and priority (main) procedures have been well reported. Some data omissions on side diagnoses and other procedures have been identified. Additionally, the quality and coverage of data have been observed to vary between hospital districts.

A quality report on specialized health care, in Finnish, can be found . The contains some details of the quality assurance process.

The Avohilmo () register from THL has information since 2011 on:

Data quality is monitored by the Finnish Institute for Health and Welfare (THL) and the data producers using which are updated daily. These reports monitor the coverage and quality of the data collection by service providers and type of service, as well as the coverage for reasons of visits including:

)

A quality report of Avohilmo can be found (in Finnish).

The from Statistics Finland has information since 1969 on:

harmonized 54-class basic cause of death classification () (not used in FinnGen)

The classifications of causes of death are described in the under the "Population and Education" section.

The data is very reliable and the cause and time of death are accurately recorded. A quality report can be found .

Between 1969 and 1986, the international classification ICD-8 was used with .

Since 1996, the statistics have been compiled based on the 10th revision of the International Classification of Diseases (ICD-10), with some Finnish additions (listed in Finnish). This is the WHO version of ICD10 and does not include some of the subtypes and extensions of the ICD10 codes used in the full Finnish set of ICD codes.

The has data since 1953.

Cancer statistics are used to monitor the incidence of new cancer cases and the survival and mortality of cancer patients. The statistics of the are used in the evaluation and quality control of cancer screening.

Cancer screening is the systematic search for the precursors or early stages of cancer in the population. The goal is to reduce deaths due to cancer among those screened. is included in the FinnGen.

The from KELA has information since 1995 on:

More information about reimbursed medicines including changes in the annual limit on out-of-pocket prescription drug expenses and reimbursement rates is available .

The from KELA has data since 1964.

Detailed longitudinal data file
Service sector data file
Atlas
OMOP CDM
Minimum extended phenotype data file
Care Register for Health Care
Register of Primary Health Care Visits
this publication
Hilmo
Care Register for Health Care
Register of Primary Health Care visits
Care Register for Health Care
here
here
Hilmo Opas
Register of Primary Health Care visits
Avohilmo's reports
Visits on a weekly and monthly basis
Reasons for visits and their recording rates monthly
Procedures and their recording rates monthly
Acute respiratory infection (influenza and other viruses
here
Causes of death register
54 categories
home page of the statistics in Classifications
here
Finnish additions
here
Finnish Cancer Registry
Cancer Screening Registry
Cervical and breast cancer screening data
Drug Purchase register
here
Drug Reimbursement register
Detailed longitudinal
Service sector data file
Atlas
OMOP CDM
Finnish Cancer Registry
Drug Reimbursement (KELA)
Drug Purchase (KELA)
Causes of death (STAT)
Care Register for Health Care, inpatient visits (THL Hilmo)
Care Register for Health care, inpatient operations (THL HIlmo)
Primary Health Care specialist outpatient visits incl. operations (THL Hilmo)
Primary Health Care outpatient visits (Avohilmo)
Population Register (DVV)
Minimum extended phenotype data file
Finnish Registry for Kidney Diseases
Finnish Register of Visual Impairment (THL)
Medical Birth Register (THL)
Register of Congenital Malformations (THL)
Sosioeconomic data (STAT)
Finnish National Infectious Disease Register (THL)
Finnish Cancer Registry: Cervical cancer screening (THL)
Finnish Cancer Registry: Breast cancer screening (THL)
Finnish National Vaccination Register (THL)
EA3: Fatty liver disease
EA3: Age-related macular degeneration
EA3: Women's health PCOS
EA3: Women's health Endometriosis
EA3: Women's health
EA3: Pulmonology
EA3: Diabetes and rate kidney diseases
EA3: Heart Failure
EA3: Immune-mediated diseases (Rheumatic and IBD)