Harmonized biologic medications dataset

The dataset combines biologic medications used for Task Force–specific diseases into a single file, with 36 drugs and 16370 unique FGIDs.

The data is available in the following Sandbox directory:

/finngen/library-red/task_force_data/harmonized_biologics/

Data sources include hospital records; medication purchase data from Kela and Kanta; the Finnish Quality Registry for Rheumatic Diseases; Kanta laboratory measurements (including measured drug concentrations and autoantibodies); and NOMESCO procedure codes related to hospital infusion administrations.

The data are structured in longitudinal format, with one row per event per individual. Exact duplicate records have been removed. However, partial or apparent duplicates may still remain because the data originate from multiple sources and lack unique identifiers linking related events across systems. For example, a prescription recorded in hospital data may later appear as a purchase in Kanta data and as a drug concentration measurement in Kanta lab measurement data.

For this reason, we do not recommend using the dataset to calculate the exact number of individual treatment events. Instead, it is better suited for analyses such as estimating drug exposure or treatment periods.

We have identified certain NOMESCO procedure codes that likely correspond to specific ATC drug codes in order to account for hospital-administered infusion medications. However, this linkage has not been formally validated. Users are therefore advised to assess the suitability and validity of these data for their specific research purposes.

Kanta laboratory values were filled by searching for drug names in the laboratory test descriptions and filling in ATC codes based on the drug names. E.g. "inFLIXimab Ab [Mass/volume] in Serum or Plasma" -> "infliximab".

The columns include:

FINNGENID

Study ID

APPROX_EVENT_DAY

Approximate day of event

EVENT_AGE

Age of event

ATC

ATC code of the prescription

VNR

VNR code of the prescription

Substance

Name of drug

MedicineName

Commercial name of medication

PackageSize

Size of medication package

DDDPerPack

Defined daily dose per package

Dosage

Dosage of the medication

DosageUnit

The unit of dosage

MERGED_SOURCE

Source of data, listed also above

OMOP_CONCEPT_ID

OMOP ID of the mapping

NOMESCO

NOMESCO code of the medication

OMOP_CONCEPT_NAME

NOMESCO name of the medication

Last updated

Was this helpful?