Harmonized biologic medications dataset
The dataset combines biologic medications used for Task Force–specific diseases into a single file, with 36 drugs and 16370 unique FGIDs.
The data is available in the following Sandbox directory:
/finngen/library-red/task_force_data/harmonized_biologics/Data sources include hospital records; medication purchase data from Kela and Kanta; the Finnish Quality Registry for Rheumatic Diseases; Kanta laboratory measurements (including measured drug concentrations and autoantibodies); and NOMESCO procedure codes related to hospital infusion administrations.
The data are structured in longitudinal format, with one row per event per individual. Exact duplicate records have been removed. However, partial or apparent duplicates may still remain because the data originate from multiple sources and lack unique identifiers linking related events across systems. For example, a prescription recorded in hospital data may later appear as a purchase in Kanta data and as a drug concentration measurement in Kanta lab measurement data.
For this reason, we do not recommend using the dataset to calculate the exact number of individual treatment events. Instead, it is better suited for analyses such as estimating drug exposure or treatment periods.
We have identified certain NOMESCO procedure codes that likely correspond to specific ATC drug codes in order to account for hospital-administered infusion medications. However, this linkage has not been formally validated. Users are therefore advised to assess the suitability and validity of these data for their specific research purposes.
Kanta laboratory values were filled by searching for drug names in the laboratory test descriptions and filling in ATC codes based on the drug names. E.g. "inFLIXimab Ab [Mass/volume] in Serum or Plasma" -> "infliximab".
The columns include:
FINNGENID
Study ID
APPROX_EVENT_DAY
Approximate day of event
EVENT_AGE
Age of event
ATC
ATC code of the prescription
VNR
VNR code of the prescription
Substance
Name of drug
MedicineName
Commercial name of medication
PackageSize
Size of medication package
DDDPerPack
Defined daily dose per package
Dosage
Dosage of the medication
DosageUnit
The unit of dosage
MERGED_SOURCE
Source of data, listed also above
OMOP_CONCEPT_ID
OMOP ID of the mapping
NOMESCO
NOMESCO code of the medication
OMOP_CONCEPT_NAME
NOMESCO name of the medication
Last updated
Was this helpful?