New Python openEHR Synthetic Data Generator

Koray_Atalag · 2 March 2026 12:14

Howdy, my weekend hack turned into something I thought might be helpful to others running after test data - heaps of them!

Project started from a fork of Berlin-Institute-of-Health / Genkidata (https://github.com/Berlin-Institute-of-Health/Genkidata).
It has significant additions:

instead of just duplicating existing Compositions, it uses:
- An NLP library to change text to synonyms for DV_TEXT. While not perfect from a clinical semantics point of view, it’s much better than lorem ipsum stuff!
- For quantities it changes values randomly between -15 <> +15 percent so it’s likely to be clinically plausable.
in addition a new feature to create canonical Compositions from Webtemplates (it’s a biggie! and possibly still has errors but it passed all tests from ehrbase SDK test webtemplates using Pablo’s validation tool).

When you run the app, it prompts three options:

API Upload (into ehrbase or other CDR)
Jitter Existing Compositions (\source_models\compositions) but rather than just duplicating in the original app it creates new values)
Stored (Source Webtemplates)

Existing Compositions are taken from test data from https://github.com/ehrbase/openEHR_SDK so they pretty much cover all possible variations.

The amount of Compositions and EHRs is defined by user input.

Resulting canonical Compositions are saved into:

/dist/compositions

You can put your own Compositions (to duplicate but with new values) and Webtemplates into:

/source_models/compositions

/source_models/webtemplates

Enjoy! And comments / tickets / pull requests welcome.

Topic		Replies	Views
Synthetic openEHR Data Generator - Open Source Tools test-data	0	42	28 March 2026
Any experiences of synthetich openEHR data, e.g. using Synthea? Apps test-data	2	786	4 February 2020
Example data in JSON or XML Implementation	2	514	9 August 2023
Synthea data in openEHR format Tools	10	194	24 February 2026
Automatic openEHR Template Tester Tools rest-apis , ehrbase	1	891	8 July 2021
New openEHR OPT v1.6 openEHR Toolkit	0	1155	2 May 2021
SynPuf: syntetic data (into openEHR) Implementation	13	1029	7 July 2022
Composition Examples Integration	5	641	18 September 2023
openEHR Web Template to FHIR Questionnaire converter Tool Support questionnaire , tools , fhir , template	8	219	6 October 2025
Example / conformance checking files New to openEHR?	2	61	23 September 2025