Representative synthetic dataset of Luxembourg’s citizens SDF

Updated on December 1, 2023 — Creative Commons Zero (CC0)

Luxembourg National Data Service

Enabling value creation from secondary use of data, with transparency, control and trust. Luxembourg National Data Service (LNDS) is a brand of PNED G.I.E., an economic interest group created by the Luxembourg Government to implement Luxembourg’s strategies in research, innovation, and…

4 datasets


Temporal coverage
2022/12/01 to 2023/12/01
Creation date
December 1, 2023
Latest resource update
December 1, 2023

This dataset was created with the open-source code released by LNDS (Luxembourg National Data Service). It is an example of the dataset structure that anyone can generate and customize by setting a few parameters, including the sample size. The file format is .csv: each row is an individual profile and each column is one of that person's features.

The data were generated from statistical information on the age structure of the population, the population of each municipality, the number of different nationalities present in Luxembourg, and salary statistics per municipality. STATEC, the statistics portal of Luxembourg, is the public source of the real information that we fed into our synthetic generation model. Other features, such as Date of birth, Social matricule, First name, Surname, Ethnicity, and physical attributes, were derived from logical relationships between variables, without using any additional real information. We comply with the law by keeping the risk of identifying a real person purely by chance close to zero.
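To make the generation idea concrete, here is a minimal Python sketch of sampling individual profiles from aggregate distributions and writing them as CSV rows. It is not the LNDS code: the function names (`weighted_choice`, `generate_profiles`, `to_csv`), the column names, the shares, the mean salaries, and the salary spread are all hypothetical placeholders rather than STATEC figures, and drawing each feature independently is a simplifying assumption.

```python
import csv
import io
import random

# Hypothetical aggregate inputs. The shares and salaries below are made-up
# placeholders for illustration, NOT real STATEC statistics.
AGE_BANDS = {(0, 17): 0.20, (18, 39): 0.32, (40, 64): 0.33, (65, 95): 0.15}
MUNICIPALITIES = {"Luxembourg": 0.50, "Esch-sur-Alzette": 0.30, "Differdange": 0.20}
NATIONALITIES = {"Luxembourgish": 0.50, "Portuguese": 0.15, "French": 0.10, "Other": 0.25}
MEAN_SALARY = {"Luxembourg": 70000, "Esch-sur-Alzette": 55000, "Differdange": 52000}
SALARY_STD = 10000  # assumed spread, identical for every municipality


def weighted_choice(rng, weights):
    """Pick one key of `weights` with probability proportional to its value."""
    keys = list(weights)
    return rng.choices(keys, weights=[weights[k] for k in keys], k=1)[0]


def generate_profiles(sample_size, seed=0):
    """Draw `sample_size` synthetic profiles from the aggregate distributions."""
    rng = random.Random(seed)  # seeded so that a run can be reproduced
    profiles = []
    for _ in range(sample_size):
        low, high = weighted_choice(rng, AGE_BANDS)
        municipality = weighted_choice(rng, MUNICIPALITIES)
        # Salary depends on the municipality; negative draws are clamped to zero.
        salary = max(0, round(rng.gauss(MEAN_SALARY[municipality], SALARY_STD)))
        profiles.append({
            "age": rng.randint(low, high),  # uniform within the chosen age band
            "municipality": municipality,
            "nationality": weighted_choice(rng, NATIONALITIES),
            "salary": salary,
        })
    return profiles


def to_csv(profiles):
    """Serialize profiles as CSV text: one row per person, one column per feature."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(profiles[0]))
    writer.writeheader()
    writer.writerows(profiles)
    return buffer.getvalue()


if __name__ == "__main__":
    print(to_csv(generate_profiles(sample_size=5, seed=42)))
```

Because every value is drawn from aggregate distributions rather than copied from a record, no row corresponds to a real person. Changing `sample_size` or the input tables is the kind of customization the description refers to.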

Files 2
