This dataset was initiated in 2019 to introduce one of the first apps for Amazon Alexa in Luxembourg. This project aimed to release a real use-case of local services on a voice assistant platform, and we developed a waste pickup calendar.

The first challenge was accessing the raw data; at that time, the only choice was to scrap it from official websites. So we developed a nodejs modular scraping tool that connects to multiple sources, which are to this day:

  1. HTML from sidec.lu using cheerio library
  2. json from valorlux.lu
  3. ICS files from vdl.lu using node-ical library

When scraping is complete, the tool unifies all results into a single format, normalizes pickup types, matches against the CACLR address database and writes 1 json file per postal code in a simple format :

[
  {
    "uid": "5e8a5f0732fc6",
    "event_date": "1608073200000",
    "city": "Luxembourg",
    "location": "Côte d'Eich",
    "streetNumbers": "1-25, 2-24",
    "codepostal": 1450,
    "summary": "BULKY"
  },
  {
    "uid": "5e8a5f074f2c3",
    "event_date": "1608505200000",
    "city": "Luxembourg",
    "location": "Côte d'Eich",
    "streetNumbers": "1-25, 2-24",
    "codepostal": 1450,
    "summary": "PAPER"
  }
]

Note: The dataset does not cover the entire country (yet). Several other websites/sources should be crawled and consolidated to have a complete picture.

Ressources

API

myWastePickupCalendar

Basic API exposing calendar data in json format. Only one method available (getNextCollectesByPostalCode) and mapped on the API root /. Hosted on AWS Lambda and served by AWS API…

Dépôt de code

Source code

This code repository contains 3 tools: web_crawler : offline tool which crawls waste pickup calendars from various cities in Luxembourg. Datasets are matched against CACLR…

Discussions

Discussion entre l'organisation et la communauté à propos de ce jeu de données.

Ressources communautaires

Vous avez construit une base de données plus complète que celles présentées ici ? C'est le moment de la partager !

Réutilisations

Vous avez réutilisé ces données et publié un article, une infographie, ou une application ? C'est le moment de vous faire connaître ! Référencez votre travail en quelques clics et augmentez votre visibilité.