Coussement Bruno

Published in datamindedbe

·Apr 13

Batch orchestration on Azure flowchart

Managed solutions vs building it yourself. Batch data lake architectures often require a component that orchestrates data pipelines and ingestion flows. On Azure, the go-to service for this is Azure Data Factory (ADF). Although this is a great option to start with, there is more to be said. Most orchestrators…

Orchestration

5 min read


Published in datamindedbe

·Jun 9, 2021

What I wish I knew before going into Data Engineering

Disclaimer: this is my opinion, not necessarily that of my employer or any organisation. There is a clear difference between what I expected of the job and what I know now after 2.5 years on the job. Maybe not every item stated below applies to you, but some might. My background …

Data Engineering

7 min read


Published in datamindedbe

·Jun 7, 2021

ML Pipelines in Azure Machine Learning the right way

A code example to get you up and running quickly. The official Azure Machine Learning Studio documentation, the Python SDK reference and the notebook examples are often out of date, don’t cover all important aspects, or don’t provide a compelling end-to-end example. …

Azure Ml

8 min read


Published in datamindedbe

·Mar 9, 2021

What to consider before choosing Argo Workflow?

To go full Kubernetes-native or not? The recent explosion of tools, including task and data orchestration tools, should make you wonder if you’re still doing the right thing. Based purely on the GitHub stars of the open-source frameworks, Airflow is still the most popular one. This does not take into account the popularity of closed-source, or cloud vendor…

Argo

6 min read


Published in datamindedbe

·Nov 18, 2020

How to share tabular data in a privacy-preserving way

Adding noise to existing rows, only adding noise to outcomes of tasks performed on that data, or synthetic data generation? An intuition. As companies grow, as regulations become stricter, or as senior IT architects get up to speed with the latest trends, the need (or obligation) to mitigate privacy and leakage risks gets stronger for data processing entities. Data anonymization or data tokenization techniques are widely used in this context…

Differential Privacy

8 min read


Published in Towards Data Science

·Oct 13, 2020

Which cloud service provider ML platform do you need?

AWS SageMaker, Azure ML platform or GCP AI platform? It actually doesn’t matter. Not for industrialisation. First, I’m going to assume that you have chosen a cloud service provider (CSP), or are in a position to choose one for your organisation. Secondly, I’m also assuming that you need to be able to build, train, tune, evaluate and deploy a machine learning model, then the first thing you…

Azure Ml

7 min read

Data Engineer with a focus on MLOps

Following
  • Niels Claeys
  • Kris Peeters
  • Wannes Rosiers
  • Stijn De Haes
  • Jonathan Merlevede
