CyprienKelma/README.md

Hey there!

I’m Cyprien Kelma, a Cloud Data Engineer based in Lille, France.

✨​ Short Summary :

I love designing, implementing, and maintaining data processing systems that turn corporate data into actionable insights and a measurable return on investment.

After a successful four-month Data Engineering internship at Decathlon in Brussels, I landed a one-year work-study (apprenticeship) program at Decathlon in France. There, I'm furthering my technical skills while completing the last year of my software engineering degree at ISEN Lille.

Rather than rushing headlong into things, I always prioritize system designs that best meet the requirements and follow good practice, while ensuring that solutions stay optimized and robust in the long term.


⭐​ Tech Skills :

While I advocate for using the right tech for the right task (rather than giving in to the tempting shiny object syndrome 👀), I mostly leverage the following technologies:

Core DE Tools :

  • The good timeless classics : SQL and Python (with dbt, PySpark and Pandas)
  • Spark for distributed computing
  • Airflow and Prefect for batch pipeline orchestration
  • Relational databases (PostgreSQL, SQLite) and NoSQL (MongoDB, Cassandra, Redis)
  • Java (and Spring Boot for Backend)
  • Databricks and Delta Lake, on which I'm currently working to pass the Data Engineer Professional Certification this year
  • DevOps/DataOps concepts and CI/CD with GitHub Actions, especially to manage artifact publication and project deployment on dev/preprod/prod environments
  • Docker and Kubernetes (on-premises or with Cloud services)
  • IaC and Backend State with Terraform

Cloud Computing :

I love working with Cloud solutions 🙂. And even though I believe more in a deep understanding of concepts than in debating which tool to use, I have a personal preference for building Cloud systems with products from the Google Cloud Platform, especially BigQuery, Cloud Storage, GKE, Cloud Run, Dataform, and Dataflow.

I'm also actively working toward the GCP Professional Data Engineer certification this year. That said, I'm far from disliking AWS, especially MWAA, S3, EC2, Lambda and ECR/EKS.


🛠️​ Current side project :

I'm currently working on this Cloud Data Engineering project.

  • It's a complete ELT pipeline architecture template that can be reused by anyone. The goal is to pre-build a fully working data storage and processing system that covers everything from infrastructure to orchestration and configuration, so that it can be ready to use in less than 20 minutes.
  • Perfect for startups or small companies that want to start getting insights from their raw data without spending too much time and energy on infrastructure and pipeline creation.
  • Stack : GCP (Cloud Storage Bucket, BigQuery, Cloud Run), Prefect Cloud, dbt, Power BI (other choices are possible)

Apart from this one, I’ve got a bunch of interesting public pinned projects. For example, you can check this scalable, distributed data system architecture.


👋​ Let's Connect!

I'm always open to discussing questions or freelance opportunities alongside my main job :)

You can message me on LinkedIn : Cyprien Kelma

Pinned

  1. Projet-M1 Public

    Enterprise-grade, scalable and resilient architecture for data management and processing.

    Jupyter Notebook

  2. Summers-Team/school-m2-bi-project Public

    A complete data analysis pipeline, modeled with dbt, orchestrated with Prefect and automatically provisioned with Terraform on GCP

    Python

  3. JRestoManager Public

    Java-based restaurant management system developed as a school project.

    Java 3

  4. Summer-Market Public

    Web application designed for scanning and ordering products from a vacation equipment store.

    JavaScript 4