Skip to content
This repository was archived by the owner on Dec 24, 2025. It is now read-only.

chovman/redback-data-warehouse

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

redback-data-warehouse

Data Warehouse storage of code and configurations

Garmin Run Data – ETL Pipeline Update

This ETL pipeline processes Garmin_run_data.csv and includes:

Data cleaning:

  • Removes duplicate rows
  • Standardizes column names (lowercase, underscores)
  • Converts timestamps to datetime
  • Fills missing numeric values with column means
  • Removes outliers in heart_rate (keeps values between 30–220 bpm)
  • Converts distance from meters to kilometers
  • Converts speed from m/s to km/h

Data aggregation:

  • Groups data by year and week
  • Calculates total runs, total distance (km), average speed (km/h), and average pace (min/km) per week

Outputs:

  • cleaned_garmin_run_data.csv → cleaned dataset

About

Data Warehouse storage of code and configurations

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 50.5%
  • Jupyter Notebook 43.2%
  • HTML 3.0%
  • CSS 1.3%
  • Dockerfile 1.2%
  • PHP 0.6%
  • Shell 0.2%