Medallion Architecture¶
LiveF1 implements a three-layer data processing architecture known as the Medallion Architecture. This design pattern organizes data into Bronze (raw), Silver (cleaned), and Gold (analytics-ready) layers, ensuring data quality and efficient processing.
Bronze: Raw data ingestion, raw logs and records, single source of truth
Silver: Cleaned & enriched data, standardized formats, quality assured
Gold: Analytics-ready data, optimized queries, business metrics
Data Flow Architecture¶
Layer Details¶
Bronze Layer (Raw Data)¶
The Bronze layer stores data in its original format, serving as the foundation of our data lake.
import livef1

# Get raw timing data for the 2024 Spa race
session = livef1.get_session(2024, "Spa", "Race")
raw_data = session.get_data("TimingData")  # Loads the feed into the Bronze lake
print(raw_data.head())  # Preview the first rows
Key Features
Unmodified source data
Complete data history
Audit trail support
Quick ingestion
Schema-on-read
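As a rough illustration of what "unmodified source data" and "schema-on-read" mean in practice, the sketch below keeps raw payloads exactly as received and only parses them when they are read. It is not LiveF1's implementation: `bronze_store`, `ingest_raw`, and `read_with_schema` are hypothetical names, and the sample payloads are made-up placeholder values.

```python
import json

# Hypothetical in-memory Bronze store: payloads are kept exactly as received.
bronze_store = []

def ingest_raw(payload: str) -> None:
    """Append the raw payload unmodified (quick ingestion, no parsing)."""
    bronze_store.append(payload)

def read_with_schema(fields):
    """Apply a schema at read time: parse each payload and keep only `fields`."""
    rows = []
    for payload in bronze_store:
        record = json.loads(payload)
        # Fields absent from a record come back as None instead of failing.
        rows.append({name: record.get(name) for name in fields})
    return rows

# Placeholder payloads with slightly different shapes.
ingest_raw('{"driver": "AAA", "lap": 1, "time": "1:46.128"}')
ingest_raw('{"driver": "BBB", "lap": 1, "time": "1:46.653", "pit": false}')

rows = read_with_schema(["driver", "time"])
print(rows)
```

Because nothing is parsed at ingestion, a payload with extra or missing keys is still stored, and the original text remains available for auditing or reprocessing.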
Silver Layer (Refined Data)¶
The Silver layer contains cleaned, validated, and enriched data ready for analysis.
import livef1

# Generate silver layer tables
session = livef1.get_session(2024, "Spa", "Race")
session.generate(silver=True)  # Process Bronze data into Silver tables

# Access refined data
laps_data = session.get_laps()  # Read the laps table from the Silver lake
print(laps_data.head())
Data Quality Checks
Data type validation
Duplicate removal
Missing value handling
Format standardization
Cross-reference validation
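The checks listed above can be sketched on plain Python records. This is a minimal illustration under stated assumptions, not the logic LiveF1 runs internally: `parse_lap_time`, `clean_laps`, the `M:SS.mmm` time format, and the sample rows are all hypothetical, and cross-reference validation is left out for brevity.

```python
def parse_lap_time(text):
    """Convert an 'M:SS.mmm' string to seconds; None if missing or malformed."""
    if not isinstance(text, str) or not text:
        return None
    try:
        minutes, seconds = text.split(":")
        return int(minutes) * 60 + float(seconds)
    except ValueError:
        return None

def clean_laps(raw_rows):
    """Return validated, de-duplicated, standardized lap rows."""
    seen = set()
    cleaned = []
    for row in raw_rows:
        driver = row.get("driver")
        lap = row.get("lap")
        # Data type validation: drop rows with a non-integer lap or bad driver code.
        if not isinstance(lap, int) or not isinstance(driver, str) or not driver:
            continue
        # Format standardization: driver codes are upper-cased.
        key = (driver.upper(), lap)
        # Duplicate removal: keep the first row per (driver, lap).
        if key in seen:
            continue
        seen.add(key)
        cleaned.append({
            "driver": key[0],
            "lap": lap,
            # Missing value handling: an absent time becomes an explicit None.
            "lap_time_s": parse_lap_time(row.get("time")),
        })
    return cleaned

raw_rows = [
    {"driver": "aaa", "lap": 1, "time": "1:46.128"},
    {"driver": "AAA", "lap": 1, "time": "1:46.128"},   # duplicate after upper-casing
    {"driver": "BBB", "lap": "2", "time": "1:47.000"}, # lap has the wrong type
    {"driver": "CCC", "lap": 1, "time": None},         # missing lap time
]
cleaned = clean_laps(raw_rows)
print(cleaned)
```

Dropping invalid rows is only one possible policy; a real pipeline might instead quarantine them so the Bronze record can be inspected later.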
Gold Layer (Analytics Ready)¶
The Gold layer is intended to provide optimized, aggregated data ready for business intelligence and machine learning.
Note
The Gold layer is not implemented yet. A planned option will let developers generate their own gold tables; that is what the Gold layer is for in LiveF1.
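Until that option exists, a developer-built gold table could look something like the following sketch, which aggregates Silver-style lap rows into per-driver metrics. Everything here is an assumption made for illustration: `build_gold_summary`, the `lap_time_s` column, and the sample rows are hypothetical and do not reflect a LiveF1 API.

```python
from collections import defaultdict
from statistics import mean

def build_gold_summary(silver_laps):
    """Aggregate lap rows into one summary record per driver."""
    by_driver = defaultdict(list)
    for row in silver_laps:
        # Laps without a recorded time are excluded from the metrics.
        if row["lap_time_s"] is not None:
            by_driver[row["driver"]].append(row["lap_time_s"])
    return {
        driver: {
            "laps": len(times),
            "best_s": min(times),
            "mean_s": mean(times),
        }
        for driver, times in by_driver.items()
    }

# Placeholder rows shaped like a cleaned laps table.
silver_laps = [
    {"driver": "AAA", "lap": 1, "lap_time_s": 106.0},
    {"driver": "AAA", "lap": 2, "lap_time_s": 108.0},
    {"driver": "BBB", "lap": 1, "lap_time_s": 107.5},
    {"driver": "BBB", "lap": 2, "lap_time_s": None},
]
gold = build_gold_summary(silver_laps)
print(gold)
```

With real data, the input would more likely come from `session.get_laps()`, whose actual column names may differ from the ones assumed here.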
Implementation Details¶
Data Lake Structure¶
The data lake implementation in LiveF1 uses a class-based structure:
session.data_lake/
├── bronze/              # Raw data storage
│   ├── timing/          # Timing data
│   ├── telemetry/       # Car telemetry
│   └── weather/         # Weather data
│
├── silver/              # Cleaned data
│   ├── laps/            # Lap time analysis
│   ├── car_data/        # Processed telemetry
│   └── track_status/    # Track conditions
│
└── gold/                # Analytics data
    ├── performance/     # Performance metrics
    ├── strategy/        # Strategy insights
    └── predictions/     # ML predictions
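To make the idea of a class-based lake more concrete, here is a minimal sketch of how three layers might be modelled as objects. The `Lake` and `DataLake` classes below are hypothetical and are not LiveF1's actual classes; they only mirror the bronze/silver/gold split shown in the tree.

```python
from dataclasses import dataclass, field

@dataclass
class Lake:
    """One layer of the lake: a named collection of tables."""
    level: str
    tables: dict = field(default_factory=dict)

    def put(self, name, rows):
        """Store (or overwrite) a table under `name`."""
        self.tables[name] = rows

    def get(self, name):
        """Return the table stored under `name` (KeyError if absent)."""
        return self.tables[name]

@dataclass
class DataLake:
    """Container holding one Lake per medallion layer."""
    bronze: Lake = field(default_factory=lambda: Lake("bronze"))
    silver: Lake = field(default_factory=lambda: Lake("silver"))
    gold: Lake = field(default_factory=lambda: Lake("gold"))

lake = DataLake()
lake.bronze.put("timing", [{"driver": "AAA", "lap": 1}])
lake.silver.put("laps", [{"driver": "AAA", "lap": 1, "lap_time_s": 106.0}])
print(sorted(lake.bronze.tables), sorted(lake.silver.tables), sorted(lake.gold.tables))
```

Keeping each layer as its own object makes it explicit which stage a table belongs to, so downstream code can choose the layer that matches its needs.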
Processing Methods¶
LiveF1 provides methods for processing data through each layer:
Method | Description |
---|---|
session.get_data() | Loads raw data into Bronze layer |
session.generate(silver=True) | Processes data to Silver layer |
 | Creates Gold layer analytics |
session.get_laps() | Retrieves lap data from Silver |
 | Accesses processed telemetry |
Best Practices¶
When working with the Medallion Architecture in LiveF1:
Data Loading
- Always load raw data to Bronze first
- Use parallel loading for multiple feeds
- Implement error handling

Data Processing
- Generate Silver tables as needed
- Cache frequently used data
- Monitor processing time

Data Access
- Use appropriate layer for needs
- Implement data validation
- Follow access patterns
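The "parallel loading" and "error handling" recommendations can be combined in one small helper. The sketch below uses Python's standard `concurrent.futures` module; `load_feeds` and `fake_loader` are hypothetical, and the feed names other than `TimingData` are placeholders rather than a confirmed list of topics.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed

def load_feeds(names, loader, max_workers=4):
    """Load several feeds concurrently, collecting failures instead of aborting."""
    loaded, failed = {}, {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(loader, name): name for name in names}
        for future in as_completed(futures):
            name = futures[future]
            try:
                loaded[name] = future.result()
            except Exception as exc:
                # One bad feed should not prevent the others from loading.
                failed[name] = str(exc)
    return loaded, failed

def fake_loader(name):
    """Stand-in for a real download; fails for one placeholder feed."""
    if name == "UnknownFeed":
        raise ValueError(f"no such feed: {name}")
    return f"raw payload for {name}"

loaded, failed = load_feeds(["TimingData", "WeatherData", "UnknownFeed"], fake_loader)
print(sorted(loaded), failed)
```

In practice the loader might wrap `session.get_data`, though whether that call is safe to run from multiple threads is not something this page states, so it is worth verifying before relying on it.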
See also
LiveTiming Data Topics for available data feeds
API Reference for detailed API documentation
Next Steps¶
Learn about Data Objects in LiveF1
Explore Examples for practical usage
Read the Quick Start guide to get started