Yildirimhan Aydin

Wind Forecast ML Pipeline

🌬️ Wind Forecast ML Pipeline

This project presents a modular machine learning pipeline designed to forecast wind speed and wind direction for multiple wind farms using weather forecast model data.

Built as part of a broader energy optimization effort, this system helps improve the forecast accuracy of wind resources by training per-site ML models, automating predictions, and logging performance daily.


🧠 Key Goals


🏗️ System Architecture

The pipeline is built using modular notebooks (or scripts), each responsible for a distinct stage:

1. Feature Selection

2. Model Training

3. Daily Prediction


📥 Data Sources (Generalized)


🗃️ Output Tables (Generalized Schema)

🔹 model_features

Tracks selected features and performance per model.

Facility ID Model Type Selected Features MAE RMSE MAPE Timestamp

🔹 model_registry

Stores paths, parameters, and metrics for trained models.

| Facility ID | Model Path | Best Params | MAE | R² | Timestamp | ... |

🔹 daily_predictions

Daily predictions stored per facility, per hour.

| Facility ID | DateTime | Wind Speed | Wind Direction | Model Version | Prediction Time |

🔹 skipped_sites

Logs skipped predictions due to missing features.

| Facility ID | Date | Missing Inputs | Logged At |


🛠️ Tools & Tech


📈 Outcomes


🚀 Possible Enhancements


📝 Notes