ML Challenge
This year’s challenge focuses on training models to predict gain profiles for Commercial Erbium-Doped Fiber Amplifiers (EDFAs). EDFAs are critical components in optical communication systems, and accurate gain profile prediction is essential for optimizing network performance and reliability, especially under dynamic channel loading conditions.
Motivation
EDFA gain profile prediction is crucial for:
- Network planning and optimization
- Dynamic power management
- System reliability and performance monitoring
- Cost-effective network operations
Especially under dynamic channel loading conditions. Recently, researchers acheives high accuracy over gain profile predictions, but there are still some challenge unsolved. In our testing, we will evaluate the following three aspects of your EDFA DWDM channel gain model:
- EDFA channel gain aging effect
- Spectrum hole burning effect
- Unmeasured gain and tilt settings of EDFA
Here are the examples for typical gain spectrum under these three test goals, detailed description can be found in the kaggle competition description
- EDFA channel gain aging effect
- EDFA gain profile changes over 3 years on the same device
- We will only provide a small amount of the EDFA gain profile with aging effect for the training set
- Spectrum hole burning effect
- Gain profile especially under goalpost channel loading
- We will not provide any goalpost channel loading data for the training set
- Unmeasured gain and tilt settings of EDFA
- We will NOT provide the gain profile data for 20 dB and test your model generizability for unseen gain
- We will provide small amount of the EDFA gain profile with tilt and a larger dataset with all tilt equaling to 0 dB, and test your model performance on the tilted gain
Challenge Details
- Objective: Develop Theoretical, analytical or ML models that can accurately predict the gain profile of commercial EDFAs under various operating conditions
- Dataset: We will provide a large COSMOS EDFA gain profile dataset which is collected three years ago, and a small amount of the EDFA measurements we collect recently aiming the three technique issue we mentioned above.
- Evaluation: Models will be scored based on prediction accuracy under an hidden testset with three aspects mentioned above.
Dataset repo and submission platform
Dataset and example code
- We will provide all the codes for pre-processing the measurements data and a simple ML-based model just for reference
- Please refer to the dataset website for more details
- Dataset repo including:
Submission platform
- We provide detailed tutorial for the dataset explaination
- This is the official submission platform where you can see your score on the test set for real time
- There is a discussion panel on the competition page where you can post your questions
- Submission platform including:
Results submission
- Please submit your predicted
csvfile to this year Kaggle submission platform - We will also evaluate your model based on factors such as model size, inference time, and the novelty of the model architecture. Please create a GitHub repository (example) and provide clear step-by-step instructions on how to run your model.
Organizer
Acknowledgement