Google Sumer of Code 2025: Improving Probabilistic Solar Forecasts #30
Replies: 21 comments 11 replies
-
|
Hi @peterdudfield and @felix-e-h-p ! 👋 I'm Arpit Yadav, a third-year undergraduate student at IIT Kanpur, majoring in Mechanical Engineering. My background includes computational modeling, machine learning, and sustainability research. I have contributed to cargo-semver-checks, an open-source linter in the Rust ecosystem, and have worked with data-driven forecasting techniques. I am also active as a student researcher in the field of sustainable engineering (you can check out my work here ). I'm particularly excited about this project because probabilistic solar forecasting is a crucial challenge in renewable energy, and using GMMs to capture multimodal uncertainties is an innovative approach. I'm looking forward to making meaningful contributions moving forward. |
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield and @felix-e-h-p, The GMM approach to probabilistic solar forecasting is an elegant way to capture multimodal uncertainties. A few considerations that might be worth exploring:
I recently worked on a similar system using PyTorch's mixture module with custom loss functions that penalize physically implausible transitions between mixture states, improving forecast stability during dawn/dusk transitions and partly cloudy conditions. Would love to hear your thoughts on incorporating exogenous variables like cloud opacity or air mass into the mixture components. |
Beta Was this translation helpful? Give feedback.
-
|
Hello, @peterdudfield and @felix_h_p,
I have experience working with TensorFlow and Keras, particularly in training deep learning models for classification tasks. I'm also minoring in Statistics. I'd love to understand how I can best prepare to contribute to this project effectively. Are there any specific resources or areas I should focus on? Thanks! |
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield,@felix-e-h-p sir I’m chaitanya from iit jammu ,India a machine learning and deep learning practitioner with experience in probabilistic modeling and optimization. I’ve worked on YOLOv8 pipelines for document classification, focusing on uncertainty estimation, as well as applying evolutionary learning techniques like NEAT. I’m particularly interested in how different modeling approaches handle uncertainty in real-world scenarios. This project looks really exciting! Moving from quantile regression to a Gaussian Mixture Model approach seems like a strong way to better capture multimodal uncertainties, especially during transitions. Have you considered potential computational trade-offs with multiple GMM components, and would adaptive mixture weights be an option to dynamically adjust component importance based on context? |
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield and @felix-e-h-p , hope you're well! I love the idea and I would like to contribute. A bit of myself: My name is Víctor and I'm currently studying a Master in Statistics and Operations Research at UPC (Barcelona, Spain) where I'm specializing both in statistical inference (Frequentist, Bayesian, Time Series) and optimization (Large Scale Problems, Stochastic programming). Previously, I've got some experience working with DL in satellite orbit determination. Moreover, I'm currently an Software/Data engineer working at Seita (maintainers of FlexMeasures - a Linux Energy Foundation project). I have quite a lot of experience developing forecasting/optimization systems, and handling data in general. I'd be very interested in applying my current knowledge and expanding it to improve probabilistic forecast. In addition to GMM, we could try to use Cauchy Mixture Models to better model fat tails (which are common in renewables) and Bayesian Deep Learning using dropout layers. Nonetheless, for a base demo, we can try something in the lines of [1], using a Negative Log-Likelihood cost function. Please, let me know what do think :D Cheers, |
Beta Was this translation helpful? Give feedback.
-
|
Hello @peterdudfield , @felix-e-h-p and the Open Climate Fix team, I am Almouthana Taha Khalfallah an Applied Mathematics and modelization Engineering student specializing in data science and AI, with a strong passion for coding and mathematical modeling. I am particularly excited about the opportunity to contribute to the "Improving Probabilistic Solar Forecasts" project as it aligns perfectly with my academic background. I have a little experience in machine learning, deep learning, and statistical modeling, but I have a strong foundation in the mathematical concepts required for this project, including Gaussian Mixture Models and probability theory. I am eager to apply my skills to improve solar forecasting models and help contribute to more accurate and reliable renewable energy predictions. I am excited to collaborate with your team and learn from this experience to advance this field. Looking forward to hearing from you. Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hi @felix-e-h-p, @felix-e-h-p and OpenClimateFix Team, I’m Dakshbir Singh, a Computer Science student from NSUT, India passionate about Machine Learning, Deep Learning, NLP, and Generative AI. I have a strong background in mathematics and statistics, along with expertise in Python, C++ (DSA), and ML/DL frameworks like PyTorch. Given my interest in probabilistic modeling and AI-driven forecasting, I’m excited about contributing to the "Improving Probabilistic Solar Forecasts" project for GSoC 2025. Understanding of the Project I have a solid grasp of statistical modeling and probability distributions, but I am relatively new to Gaussian Mixture Models (GMMs). I'm eager to gain hands-on experience in implementing GMMs from scratch and applying them in real-world forecasting scenarios. Initial Steps for Contribution
I’m eager to start contributing and appreciate any guidance you can provide on the best way to proceed! 🚀 Looking forward to your insights! |
Beta Was this translation helpful? Give feedback.
-
|
Hey @felix-e-h-p and OpenClimateFix Team, I'm Nadhif, Data scientist with background in physical oceanography. I have experience in training model with python (pytorch), deploying model to production, data analysis (including marine and weather data), and ocean modelling (especially forecasting ocean current with Fourier Neural Operator and Graph Based model (GraphSage)). I'm excited about this project, which aims to create continuous distributions using Gaussian Mixture Models (GMMs). This approach promises a more complete and accurate representation of uncertainty compared to fixed quantiles, especially for transitions and bimodal outcomes. The focus on uncertainty prediction is particularly appealing, as it has significant real-world and production applications. I have a few questions:
I'm particularly interested in the challenge of determining appropriate initial means and variances, especially considering the constraints involved (such as matrix positive definiteness, etc). Best regards, Nadhif |
Beta Was this translation helpful? Give feedback.
-
|
Hi @felix-e-h-p and @peterdudfield, I’m Dakshbir Singh, an enthusiastic contributor to Open Climate Fix, and I’m incredibly excited about the Improving Probabilistic Solar Forecasts using GMMs project. I’ve been actively contributing to OCF over the past few weeks, and I’m eager to make more meaningful contributions as I prepare my GSoC 2025 proposal. I find this project particularly compelling because Gaussian Mixture Models (GMMs) offer a more nuanced way to capture multimodal uncertainties in solar forecasting compared to fixed quantiles. This approach could significantly enhance the accuracy of uncertainty estimation, which is critical for real-world applications in energy dispatch and grid balancing. After reviewing the project description and previous discussions, I had a few technical questions and would love to get your insights:
I am genuinely excited to collaborate and learn from your guidance. Thank you for your time and dedication—I look forward to contributing further and helping OCF drive impactful, sustainable innovations! 😊 Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield and @felix-e-h-p and the OpenClimate Fix team , hope you're doing well! I love the Open Data PVNet project and would love to contribute. A bit about myself: My name is Akshat Soni, and I'm currently pursuing a B.Tech in Computer Science and Engineering at SRM University. My focus is on AI, Data Science, and Cloud Computing, and I have strong experience in ML model training, data analysis, and cloud-based workflows. I’ve worked extensively with PyTorch, NumPy, Pandas, and Xarray, making me well-equipped to handle large-scale Numerical Weather Prediction (NWP) datasets and train robust solar forecasting models. I’m particularly interested in optimizing forecast accuracy while ensuring the model remains fully reliant on open data sources. I’d love to explore techniques like hyperparameter tuning, model benchmarking, and possibly integrating advanced feature selection to enhance predictive performance. A couple of questions regarding the project:
Looking forward to your insights! Cheers, |
Beta Was this translation helpful? Give feedback.
-
|
Hello @peterdudfield , I'm Prajakta, an aspiring GSoC contributor with experience in machine learning, time series forecasting, and probabilistic modeling. I’ve worked with Python, data preprocessing, and model evaluation, and I’m particularly excited about this project. I had a few questions:
Looking forward to your insights. Thanks for your time! Best, |
Beta Was this translation helpful? Give feedback.
-
|
Respected sir/mam, |
Beta Was this translation helpful? Give feedback.
-
|
Hello @peterdudfield @felix-e-h-p ! Really excited about the "Improving Probabilistic Solar Forecasts" project in GSOC 2025 ! The idea of switching to a GMM approach to capture those complex uncertainties in solar data sounds super interesting. I've got some experience implementing GMMs from scratch, and I'm pretty comfortable with ML/DL/PyTorch. To get a better handle on things, I was hoping you could clarify a few points:
Thanks for your time, and I'm really looking forward to potentially contributing to this project! |
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield, I am Aditya (GitHub), a final-year Computer Science student from India. I have some experience in Python, Django, Django Rest Framework, and contributing to open-source projects and working with APIs, databases, and deployment. I have one question where should I submit my proposal I am looking for a review before submitting on the GSoC program site, thanks :) |
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield and @felix-e-h-p , I am a fairly recent masters graduate from The Ohio State University in Geospatial engineering with a focus in photogrammetric computer vision. I am quite fascinated by the idea of addressing climate change with the use of renewable energy! I currently work part time for a startup company which shares similarities in this initiative. Although my main focus of research DL techniques for urban scene understanding in remote sensing images, I have experience with creating GMM models for vision tasks like object detection and photo editing processes like lazy snapping. I was also curious if the team had explored some type of data fusion techniques like Bayesian uncertainty fusion to improve current model accuracies? I believe this would be a great learning experience for myself in such an essential field of study like climate change. |
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield, @felix-e-h-p, and OpenClimateFix Team, I hope this message finds you well. My name is Bassel, currently a 3rd-year Computer Systems Engineering student (BUE/LSBU dual degree). I'm deeply passionate about leveraging technology, especially open-source tools, to address climate change challenges, which strongly aligns with Open Climate Fix's mission. I'm writing to express my strong interest in the GSoC project "Improving Probabilistic Solar Forecasts" (#30). Having been involved in climate policy discussions (e.g., COP28, COP29) and sustainability initiatives (UNGC, Oxford Net Zero, IRENA), I understand the critical need for accurate renewable energy forecasting. The project's focus on improving uncertainty quantification using Gaussian Mixture Models, moving beyond simple quantiles, strikes me as a technically fascinating and impactful approach. My engineering background provides a solid foundation in programming (Python) and analytical thinking, complemented by foundational ML knowledge. While implementing GMMs from scratch will be a key learning area, I'm very eager to dive into this challenge within the PyTorch framework. I'm very enthusiastic about the possibility of contributing to this project and learning from the team. Thanks for considering my interest! Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hello @peterdudfield and @felix-e-h-p, I’m Gianluca Ferro, a Master’s graduate in Electronic Engineering and currently a research fellow in Artificial Intelligence at ciparlabs , applied to smart grids and renewable energy communities. My Master’s thesis focused on time series forecasting for both energy consumption and renewable generation, where I explored gradient boosting methods, statistical models, and neural networks. I find your project on transitioning from quantile regression to Gaussian Mixture Models especially compelling, and I’m eager to experiment with these data-driven approaches and refine them for more accurate, multimodal forecasting. I do have a couple of questions:
I look forward to your thoughts on these points and to potentially contributing to this initiative. Thank you, |
Beta Was this translation helpful? Give feedback.
-
|
Hi everyone, My name is Claudio Gonzalez. I’m currently a third-year Data Science student at Tec de Monterrey in Guadalajara, Mexico, with a strong background in machine learning and deep learning, particularly using PyTorch. I’ve worked on various projects involving statistical modeling and have hands-on experience implementing advanced algorithms such as Gaussian Mixture Models. I'm excited about the potential of applying these techniques to improve probabilistic solar forecasting and contribute to this innovative project. I have a couple of questions:
Thank you in advance for your insights. I’m really looking forward to learning from your experiences and contributing to this project! Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hello @felix-e-h-p , I'm Prajakta, an aspiring GSoC contributor with experience in machine learning, time series forecasting, and probabilistic modeling. I’ve worked with Python, data preprocessing, and model evaluation, and I’m particularly excited about this project. I had a few questions: What are the biggest challenges in improving the accuracy of probabilistic solar forecasts? Are there any specific probabilistic models or frameworks currently preferred for this project? How do we plan to validate and benchmark improvements—what metrics matter most? Looking forward to your insights. Thanks for your time! Best, |
Beta Was this translation helpful? Give feedback.
-
Google Summer of Code 2025 applications are now closed.We are currently reviewing all applications. Contributors will be announced 8 May 2025. Thank you! |
Beta Was this translation helpful? Give feedback.
-
|
I'm closing this discussing now as GSOC 2025 is nearly over. Thank you for everyones input and help. We hope to take part next year and we'll be posting info here |
Beta Was this translation helpful? Give feedback.

Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
This space is for you to ask any questions you have about this project. We're here to provide clarifications and help you understand the project's goals, scope, and requirements. Feel free to ask about anything that interests you!
Please note that this discussion is for questions and clarifications, not for formal applications.
Project Description
Quantile regression approach in probabilistic solar forecasting directly estimates specific quantiles of the predictive distribution. Instead of direct quantity outputs, the proposition is modification of final layers to predict parameters of Gaussian Mixture Model(s) - for each component the mean, STD and coefficient. This should fundamentally capture more complex multimodal uncertainties inherent within probabilistic solar forecasting.
Expected Outcome
Complete continuous probability distribution as opposed to fixed quantiles. Better representation of uncertainties (such as transitions) and better capture of bimodal scenarios. Overall hopefully more accurate modelling and pattern comprehension - stronger uncertainty bounds perhaps.
Other Key Information
Beta Was this translation helpful? Give feedback.
All reactions