A Multi-Resolution Approach Based on the Integration of a Nonlinear Physical Model and Long Short-Term Memory Network for Photovoltaic Power Modeling and Forecasting

Original scientific paper

Journal of Sustainable Development of Energy, Water and Environment Systems
Volume 14, Issue 2, June 2026, 1140677
DOI: https://doi.org/10.13044/j.sdewes.d14.0677
Kpatchaa Tombana Baba , Eyouléky Palanga
University of Lomé, Lomé, Togo

Abstract

In the current context of energy transition, accurately forecasting solar power production is a major challenge for the efficient operation of electricity grids that include renewable energy sources. This work aims to improve prediction performance by combining a nonlinear physical model based on the principles of photovoltaic conversion with a recurrent neural network designed for time series analysis. The underlying hypothesis is that integrating physical knowledge with data-driven learning can better capture the complexity of solar energy patterns. The proposed method involves careful selection of input variables through correlation analysis and embedding the physical model within a deep learning structure. Results, evaluated using standard error metrics, demonstrate a clear improvement in forecasting accuracy compared to conventional approaches. While the physical model considered on its own produces a high error level (root mean square error = 338.55 and mean absolute error = 182.08), methods based on artificial intelligence significantly reduce these values (long short-term memory network: root mean square error = 3.29; recurrent neural network: 2.87). The hybrid method developed in this study achieves the best overall performance (root mean square error = 2.83 and mean absolute error = 1.26). This study contributes to the development of more reliable prediction systems capable of anticipating fluctuations in solar power generation due to changing environmental conditions.

Keywords: Photovoltaic power prediction; Solar power prediction; Hybrid modeling approach; LSTM neural networks; Nonlinear physical models; Time series forecasting

Creative Commons License
Views (in 2026): 534 | Downloads (in 2026): 202
Total views: 534 | Total downloads: 202
Introduction

In a world undergoing an energy transition, the integration of renewable energies has become a priority. It is shown in [1] that photovoltaic (PV) solar energy is one of the most promising renewable sources, while [2] highlights recent technological improvements in PV systems. Furthermore, [3] stresses that renewable energy adoption is essential for meeting today’s environmental challenges, and [4] demonstrates that large-scale PV deployment offers significant economic advantages.

The rapid global growth of solar installations has increased the need for accurate forecasting. In [5] Z. Garip et al. examined the expanding use of forecasting in solar energy systems, and [6] shows that forecast accuracy directly impacts energy management. In addition, [7] proves that improved forecasting methods enhance grid operation. Recent global assessments, such as [8], confirm unprecedented increases in annual PV capacity additions. Similarly, [9] emphasizes that accurate power forecasting is crucial for maintaining grid stability as PV penetration rises.

However, predicting PV power remains challenging due to environmental variability. It is identified in [10] that irradiance fluctuations, cloud cover, and temperature as major uncertainties affecting PV output. Moreover, [3] explains that the physical processes governing PV production add further complexity.

To structure this research field, several reviews categorize the main forecasting approaches. The paper [4] provides a classification into physical models, statistical methods, and AI-based or hybrid models.

Physical models rely on meteorological variables and PV system characteristics. Y. Zhi et al. explain in [8] how atmospheric data drive physical simulations, while [11] evaluates the robustness and theoretical reliability of such models. Still, [5] notes that physical models require intensive calibration and may be computationally demanding, which limits their deployment.

Statistical models offer a simpler alternative. K. Kamberi et al. show in [3] that ARMA and SVR-based statistical methods can yield good results under stable weather but degrade when conditions shift rapidly.

Machine learning and deep learning approaches have shown stronger predictive capabilities. Tripathi et al. [7] presents early advances in ML-based solar forecasting, while [12] discusses contributions from deep learning architectures. In addition, [13] demonstrates that neural networks significantly improve prediction accuracy under diverse conditions, confirming observations in [9]. Long short-term memory (LSTM) networks, described in [14], are especially effective for time series because they capture long-term dependencies, a finding further supported by [15].

Nonetheless, deep learning models face certain limitations. H. Hafdaoui et al. highlight in [13] their limited interpretability, and Z. Garip in [5] reports reduced generalization when exposed to unfamiliar atmospheric conditions. To address these issues, hybrid approaches have emerged. So, [16] and [17] show that integrating physical knowledge into ML architectures increases robustness and accuracy. Moreover, [18] confirms that combining physically derived information with data-driven strategies improves short-term forecasting.

Hybrid approaches typically incorporate physically meaningful variables. U. Singh et al. propose in [19] integrating irradiance and temperature estimates into models, while [20] uses clear-sky indices and radiation transposition outputs as additional features. Furthermore, [21] shows that such physical features improve generalization under varying atmospheric conditions.

Several illustrative hybrid architectures have been proposed. So, [22] introduces a framework combining satellite imagery, NWP forecasts, and machine learning for intra-day irradiance prediction. Also, [21] presents a hybrid model merging CEEMDAN signal decomposition with CNN-LSTM networks to handle non-stationary conditions more effectively. Additionally, [4] discusses physics-informed neural networks that enforce physical constraints within learning processes.

More recently, [23] summarizes hybrid irradiance forecasting approaches combining satellite inputs, NWP models, and AI tools. A complementary study in [24] proposes a SARIMA-LSTM hybrid enhanced through STL decomposition using LOESS, demonstrating improved short-term solar radiation forecasting. In PV power prediction, [18], shows that decomposition-based hybrid models outperform standalone neural networks, especially during transitional weather periods.

Despite these advances, many hybrid architectures remain loosely coupled. In [25], it is noted that physical and AI models often operate in parallel rather than being deeply integrated. Consequently, [12] calls for unified frameworks embedding physical principles directly into neural network learning mechanisms.

To address this gap, the present work proposes a tightly integrated hybrid method combining a nonlinear physical model with an LSTM neural network. This idea aligns with insights from [26], which demonstrates that fusing physical forecasts with data-driven learning enhances both interpretability and accuracy. Building on this foundation, our approach aims to improve short-term PV power prediction under variable environmental conditions, consistent with findings reported in [27].

The rest of the paper is organized as follows: Section II presents the methodology, Section III describes implementation and results, and Section IV concludes the study.

Materials And Methods

The data used come from sensors that record daily measurements [2] at regular one-minute intervals, Table 1.

Dataset overview

A

B

C

D

E

F

G

H

I

J

K

L

M

N

O

P

01/01/2021 00:00

1.6727

−1.4862

118.65

− 0.67286 9999999 9999

13.652

0.9666

−0.04227

0.00885 9999999 9999

0.00354

−1.7702

−3.5522

−3.5336

−3.2592

-2.6814

10

01/01/2021 00:01

1.6713

−1.4529

118.65

−0.73143

13.653

0.94264

− 0.04354 9999999 9999

− 0.44307 9999999 9999

0.0178

−1.7842

−3.6531

−3.499

−3.2125

−2.7644

10

01/01/2021 00:02

1.6719

−1.464

118.72

−0.79227

13.655

0.92725

−0.0441

−0.08418

0.00430 9999999 9999

−1.7876

−3.6049

−3.4852

−3.1522

−2.7367

10

01/01/2021 00:03

1.6719

−1.4307

118.84

−0.8224

13.654

0.90502

− 0.05086 9999999 9999

−0.27914

0.025

−1.7785

−3.489

−3.4639

−3.1803

−2.8473

10

For clarity and consistency, Table 2 shows how each alphabetical column label (A-P) was converted to its corresponding variable name.

Table 2. Column label mapping

A

measured on

B

ac_current__427

C

ac_power__423

D

ac_voltage__426

E

ambient_temp__428

F

das_battery_voltage__434

G

das_temp__433

H

dc_pos_current__425

I

dc_pos_voltage__424

J

dc_power__422

K

inverter_temp__432

L

module_temp_1__429

M

module_temp_2__430

N

module_temp_3__431

O

poa_irradiance__421

P

system id

The dataset contains 15 data columns collected over a period of more than three years, including information on ambient temperature, panel temperature, irradiation, current intensity and voltage, as well as data related to the batteries and the inverter.

An exploratory data analysis (EDA) was conducted to assess data quality [3], identify missing or anomalous values, detect trends and seasonal patterns, and better understand the relationships between variables relevant to photovoltaic power generation [3], Figure 1.

HeatMap

The proposed approach integrates two complementary components for photovoltaic power prediction [12].

The first component is a nonlinear physical model, eq. (1) derived from the fundamental equations governing photovoltaic effects, which provides an initial theoretical estimation of the output power [13].

The second component is a Long Short-Term Memory (LSTM) neural network designed to refine this estimation by learning the residual discrepancies between the measured and physically estimated power values [6].

The physical model is implemented as a function f(G,V,T) encapsulated within a Lambda layer. This layer allows arbitrary computations – here, the physical power estimation – to be seamlessly included in the computational graph of the neural network. This function computes the theoretical photovoltaic power output as a function of irradiance (G), module voltage (V), and cell temperature (T), representing the deterministic part of the system behaviour:

P=G×V×[1β×(T T Ref )]

where are: G – the irradiance (incident solar power), T- the module temperature, V- the electrical voltage, ß– the temperature coefficient of the panel efficiency, with this coefficient varying between 0.004 and 0.006 (ß = 0.005 ), and TRef - the reference temperature of the panel ( TRef = 25 °C ).

Subsequently, the LSTM network models the remaining nonlinearities and temporal dependencies not captured by the physical formulation, thereby adjusting and improving the overall prediction accuracy.

It is important to highlight that LSTM networks are a category of recurrent neural networks specifically designed to learn and retain long-term dependencies in historical data [4]. These models, first introduced by Hochreiter and Schmid Huber, have since been refined and are widely used by many researchers [5]. They are organized into LSTM units, and each unit integrates a memory cell and three gates (input gate, forget gate, and output gate) that regulate the flow of information through the network [6]:

f t =σ( W f [ h t1, x t ]+ b f ) i t =σ( W i [ h t1, x t ]+ b i ) C ˜ t =tanh( W c [ h t1, x t ]+ b c ) C t = f t × C t1 + i t × C ˜ t o i =σ( W o [ h t1, x t ]+ b o ) h t = o t ×tanh( C t )

where are: ft - the forget gate, it - the input gate, C ˜ t - the new cell information, Ct- the memory cell, ot- the output gate, and ht - the hidden state.

The output of the physical model and the output of the neural network are added together in the ADD layer to obtain a final prediction [4]. The idea is that the output of the physical model gives an approximation of power based on physical laws, while the LSTM recurrent network refines this prediction by modeling more subtle relationships in the data [7],Figure 2.

Model architecture

To improve the algorithm's performance, variable selection was performed based on a visual correlation analysis using a heatmap [8]. This graphical representation allowed us to observe the linear relationships between all the variables in the database and the target variable, namely the alternating current power ac_power__423.

By identifying the variables with the highest absolute correlation coefficients with the target, nine (9) variables were selected as the most relevant for model training:

  • dc_power__422: direct current (DC) power, measured at the output of the photovoltaic modules.

  • dc_pos_current__425: positive DC current from the panels.

  • ac_current__427: alternating current injected into the grid, directly related to the target variable.

  • poa_irradiance__421: plane-of-array (POA) irradiance, indicating the amount of solar radiation received.

  • module_temp_3__431, module_temp_2__430, module_temp_1__429: temperatures measured at different points on the photovoltaic modules.

  • dc_pos_voltage__424: positive DC voltage.

  • inverter_temp__432: internal temperature of the inverter.

The Base 1 dataset was then split, with 80% allocated for training and 20% reserved for testing [22].

Results

To evaluate prediction performance, two standard metrics were employed: the Root Mean Squared Error (RMSE) and the Mean Absolute Error (MAE) [9].

Root Mean Squared Error

RMSE is a widely used metric for assessing the performance of regression models [7]. It is the square root of the average of the squared differences between the predicted values ŷi and the actual values yi [8] :

MSE= 1 n i=1 n ( y i y ^ i ) 2 RMSE= 1 n i=1 n ( y i y ^ i ) 2

where are: yi- the actual value, ŷi - the predicted value, and n- the total number of data points.

This metric is sensitive to large errors and retains the same units as the original data, which facilitates interpretation [10].

Mean Absolute Error

The MAE measures the average absolute difference between actual values yi and predicted values ŷi, [11] regardless of the direction of the error [28]:

MAE= 1 n i=1 n | y i y ^ i |

where are: yi- the actual value, ŷi- the predicted value, and n - the total number of data points.

The Table 3 presents the results obtained for four models used to predict AC power output, evaluated using RMSE and MAE.

Models comparison

Model

RMSE

MAE

Model 1: Physical Model

338.55

182.08

Model 2: AI Model (LSTM)

3.29

1.51

Model 3: AI Model (RNN)

2.87

1.33

Model 4: Proposed Hybrid Approach

2.83

1.26

Discussion

The results presented in Table 3 reveal significant differences in performance among the various modeling approaches. Model 1, based on a classical physical modeling technique, shows very high error values (RMSE = 338.55 and MAE = 182.08), highlighting its inadequacy for accurate predictions in variable environments.

In contrast, Models 2 and 3, built on artificial intelligence architectures (LSTM and RNN), demonstrate significant improvements, with errors reduced by over 99% compared to the physical model. The RNN model (RMSE = 2.87; MAE = 1.33) slightly outperforms the LSTM model (RMSE = 3.29; MAE = 1.51), possibly due to a better adaptation to the sequential structures present in time series data.

The proposed Hybrid Approach (Model 4), which integrates a physical model with an LSTM neural network, yields the best performance with an RMSE of 2.83 and an MAE of 1.26. This slight improvement over purely AI-based models highlights the relevance of combining theoretical modeling with neural network learning capabilities [12].

In summary, these results underscore the superiority of artificial intelligence-based methods – particularly the effectiveness of integrating a physical model within a deep learning pipeline.[13] The proposed approach stands out for its accuracy, robustness, and adaptability in photovoltaic power forecasting contexts.

Conclusion

This article presents a novel hybrid approach for predicting photovoltaic power, integrating a nonlinear physical model with an LSTM neural network. This multi-resolution strategy effectively combines the theoretical rigor and interpretability of physical models with the adaptability and learning capacity of neural networks, enabling the capture of complex, nonlinear, and temporal dynamics inherent in photovoltaic systems.

Experimental results, assessed using RMSE and MAE metrics, clearly demonstrate that the proposed hybrid model outperforms both conventional physical models and standalone AI models. The substantial reduction in prediction errors highlights the practical effectiveness of this integrative approach, particularly under conditions of high environmental variability and fluctuating solar irradiance.

The study underscores the practical potential of hybrid modelling for smarter and more resilient energy management systems, providing a framework that can enhance decisionmaking in operational photovoltaic installations. Furthermore, the methodology offers a generalizable approach that could be adapted to other forms of renewable energy generation, broadening its applicability.

For future work, it would be particularly valuable to explore approaches that enable hybrid models to predict photovoltaic power on new installations without relying on historical operational data for training the LSTM component. Developing such a capability would significantly increase the practical utility of hybrid models, allowing rapid deployment and accurate prediction in emerging photovoltaic sites or for systems with limited historical records.

REFERENCES
  1. , , https://pubs.aip.Org/aip/acp/article-abstract/2900/1/020020/3295421/The-critical-review-of-solar-panels
  2. Rawat V., Athab A., Joshi S., Jayasree S., Dhabaliya D., Devika J., ‘Optimising Solar Energy: An Evaluation of IoT-Based Solar Panel Monitoring Systems’, E3S Web Conf, Vol. 540 , :080052024, https://doi.org/10.1051/e3sconf/202454008005
  3. Batista da Silva H., Thakur J., Uturbey W., Martin V., ‘Analysis of residential rooftop photovoltaic diffusion in India through a Bass model approach’, J. Sustain. Dev. Energy Water Environ. Syst, Vol. 10 (4), :1-102022, https://doi.org/10.13044/j.sdewes.d10.0423.
  4. Agga A., Abbou A., Labbadi M., Houm Y., Ou Ali I., ‘CNN-LSTM: An efficient hybrid deep learning architecture for predicting short-term photovoltaic power production’, Electr. Power Syst. Res, Vol. 208 , :1079082022, https://doi.org/10.1016/j.epsr.2022.107908.
  5. Garip Z., Ekinci E., Alan A., ‘Day-ahead solar photovoltaic energy forecasting based on weather data using LSTM networks: a comparative study for photovoltaic (PV) panels in Turkey’, Electr. Eng, Vol. 105 (5), :3329-33452023, https://doi.org/10.1007/s00202-023-01883-7.
  6. Wu G., Wang Y., Zhou Q., Zhang Z., ‘Enhanced Photovoltaic Power Forecasting: An iTransformer and LSTM-Based Model Integrating Temporal and Covariate Interactions’, In 2024 IEEE 8th Conference on Energy Internet and Energy System Integration (EI2), 856-8612024, https://doi.org/10.1109/ei264398.2024.10990816.
  7. Tripathi A., ‘Advancing solar PV panel power prediction: A comparative machine learning approach in fluctuating environmental conditions’, Case Stud. Therm. Eng, Vol. 59 , :1044592024, https://doi.org/10.1016/j.csite.2024.104459.
  8. Vicente-Gabriel J., Gil-González A., Luis-Reboredo A., Chamoso P., Corchado J., ‘LSTM Networks for Overcoming the Challenges Associated with Photovoltaic Module Maintenance in Smart Cities’, Electronics, Vol. 10 (1), 2021, https://doi.org/10.3390/electronics10010078., Art. no. 1
  9. Abuella ., Chowdhury B., ‘Solar power forecasting using artificial neural networks’, In 2015 North American Power Symposium (NAPS), 1-52015, https://doi.org/10.1109/naps.2015.7335176.
  10. Dadjiogou K., Ajavon A., Bokovi Y., ‘Energy Flow Management in a Smart Microgrid Based on Photovoltaic Energy Supplying Multiple Loads’, J. Sustain. Dev. Energy Water Environ. Syst, Vol. 12 (1), :1-272024, https://doi.org/10.13044/j.sdewes.d11.0473.
  11. Putra A., Paimin S., Alfaani S., Nita I., Arifin S., Munir M., ‘A Machine Learning Approach to Estimating Land Use Change and Scenario Influence in Soil Infiltration at The Sub-Watershed Level’, J. Sustain. Dev. Energy Water Environ. Syst, Vol. 12 (1), :1-182024, https://doi.org/10.13044/j.sdewes.d11.0477.
  12. Carrera B., Sim M., Jung J., ‘PVHybNet: a hybrid framework for predicting photovoltaic power generation using both weather forecast and observation data’, IET Renew. Power Gener, Vol. 14 (12), :2192-22012020, https://doi.org/10.1049/iet-rpg.2018.6174.
  13. Hafdaoui H., Boudjelthia E., Bouchakour S., Belhaouas N., ‘Use an artificial intelligence method (Machine Learning) for analysis of the performance of photovoltaic systems’, J. Renew. Energ, Vol. 25 (2), 2022, https://doi.org/10.54966/jreen.v25i2.1082., Art. no. 2
  14. Danoun A., Prulière E., Chemisky Y., ‘Thermodynamically consistent Recurrent Neural Networks to predict non linear behaviors of dissipative materials subjected to non-proportional loading paths’, Mech. Mater, Vol. 173 , :1044362022, https://doi.org/10.1016/j.mechmat.2022.104436.
  15. Hochreiter S., Schmidhuber J., ‘Long Short-Term Memory’, Neural Comput, Vol. 9 (8), :1735-17801997, https://doi.org/10.1162/neco.1997.9.8.1735.
  16. Wu Y., Liu J., Li S., Jin M., ‘Physical model and long short-term memory-based combined prediction of photovoltaic power generation’, J. Power Electron, Vol. 24 (7), :1118-11282024, https://doi.org/10.1007/s43236-024-00782-9.
  17. Raya-Armenta J., Ortega P., Bazmohammadi N., Spataru S., Vasquez J., Guerrero J., ‘An Accurate Physical Model for PV Modules With Improved Approximations of Series-Shunt Resistances’, IEEE J. Photovolt, Vol. 11 (3), :699-7072021, https://doi.org/10.1109/JPHOTOV.2021.3056668.
  18. El Alaoui A., Razak T., Jarimi H., Ismail M., A Hybrid Prophet–Lstm Forecasting Framework for Strategic Utility Planning in Energy Finance, Social Science Research Network, Vol. 5352181 , 2025, https://doi.org/10.2139/ssrn.5352181
  19. Singh U., Singh S., Gupta S., Alotaibi M., Malik H., ‘Forecasting rooftop photovoltaic solar power using machine learning techniques’, Energy Rep, Vol. 13 , :3616-36302025, https://doi.org/10.1016/j.egyr.2025.03.005.
  20. Zhi Y., Sun T., Yang X., ‘A physical model with meteorological forecasting for hourly rooftop photovoltaic power prediction’, J. Build. Eng, Vol. 75 , :1069972023, https://doi.org/10.1016/j.jobe.2023.106997.
  21. Camacho ., Maldonado-Correa J., Torres-Cabrera J., Martin-Martinez S., Gómez-Làzaro E., ‘Short-Medium-Term Solar Irradiance Forecasting with a CEEMDAN-CNN-ATT-LSTM Hybrid Model Using Meteorological Data’, Appl. Sci, Vol. 15 (3), :12752025, https://doi.org/10.3390/app15031275.
  22. , ‘Review on Spatio-Temporal Solar Forecasting Methods Driven by In Situ Measurements or Their Combination with Satellite and Numerical Weather Prediction (NWP) Estimates’, , https://doi.org/10.3390/en15124341,
  23. Sebastianelli A., Serva F., Ceschini A., Paletta Q., Panella M., Le Saux B., ‘Machine learning forecast of surface solar irradiance from meteo satellite data’, Remote Sens.Environ, Vol. 315 , :1144312024, https://doi.org/10.1016/j.rse.2024.114431.
  24. Yeşil F., Serttaş T., ‘A novel approach to solar radiation forecasting using a SARIMA-LSTM hybrid model with seasonal trend loess decomposition’, Neural Comput. Appl, 2025, https://doi.org/10.1007/s00521-025-11442-9.
  25. Dolara A., Leva S., Manzolini G., ‘Comparison of different physical models for PV power output prediction’, Sol. Energy, Vol. 119 , :83-992015, https://doi.org/10.1016/j.solener.2015.06.017.
  26. Chen Y., Lin C., Liu J., Yu D., ‘One-hour-ahead solar irradiance forecast based on real-time K-means++ clustering on the input side and CNN-LSTM’, J. Atmospheric Sol.-Terr. Phys, Vol. 266 , :1064052025, https://doi.org/10.1016/j.jastp.2024.106405.
  27. Salman D., Direkoglu C., Kusaf M., Fahrioglu M., ‘Hybrid deep learning models for time series forecasting of solar power’, Neural Comput. Appl, Vol. 36 (16), :9095-91122024, https://doi.org/10.1007/s00521-024-09558-5.
  28. Voyant C., ‘Prédiction de séries temporelles de rayonnement solaire global et de production d’énergie photovoltaïque à partir de réseaux de neurones artificiels’, phdthesis, Université Pascal Paoli, 2011https://theses.hal.science/tel-00635298

DBG