rdtools.soiling.soiling_srr

rdtools.soiling.soiling_srr(energy_normalized_daily, insolation_daily, reps=1000, precipitation_daily=None, day_scale=13, clean_threshold='infer', trim=False, method='half_norm_clean', clean_criterion='shift', precip_threshold=0.01, min_interval_length=7, exceedance_prob=95.0, confidence_level=68.2, recenter=True, max_relative_slope_error=500.0, max_negative_step=0.05, outlier_factor=1.5)

Functional wrapper for SRRAnalysis. Performs the stochastic rate and recovery (SRR) soiling loss calculation, based on the methods presented in Deceglie et al., JPV 8(2), p. 547, 2018.

Parameters
  • energy_normalized_daily (pandas.Series) -- Daily performance metric (e.g. performance index, yield, etc.). Alternatively, the soiling ratio output of a soiling sensor (e.g. the photocurrent ratio between matched dirty and clean PV reference cells). In either case, data should be insolation-weighted daily aggregates.

  • insolation_daily (pandas.Series) -- Daily plane-of-array insolation corresponding to energy_normalized_daily. Arbitrary units.

  • reps (int, default 1000) -- number of Monte Carlo realizations to calculate

  • precipitation_daily (pandas.Series, default None) -- Daily total precipitation. No particular units are required, but they must match those of precip_threshold; note that precip_threshold defaults to 0.01. (Ignored if clean_criterion='shift'.)

  • day_scale (int, default 13) -- The number of days to use in rolling median for cleaning detection, and the maximum number of days of missing data to tolerate in a valid interval. An odd value is recommended.

  • clean_threshold (float or 'infer', default 'infer') -- The fractional positive shift in the rolling median that is treated as a cleaning event. Specify 'infer' to set the threshold automatically from outliers in the shift (see outlier_factor).

  • trim (bool, default False) -- Whether to trim (remove) the first and last soiling intervals to avoid inclusion of partial intervals

  • method (str, {'half_norm_clean', 'random_clean', 'perfect_clean'} default 'half_norm_clean') --

    How to treat the recovery of each cleaning event

    • 'random_clean' - a random recovery between 0 and 100%

    • 'perfect_clean' - each cleaning event returns the performance metric to 1

    • 'half_norm_clean' - The starting point of each interval is taken randomly from a half normal distribution with its mode (mu) at 1 and its sigma equal to 1/3 * (1-b) where b is the intercept of the fit to the interval.

  • clean_criterion (str, {'shift', 'precip_and_shift', 'precip_or_shift', 'precip'} default 'shift') --

    The method of partitioning the dataset into soiling intervals

    • 'precip_and_shift' - rolling median shifts must coincide with precipitation to be a valid cleaning event.

    • 'precip_or_shift' - rolling median shifts and precipitation events are each sufficient on their own to be a cleaning event.

    • 'shift' - only rolling median shifts are treated as cleaning events.

    • 'precip' - only precipitation events are treated as cleaning events.

  • precip_threshold (float, default 0.01) -- The daily precipitation threshold for defining precipitation cleaning events. Units must be consistent with precipitation_daily.

  • min_interval_length (int, default 7) -- The minimum duration, in days, for an interval to be considered valid. Cannot be less than 2 (days).

  • exceedance_prob (float, default 95.0) -- the probability level to use for exceedance value calculation in percent

  • confidence_level (float, default 68.2) -- the size of the confidence interval to return, in percent

  • recenter (bool, default True) -- Whether to recenter the data to a normalized yield of 1 based on the first-year median

  • max_relative_slope_error (float, default 500.0) -- the maximum relative size of the slope confidence interval for an interval to be considered valid (percentage).

  • max_negative_step (float, default 0.05) -- The maximum magnitude of negative discrete steps allowed in an interval for the interval to be considered valid (units of normalized performance metric).

  • outlier_factor (float, default 1.5) -- The factor used in the Tukey fence definition of outliers for flagging positive shifts in the rolling median used for cleaning detection. A smaller value will cause more and smaller shifts to be classified as cleaning events.
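The cleaning-detection parameters above (day_scale, clean_threshold, outlier_factor, clean_criterion, precip_threshold) interact, so a small standalone sketch may help. It is not the rdtools implementation: it uses plain lists instead of pandas.Series, and the helper names (rolling_median, tukey_upper_fence, detect_cleanings) are hypothetical. It also assumes that the inferred threshold is the upper Tukey fence (Q3 + outlier_factor * IQR) of the absolute day-to-day shifts in the rolling median, and that a precipitation event must fall on the same day as a shift to coincide with it.

```python
from statistics import median, quantiles


def rolling_median(values, window):
    """Centered rolling median; None where the window does not fit."""
    half = window // 2  # an odd window is assumed, as recommended for day_scale
    out = [None] * len(values)
    for i in range(half, len(values) - half):
        out[i] = median(values[i - half:i + half + 1])
    return out


def tukey_upper_fence(shifts, outlier_factor=1.5):
    """Q3 + outlier_factor * IQR of the absolute shifts."""
    q1, _, q3 = quantiles([abs(s) for s in shifts], n=4, method="inclusive")
    return q3 + outlier_factor * (q3 - q1)


def detect_cleanings(metric, precip=None, day_scale=13, clean_threshold="infer",
                     outlier_factor=1.5, precip_threshold=0.01,
                     clean_criterion="shift"):
    """Return the day indices flagged as cleaning events (illustrative only)."""
    n = len(metric)
    med = rolling_median(metric, day_scale)
    # Day-to-day change of the rolling median; None where it is undefined.
    shifts = [None] * n
    for i in range(1, n):
        if med[i] is not None and med[i - 1] is not None:
            shifts[i] = med[i] - med[i - 1]
    if clean_threshold == "infer":
        clean_threshold = tukey_upper_fence(
            [s for s in shifts if s is not None], outlier_factor)
    shift_event = [s is not None and s > clean_threshold for s in shifts]
    # Assumption: precipitation counts only on the exact day it is recorded.
    precip_event = [p > precip_threshold for p in (precip or [0.0] * n)]
    combine = {
        "shift": lambda s, p: s,
        "precip": lambda s, p: p,
        "precip_and_shift": lambda s, p: s and p,
        "precip_or_shift": lambda s, p: s or p,
    }[clean_criterion]
    return [i for i in range(n) if combine(shift_event[i], precip_event[i])]


# Synthetic sawtooth: 0.2% loss per day, fully cleaned on days 30 and 60.
metric = [1.0 - 0.002 * (day % 30) for day in range(90)]
# Rain on day 30 (coincides with a cleaning) and on day 45 (does not).
precip = [0.5 if day in (30, 45) else 0.0 for day in range(90)]

for criterion in ("shift", "precip", "precip_and_shift", "precip_or_shift"):
    print(criterion, detect_cleanings(metric, precip, clean_criterion=criterion))
```

In this toy data set, lowering outlier_factor lowers the inferred threshold, which is why a smaller value classifies more and smaller shifts as cleaning events.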
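The three method options differ only in where each soiling interval is assumed to start after a cleaning. The following is a minimal sketch of that choice for a single Monte Carlo draw, not the library's code. The function sample_interval_start and its arguments previous_end and intercept are hypothetical names. The reading of 'random_clean' as a uniform recovery of the loss present at the end of the previous interval is an assumption, as is the half-normal tail extending below 1.

```python
import random


def sample_interval_start(method, rng, previous_end=None, intercept=None):
    """Draw the performance metric at the start of one soiling interval."""
    if method == "perfect_clean":
        # Every cleaning restores the metric to 1.
        return 1.0
    if method == "random_clean":
        # Assumption: recover a uniformly random 0-100% of the loss that was
        # present at the end of the previous interval.
        return previous_end + rng.random() * (1.0 - previous_end)
    if method == "half_norm_clean":
        # Half-normal with its mode at 1 and sigma = (1 - b) / 3, where b is
        # the fitted intercept. Assumption: b <= 1 and the tail extends below 1.
        sigma = (1.0 - intercept) / 3.0
        return 1.0 - abs(rng.gauss(0.0, sigma))
    raise ValueError(f"unknown method: {method!r}")


rng = random.Random(0)  # seeded so the sketch is repeatable
draws = [sample_interval_start("half_norm_clean", rng, intercept=0.94)
         for _ in range(10_000)]
share_above_b = sum(d >= 0.94 for d in draws) / len(draws)
print(f"max start: {max(draws):.4f}, share at or above b: {share_above_b:.3f}")
```

Because sigma is one third of the distance between 1 and b, the intercept sits three standard deviations below the mode, so under these assumptions roughly 99.7% of the sampled starting points fall between b and 1.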

Returns

  • insolation_weighted_soiling_ratio (float) -- P50 insolation weighted soiling ratio based on stochastic rate and recovery analysis

  • confidence_interval (numpy.array) -- confidence interval (size specified by confidence_level) of the insolation-weighted soiling ratio estimate

  • calc_info (dict) --

    • 'renormalizing_factor' - value used to recenter data

    • 'exceedance_level' - the insolation-weighted soiling ratio that was outperformed with probability of exceedance_prob

    • 'stochastic_soiling_profiles' - List of Pandas series corresponding to the Monte Carlo realizations of soiling ratio profiles

    • 'soiling_ratio_perfect_clean' - Pandas series of the soiling ratio during valid soiling intervals assuming perfect cleaning and P50 slopes

    • 'soiling_interval_summary' - Pandas dataframe summarizing the soiling intervals identified. The columns of the dataframe are as follows:

      Column Name            Description
      'start'                Start timestamp of the soiling interval
      'end'                  End timestamp of the soiling interval
      'soiling_rate'         P50 soiling rate for the interval, in day^−1. A negative value indicates soiling is occurring, e.g. a rate of −0.01 indicates 1% soiling loss per day.
      'soiling_rate_low'     Low edge of confidence interval for soiling rate for interval, in day^−1
      'soiling_rate_high'    High edge of confidence interval for soiling rate for interval, in day^−1
      'inferred_start_loss'  Estimated performance metric at the start of the interval
      'inferred_end_loss'    Estimated performance metric at the end of the interval
      'length'               Number of days in the interval
      'valid'                Whether the interval meets the criteria to be treated as a valid soiling interval
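The three headline outputs (the P50 soiling ratio, the confidence interval, and 'exceedance_level') are all summary statistics of the Monte Carlo realizations. The sketch below shows one way such statistics can be derived from a list of per-realization soiling ratios. It is illustrative only: percentile and summarize are hypothetical helpers, the realizations are synthetic, and placing the confidence interval symmetrically around the median is an assumption about the convention.

```python
import random


def percentile(sorted_values, q):
    """Linearly interpolated percentile (q in percent) of pre-sorted values."""
    pos = (len(sorted_values) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (sorted_values[hi] - sorted_values[lo]) * (pos - lo)


def summarize(realizations, confidence_level=68.2, exceedance_prob=95.0):
    """Return (P50, (ci_low, ci_high), exceedance_level) for the realizations."""
    values = sorted(realizations)
    half = confidence_level / 2.0
    p50 = percentile(values, 50.0)
    # Assumption: the interval spans equal probability on both sides of P50.
    ci = (percentile(values, 50.0 - half), percentile(values, 50.0 + half))
    # The level outperformed with probability exceedance_prob is the
    # (100 - exceedance_prob)th percentile.
    exceedance_level = percentile(values, 100.0 - exceedance_prob)
    return p50, ci, exceedance_level


# Synthetic stand-in for reps=1000 insolation-weighted soiling ratios.
rng = random.Random(42)
realizations = [min(1.0, rng.gauss(0.95, 0.01)) for _ in range(1000)]

p50, ci, exceedance_level = summarize(realizations)
print(f"P50 = {p50:.4f}, CI = ({ci[0]:.4f}, {ci[1]:.4f}), "
      f"P95 exceedance = {exceedance_level:.4f}")
```

With the default exceedance_prob of 95.0, the exceedance level is the 5th percentile, so it lies at or below the lower edge of the default 68.2% confidence interval.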