Volume 2016, Issue 1 9525204
Research Article
Open Access

Evaluating Correlations and Development of Meteorology Based Yield Forecasting Model for Strawberry

Tapan B. Pathak

Corresponding Author

Tapan B. Pathak

Division of Agriculture and Natural Resources, University of California, Davis, CA, USA ucdavis.edu

Search for more papers by this author
Surendra K. Dara

Surendra K. Dara

Division of Agriculture and Natural Resources, University of California, Davis, CA, USA ucdavis.edu

Search for more papers by this author
Andre Biscaro

Andre Biscaro

Division of Agriculture and Natural Resources, University of California, Davis, CA, USA ucdavis.edu

Search for more papers by this author
First published: 20 October 2016
Citations: 15
Academic Editor: Hiroyuki Hashiguchi

Abstract

California state is among the leading producers of strawberries in the world. The value of the California strawberry crop is approximately $2.6 billion, which makes it one of the most valuable fruit crops for the state and nation’s economy. California’s weather provides ideal conditions for strawberry production and changes in weather pattern could have a significant impact on strawberry fruit production. Evaluating relationships between meteorological parameters and strawberry yield can provide valuable information and early indications of yield forecasts that growers can utilize to their advantage. Objectives of this paper were to evaluate correlations of meteorological parameters on strawberry yield for Santa Maria region and to develop meteorology based empirical yield forecasting models for strawberries. Results showed significant correlation between meteorological parameters and strawberry yield and provided a basis for yield forecasting with lead time. Results from empirical models showed that cross-validated yields were closely associated with observed yield with lead time of 2 to 5 months. Overall, this study showed great potential in developing meteorology based yield forecast using principal components. This study only looked at meteorology based yield forecasts. Skills of these models can be further improved by adding physiological parameters of strawberry to existing models for strawberry.

1. Introduction

California produces 88% of nation’s fresh and frozen strawberries. The value of the California strawberry crop is approximately $2.6 billion, which makes it one of the most valuable fruit crops for the state and nation’s economy. Favorable climate conditions and technological advancements among other factors support strawberries to be approximately four times higher than other production areas within and outside United States. According to [1], since 1990 strawberry acreage has approximately doubled and is projected to increase due to high value and favorable conditions.

Since strawberry is a high value crop with fruit production spread over several months, proper agronomic practices are important to ensure optimal yields. Additionally, environmental factors can play a very important role during the growth and development of strawberries. Water management is also an important aspect of strawberry production not only for plant growth and yields but also for leaching out of salts from the root zone. Avoiding water stress is also critical for reducing the damage from twospotted spider mite (Tetranychus urticae), a major pest of strawberry.

Apart from water management, one of the major challenges in strawberry production is impacts and control of pests and diseases. The western tarnished plant bug (Lygus hesperus) and twospotted spider mite are two major pests of strawberry, which cause significant yield losses [2]. Spider mites thrive under warmer and dryer conditions. Such conditions also promote the migration of the western tarnished plant bug to strawberries and other cultivated hosts from wild hosts in the surrounding areas. Additionally, many pests have shorter life cycles under warmer conditions and their populations build up rapidly. Diseases such as charcoal rot (Macrophomina phaseolina), Fusarium wilt (Fusarium oxysporum f. sp. fragariae), Phytophthora crown rot (Phytophthora spp.), and Verticillium wilt (Verticillium dahliae) are a challenge in strawberry production especially in the absence of the fumigant, methyl bromide.

Despite these challenges, California’s Mediterranean climate offers ideal weather conditions for both nursery plant and strawberry fruit production. Transplants are produced in high elevation nurseries in northern California where cold temperatures allow nursery plants to go through cold hardiness and accumulate carbohydrates in the crowns for optimal growth in the fruit production fields. Fruits are produced in three main regions on the Central Coast where cool nighttime conditions are conducive for flower production and mild daytime temperatures are ideal for plant and fruit development. Additionally, as majority of the rainfall is during the winter months before the peak fruit production season, they do not typically interfere with fruit production. Variations in weather conditions in three strawberry production areas in California complement fruit production from each other and help avoid market glut. The warmer Oxnard area, the milder Santa Maria area, and the colder Watsonville area with minimal overlapping of their peak fruit production seasons allow yearlong strawberry production.

Weather influence on strawberry has been documented in various studies. For instance, [3] examined strawberry yield efficiency and its correlation with temperature and solar radiation and found strawberry yield was significantly correlated with solar radiation. There are various other studies that showed the importance of solar radiation in strawberry growth and development overall [46]. Studies by [7, 8] studied impacts on strawberry under high humidity. Study [9] evaluated relationships of various crops in California including strawberries with weather parameters.

Changes in weather pattern could have a significant impact on strawberry fruit production, timings, and ultimately the market value. Analyzing influence of weather information on strawberry yield and utilizing it to provide yield forecast early in the season may provide an opportunity to tailor agricultural practices for higher yields and profits. There are potential benefits of using climate information on decision-making processes in agriculture as a way to adapt to climate variability [1013]. Crop growth is weather dependent and thus it is a common practice to predict crop yield based on weather variables [1417]. Since strawberry production spreads across 4-5 months, evaluating relationships between meteorological parameters and strawberry yield can provide valuable information and early indications of yield estimations that growers can utilize to their advantage.

Objectives of this paper are to evaluate correlations of meteorological parameters on strawberry yield for Santa Maria region and to develop meteorology based empirical yield forecasting models for strawberries.

2. Materials and Methods

2.1. Strawberry Yield Data

This paper is focused on strawberry yield data for Santa Maria region of California. Two-thirds of the total strawberry production acreage is located in the Central Coast and Santa Maria Valley. These regions encompass the coastal regions of Santa Cruz, Santa Clara, Monterey, San Luis Obispo, and northern Santa Barbara counties [18]. According to [19] two primary fall planted cultivars grown in Santa Maria strawberry production district are “San Andreas” and “Monterey,” accounting for 39.9% and 32.9% of the district, respectively, for 2016. Additionally, “San Andreas” cultivar accounted for 22.4%–39.9% of the district and “Monterey” cultivar accounted for 2.3%–32.9% of the district during last five years (2012–2016). Strawberry is an annual crop with plants first grown in the nurseries and then transplanted into the fields. For Santa Maria region, transplanting typically occurs between late July and September. Strawberry production from April to July accounts for most of the yearly strawberry production with the peak production typically during the month of May. Since April–July accounts for most of the yearly values, this paper is focused on yield analysis for this time period.

Daily strawberry yield data for Santa Maria county was obtained from the California Strawberry Commission’s website [19]. This information is publically available and is originally compiled from the United States Department of Agriculture Market News/Fruits & Vegetables website [20]. Daily strawberry yield data for the month of April through July were aggregated to weekly values. For this analysis we used weekly strawberry yield data for 2009 through 2015. While working with large number of historical yield data, it is important to examine if there is a significant upward trend in yield over time, which could be due to technological improvements over time. Since the number of years used in this study was relatively low, historical strawberry yield data obtained from [19] were directly utilized for correlation analysis and yield forecasting model development.

2.2. Meteorological Parameters

Meteorological data were obtained from the California Irrigation Management Information System (http://www.cimis.water.ca.gov/), a network of over 145 automated weather stations in California. Specific meteorological parameters used in this study were net radiation, air temperature (minimum and maximum), relative humidity (minimum and maximum), dew point temperature, soil temperature (minimum and maximum), vapor pressure (minimum and maximum), reference evapotranspiration, and average wind speed.

Total incoming solar radiation from the CIMIS station was measured using pyranometers, which was then used in the calculation of net radiation. Air temperature data is measured at a height of 1.5 meters above the ground using a thermistor. Instead of using average temperature, minimum and maximum temperatures averaged over a weekly period are used. Daily temperature data obtained from the CIMIS station is aggregated at a weekly time scale for correlation and model development purposes. Soil temperature data are collected at 15-centimeter depth below ground using a thermistor with resistance that varies with temperature. Minimum and maximum soil temperatures averaged over a weekly timescale were used for this study. Relative humidity is defined as the amount of water vapor present in air expressed as a percentage of the amount needed for saturation at the same temperature. The relative humidity sensor is sheltered in the same enclosure with the air temperature sensor at 1.5 meters above the ground. Relative humidity is a very important meteorological parameter that can impact fruits such as strawberry. This is because relative humidity is also a good indicator of pests and diseases to which strawberry yield is highly sensitive. In this study, minimum and maximum relative humidity averaged over a weekly timescale have been utilized. Wind speed used in this study was obtained through the CIMIS station that is measured using three-cup anemometers at 2.0 meters above the ground. There is a published result documenting the impacts of wind speed on strawberry yields [6]. Wind speed on a weekly time scale was utilized to analyze its impacts on strawberry yield. Vapor pressure of the atmosphere is the partial pressure exerted by atmospheric water vapor. It is a calculated parameter from relative humidity and air temperature data. Reference evapotranspiration is evapotranspiration from standardized grass (ETo). The CIMIS ETo and ETr values are calculated using the modified Penman equation. Since ET has direct influence on crop growth, ETo information was utilized in this study. Weekly meteorological data for this study was obtained from the CIMIS for the duration of 2007–2015.

2.3. Correlation Analysis

Correlation analysis between meteorological parameters and strawberry yield was performed using the Pearson product-moment correlation. This is a widely used methodology to measure linear dependence between two variables. In this case, linear dependence was tested between meteorological parameters and strawberry yield.

Weekly values of meteorological parameters from October of the year prior to harvest to February of current year of strawberry harvest were correlated with weekly strawberry yield from April through July and tested for significance at p < 0.05. Each meteorological variable was correlated with strawberry yields from April to July. This thorough correlation analysis was done in order to understand influence of meteorological parameters on strawberry yield on a more detailed basis. Meteorological parameters that exhibit significant correlation with strawberry yield were then used to develop empirical model to forecast strawberry yields.

2.4. Principal Component Regression

Meteorological parameters utilized as independent variables to develop empirical relationship to forecast strawberry yields exhibit colinearity. Typically, meteorological parameters exhibit significant correlations. If these explanatory variables were utilized directly into regression models, it would violate the assumption of nonconlinearity of explanatory variables. Use of principal component regression has multiple benefits. It can reduce the number of explanatory variables utilized in the model significantly. This is specifically important and useful if we have high correlation among the explanatory variables such as that for meteorological parameters. Another advantage is that the principal components are mutually independent and thus solve the issue of multicolinearity in regression models.

Instead of using meteorological parameters as explanatory variables, principal component regression uses principal components derived from these meteorological parameters. The dependent variable for this model was weekly strawberry yield and independent variables were principal components of meteorological parameters. The general form of model is as follows:
(1)
where Y is predicted weekly strawberry yield, X1Xp are principal components of meteorological parameters, m1mp represent estimated parameters for corresponding principal components, and ε represents residual error.

2.5. Cross Validation

This is a widely utilized statistical method to test model’s validity with independent dataset. There are various forms of cross validation where iteratively certain size of data is used for training and rest of them is used for evaluation. With leave one out cross validation approach, observed data are iteratively and exhaustively used for model testing, resulting in more reliable evaluation than getting estimates from the two-group partition method and less biased than estimates derived from calibration-dependent dataset [21]. This approach is specifically more efficient when there is limited observed dataset available.

3. Results and Discussion

3.1. Correlation Analysis

Table 1 shows statistically significant correlation of meteorological parameters with strawberry yield for Santa Maria region. It is evident from this analysis that the fall and winter weather conditions have significant influence on strawberry yields during their peak season, that is, during the month of May through July for Santa Maria region. This lagged correlation indicates potential for forecasting strawberry yields with the lead time of two to five months with acceptable level of accuracy.

Table 1. Correlation matrix of monthly meteorological parameters (Oct–Feb) and strawberry yields (Apr–July).
Weather parameters April May June July
Oct ETo (−) (+)
Oct Net radiation (−) (+)
Oct Max vapor pressure (+) (+) (+)
Oct Min vapor pressure (+) (+)
Oct Max relative humidity (+) (+)
Oct Min relative humidity (+)
Oct Dew point (+) (+)
Oct Maximum soil temperature (+) (+) (+)
Oct Minimum soil temperature (+) (+) (+)
Nov ETo (−) (+) (+) (+)
Nov Net radiation (−) (+) (+) (+)
Nov Max vapor pressure (+)
Nov Min vapor pressure (+)
Nov Maximum air temperature (−)
Nov Min air temp (+)
Nov Dew point (+)
Nov Maximum soil temperature (+)
Nov Minimum soil temperature (+)
Dec Net radiation (−)
Dec Max vapor pressure (+)
Dec Min vapor pressure (+)
Dec Dew point (+)
Dec Average wind speed (−) (−)
Dec Maximum soil temperature (+)
Dec Minimum soil temperature (+) (+)
Jan ETo (+)
Jan Net radiation (+)
Jan Max vapor pressure (−)
Jan Min vapor pressure (−)
Jan Min air temp (−)
Jan Max relative humidity (−)
Jan Min relative humidity (−) (−)
Jan Dew point (−)
Jan Average wind speed (−)
Jan Maximum soil temperature (−)
Jan Minimum soil temperature (−)
Feb Net radiation (+) (−)
Feb Average wind speed (−)
Feb Maximum soil temperature (−)

Net radiation during the fall season generally showed positive correlation with late season strawberry yield. Solar radiation has direct impact on strawberry growth and development, as it is the source of energy that strawberry plant utilizes during photosynthesis.

Results show that the relative humidity during the month of October is positively correlated with peak strawberry yields whereas the relative humidity during the month of January is negatively correlated with strawberry yields. Vapor pressure which is calculated based on relative humidity also showed similar correlation trend with strawberry to that of relative humidity. It has been documented in the literatures that the increase in relative humidity tends to increase fruit weight. It is also associated with increased leaf expansion and increase in photosynthesis, which can justify positive correlations with strawberry yields. However, high humidity could also result in tip burn for strawberry plants [7, 8], which could reduce strawberry yield.

Soil temperature during the fall time showed positive correlations with strawberry yield. This could be due to the fact that soil temperature during the early stage of strawberry might provide favorable conditions for plant establishment. However, soil temperature during January and February showed negative correlations with strawberry yields. Dew point temperature, that is, a temperature at which dew can start to form, during the fall season showed positive correlation with June and July strawberry yields. If dew point goes down, there are increasing concerns of frost damage to crops and thus the higher the dew point, the lower the risk for strawberry plants. Wind speed during December and January was negatively correlated with strawberry yields. Excessive wind speed can create bruising on the leaves and could impact strawberry yields. These findings are consistent with the literature. For instance, [6] found 56% increase in the yield of the strawberry with reduction in mean wind speed from 1.6 m/s to 1.1 m/s.

It is evident that many meteorological parameters during the early stages of strawberry growth and development phase exhibit statistically significant correlation with strawberry yields from April to July. This finding is consistent with what [9] studied for strawberry and other crops in California. They examined correlations at state average strawberry yield data on a yearly time scale. This study analyzed correlations on weekly timescale and also developed principal component models to provide weekly strawberry yield forecasts with the lead time of 2 to 5 months.

3.2. Yield Forecasting

Figure 1 and Table 2 show the predictability measures of weekly strawberry yield using meteorological parameter based principal component regression models. Figure 1 shows observed versus predicted yields on 1 : 1 line and good agreement between observed and predicted strawberry yields can be observed. The root mean squared error (RMSE) between observed and cross-validated strawberry yield is 747 kg/ha, 627 kg/ha, 518 kg/ha, and 384 kg/ha for April, May, June, and July, respectively. These agreements between observed and predicted strawberry yields are also statistically significant at 0.05 probability level.

Table 2. Observed versus cross-validated strawberry yield.
Year April strawberry yield (kg ha−1) May strawberry yield (kg ha−1) June strawberry yield (kg ha−1) July strawberry yield (kg ha−1)
Observed Predicted Residuals Observed Predicted Residuals Observed Predicted Residuals Observed Predicted Residuals
2007 Week 1 2152 1772 380 4710 4913 203 3409 2782 626 1406 1915 510
2007 Week 2 2944 2960 16 4807 5387 580 2436 2666 231 1278 1523 245
2007 Week 3 2930 2872 57 5091 4750 341 1734 2283 549 936 1126 190
2007 Week 4 2632 2406 226 3803 3395 408 1613 1279 334 810 954 143
2008 Week 1 1395 2509 1113 3686 3774 87 3049 2341 707 1441 949 493
2008 Week 2 2245 2318 73 4138 4248 110 2504 2514 10 940 1706 766
2008 Week 3 2289 2273 15 4300 4332 32 1521 2137 616 1021 983 38
2008 Week 4 2675 3084 409 3550 3675 125 905 1531 627 782 1074 292
2009 Week 1 1782 1952 169 4147 4856 708 2655 3278 623 1186 1798 612
2009 Week 2 2432 3009 577 4108 4383 275 2473 1850 623 921 1319 398
2009 Week 3 2823 2694 130 3368 3592 223 1216 2031 815 827 1139 311
2009 Week 4 2136 3836 1699 2718 4088 1370 1119 2041 923 740 1047 307
2010 Week 1 1893 2191 298 4664 4128 536 4139 2808 1332 1792 1602 190
2010 Week 2 1849 2738 888 4398 3598 800 2477 2364 113 1763 1502 261
2010 Week 3 2417 1845 571 4159 4120 39 1907 2473 566 1271 1647 376
2010 Week 4 2702 2326 376 3644 3535 109 1512 1799 287 1032 856 177
2011 Week 1 880 1816 937 3274 4823 1549 2980 3091 110 2643 1873 770
2011 Week 2 1883 2115 232 5332 4492 840 3337 2853 483 2074 2129 56
2011 Week 3 3224 4020 796 4652 3683 968 3209 2757 452 1636 1401 236
2011 Week 4 3283 3132 151 3853 3597 257 2070 1451 619 1454 1149 305
2012 Week 1 1047 2042 995 4811 4696 115 3393 3227 166 1847 1927 80
2012 Week 2 1762 3238 1476 5343 4999 344 2677 2531 146 2039 1372 667
2012 Week 3 3545 2930 616 5636 4556 1080 2172 2213 40 1708 1487 221
2012 Week 4 2294 2963 668 3985 3847 138 2052 1476 576 1198 1214 16
2013 Week 1 1784 1791 7 5716 5054 662 2874 3454 580 2233 1816 417
2013 Week 2 4390 2787 1604 5441 5062 379 2225 2028 197 1539 1673 135
2013 Week 3 4613 3302 1311 4599 4691 93 2220 2520 300 1364 1307 57
2013 Week 4 3811 3367 444 3079 3644 565 1752 1259 492 1036 802 233
2014 Week 1 1927 1546 381 4593 4197 396 2122 2456 334 1992 1448 543
2014 Week 2 3041 3407 366 4630 4057 573 2406 1968 438 1759 1253 506
2014 Week 3 4282 3299 983 4227 3915 312 2018 2078 60 1344 1198 146
2014 Week 4 3124 2755 369 2960 3200 239 1426 1799 373 934 1193 260
2015 Week 1 3152 2148 1004 4233 4931 698 3351 3009 343 1358 1371 12
2015 Week 2 3449 2953 496 4008 3741 267 2937 3010 73 1325 1025 300
2015 Week 3 3731 3100 631 3209 4143 934 2394 2721 327 547 1288 741
2015 Week 4 2715 2691 24 2512 3490 978 1615 1544 71 937 1088 151
Details are in the caption following the image
Observed and cross-validated strawberry yield forecasts for Santa Maria region for April (a), May (b), June (c), and July (d).
Details are in the caption following the image
Observed and cross-validated strawberry yield forecasts for Santa Maria region for April (a), May (b), June (c), and July (d).
Details are in the caption following the image
Observed and cross-validated strawberry yield forecasts for Santa Maria region for April (a), May (b), June (c), and July (d).
Details are in the caption following the image
Observed and cross-validated strawberry yield forecasts for Santa Maria region for April (a), May (b), June (c), and July (d).

Skills of these forecasts are higher for the month of June compared to other months. This is because higher number of meteorological parameters exhibited significant correlations. However, given the fact that these forecasts are obtained with 2 to 5 months of lead time, these empirical models showed potential for early estimates on expected yields.

It is important to note that there are limitations on how much variability in yield data that can be explained by meteorological parameters as many other factors such as management practices, pests, and diseases can also significantly impact yield variability. Additionally, strawberry yield data obtained from California Strawberry Commission provides an average estimate for Santa Maria region. That may add some uncertainty in calculation.

In this study we explored the use of meteorological parameters in developing and testing forecasting models that can provide yield forecasts with certain lead time, which can enable growers to make strategic decisions. This study only looked at meteorology based yield forecasts. Skills of these models can be further improved by adding physiological parameters of strawberry to existing models for strawberry. Additionally, there are various other forecasting approaches documented in the literature. Efforts should be made to compare these various approaches to enhance forecasting skills as well as increase the lead time of yield forecasts.

4. Conclusions

This study analyzed correlations on weekly timescale and also developed principal component models to provide weekly strawberry yield forecasts with the lead time of 2 to 5 months. Several meteorological parameters exhibited significant correlations with strawberry yields. Principal component regression models developed using meteorological parameters provided promising strawberry yield forecasts for Santa Maria strawberry production region. Agreement between observed strawberry yield and cross-validated yield forecasts was statistically significant for April through July. Future research could evaluate skills of empirical models that combine both meteorology and agronomic variables.

Competing Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

    The full text of this article hosted at iucr.org is unavailable due to technical difficulties.