Background

Grape is among the most ancient, widely cultivated fruit crops. The keen interest in the understanding of grape berry ripening is justified by the economic relevance of the quality of grapes and their processed products, such as wine, juice and dried fruit. The onset of the genomic era has brought unprecedented progress into our knowledge of Vitis molecular biology, and powerful genetic and genomic tools are now available or are being developed, such as map** populations, genetic and physical maps [1, 2], an extensive ESTs collection (342,576 sequences deposited at the NCBI EST database-March 19, 2007, [3]) and several tools for gene expression analysis. These resources enable us to identify grape as a model plant for studies on non-climacteric anthocyanin accumulating fruits, beside tomato, which is currently the model plant for climacteric carotenoid accumulating fruits. Grape berry development from anthesis to ripening is classically divided into three phases on the basis of chemical and morphological traits [4]: berry formation, characterized by exponential growth of the berry and accumulation of organic acids, mainly malate, in the vacuole; véraison, a transition phase during which growth declines and berries start to change colour and soften; ripening, characterized by an increase in pH, marked berry growth and accumulation of sugars, anthocyanins and flavour-enhancing compounds. Published studies based on EST analysis [57], differential display [8], cDNA-arrays [9] and oligo arrays [10] have shown distinct and extensive changes in the transcriptome occurring during ripening. These studies report up to 740 developmentally regulated genes mainly involved in defence, stress response, primary and secondary metabolism, berry growth and hormonal metabolism. The very recent study of Grimplet et al. [11], provides an extensive description of these functional classes investigating their tissue-specific expression in the ripe berry by means of the Affymetrix Vitis GeneChip®.

With the aim of understanding the transcriptional changes underlying grapevine berry ripening and the regulatory mechanisms acting at this level, we were able to identify 1477 genes modulated during berry development, comprising 282 regulatory factors in addition to the structural genes, by the use of the Affymetrix Vitis GeneChip®. The results, obtained from replicated experiments carried out in three growing seasons, provide also insights into the relationship between genetic and environmental effects on berry development control.

Results and Discussion

Grape berries phenotypic evaluations

Figure 1 shows the changes in average berry weight, organic acids, total soluble solids and anthocyanin content of grape berries sampled weekly during the 2003, 2005 and 2006 growing seasons. The date of véraison was set at approximately 8 weeks post-flowering (wpf) in all years. In order to characterize berry ripening, we chose three distinct moments according to [12]: the last stage of berry formation, time-point A (TP A, stage E-L 33), characterized by small hard green berries still accumulating organic acids; and two stages during berry ripening, TP B (stage E-L 34), just before véraison, characterized by berries in the green state with maximum acidic content starting to soften and TP C (stage E-L 36), when the ripening process is well established and berries are growing fast, colouring and accumulating sugars. The identification of the corresponding developmental stages in the three years was based on the biochemical profiles. As average temperatures during 2003 remained constantly unusually higher from April to October [see Additional file 1], flowering and harvest occurred approximately one week earlier in 2003.

Figure 1
figure 1

Biochemical changes observed during Pinot Noir berry development in 2003 (red), 2005 (blue) and 2006 (green). Total acids (expressed as grams of tartaric acid per liter of must), total soluble solids, and anthocyanins content and berry weight averaged on a pool of 50 berries are reported. The dashed area indicates véraison. The weeks post flowering (wpf) corresponding to the samples used for the hybridization experiments are enclosed in boxes.

Statistical analysis

RNA extracted in triplicate from berries harvested at three time points was used to hybridize 27 Affymetrix Vitis GeneChips®. Datasets from each season were analyzed separately. The quality of the biological replicates was evaluated by the R-squared coefficient [see Additional file 2], which ranged from 0.89 to 0.99. Minimum R-squared values were found for TP B samples, suggesting a higher sample heterogeneity approaching véraison, when dramatic transcriptional changes occur in less than 48 hours [10].

Principal Component Analysis (PCA) confirmed the uniformity of the biological replicates as the nine groups of replicates clusterised tightly (Fig. 2). The greatest variance in gene expression was found between samples from the three growing seasons, as they were separated along the first component (51% of the total variance). In particular, the largest difference was evident between the 2003 expression data compared to the 2005 and 2006, suggesting that the environmental conditions greatly affect the berry transcriptome. [see Additional file 1]. On the other hand, the second principal component (24% of the total variance) separated neatly the pre-véraison stages (TP A and TP B) from the post-véraison one (TP C), confirming previous observations of extensive transcriptional changes occurring from berry formation to berry ripening [10].

Figure 2
figure 2

Principal component analysis (PCA) of the 27 expression datasets. Scaled expression data relative to 2003 (red), 2005 (blue) and 2006 (green) samples are sharply separated according to the sampling year (first principal component) and to the developmental stage (second principal component). Biological replicates relative to time point A, B and C are represented as circles, triangles and downwards triangles, respectively.

Class comparison analysis by means of Significance Analysis of Microarray [13], imposing a minimum fold change of 2 in at least one comparison (TP A vs. TP B and TP B vs. TP C), produced three sets of 2632, 4238 and 3856 differentially expressed probesets for the 2003, 2005 and 2006 samples, respectively. The size difference between the three sets likely reflects inter-seasonal biological differences. The comparison of the three datasets identified a common set of 1700 modulated transcripts and among these, 1477 transcripts with a conserved profile in the three years (Pearson's correlation coefficient >0.5 in any pairwise comparison). This 1477 core dataset, which corresponds to 9% of the chip probesets, is listed in Additional file 3. It represents a highly conserved expression network reasonably involved in an internal basic functional program, such as berry ripening.

Annotation of the differentially expressed genes

Automatic annotation of all the Vitis GeneChip® sequences was performed using the Gene Ontology (GO) classification [14]. The core set of 1477 differentially expressed genes was subsequently manually checked and integrated with additional GO 'biological process' terms. Transcripts were then grouped into 17 GO functional categories [see Additional file 3 and 4] whose distribution among the entire chip and the core set was statistically compared (Fig. 3). Nine categories were differentially represented at significant levels during berry development and five of these were overrepresented in the regulated gene core set. These were "carbohydrate metabolism", "cell wall organization and biogenesis," "response to stimulus," "secondary metabolism" and "photosynthesis," all processes that ought to be specifically regulated during grape berry development. The four underrepresented categories were "nucleobase, nucleoside, nucleotide and nucleic acid metabolism," "protein biosynthesis," "protein metabolism" and "transport." These classes may either play a minor role in the developmental program or their involvement could be restricted to a small subset of genes.

Figure 3
figure 3

Functional categories distribution in the core set of the modulated genes (red) and in the entire Vitis GeneChip® (grey). Frequencies are calculated as percentage of the whole number of 'GO biological process' terms (1,825 and 21,890 in the modulated and chip sets, respectively). Nine classes, marked by an asterisk and written in bold, resulted differently represented in the modulated set compared to the entire chip after statistical analysis (p-value < 0.001). In italics are the five categories overrepresented in the modulated set.

The functional class distribution of the Vitis GeneChip® was also compared to that of the Arabidopsis genome available at TAIR ([15], [Additional file 5]). The overall agreement in the class distributions confirms the Vitis GeneChip® as a reliable toolkit for the investigation of the grape expressed genome.

Cluster analysis and functional categories modulation during berry ripening

Cluster analysis of the gene core set was based on the k-means method using Pearson's correlation distance calculated on the gene expression profiles. Transcripts were divided into eight groups representing the minimum number of profiles that could be obtained with three time-points. We observed good agreement between clustering in the three gene sets (80% of the transcripts fell in the same cluster in all seasons). We decided to use the 2005 expression data for the cluster representation (Fig. 4), due to the smaller variance among replicates [see Additional file 2]. As expected from PCA, the two most populated clusters were number 4 and 8, composed by genes modulated positively and negatively from TP B to TP C, respectively. Conversely, clusters 2 and 6, consisting of genes modulated only during the pre-véraison interval, were the less populated ones. Approximately the same number of genes was positively or negatively modulated along the whole study interval (cluster 3 and 7). Genes specifically induced around véraison should fall in cluster1 while genes with roles in early and late stages of berry development should fall in cluster 5.

Figure 4
figure 4

Cluster analysis of the expression profiles of the modulated core set. The expression profiles of the 1477 modulated genes during P. Noir berry ripening were clusterised in eight clusters which represent the minimum number of profiles considering three time points. Clusters were obtained by the k-means method using Pearson's correlation distance. The representative profile and the number of genes in each cluster are indicated.

Functional class distribution frequency was then calculated for each cluster and represented as histogram [Additional file 6]. To better understand the modulation of the functional classes during berry ripening, their frequencies were determined after splitting the experimental interval into two phases: one from TP A to TP B and one from TP B to TP C. The genes modulated during the first phase derived from clusters 1, 2, 3 (induced) and 5, 6, 7 (repressed), while the genes modulated during the second phase derived from clusters 3, 4, 5 (induced) and 1, 7, 8 (repressed). During the pre-véraison phase, the number of the induced genes (408) exceeded that of the repressed ones (270), while the opposite occurred during the following phase (651 induced vs. 1118 repressed). As depicted in Figure 5, all but two categories showed a general tendency towards an opposite modulation between the first and the second phase. Furthermore in the first phase a general bias towards induction for the GO categories involved in regulatory mechanisms, namely "nucleobase, nucleoside, nucleotide and nucleic acid metabolism" (mainly transcription factors), "response to endogenous stimulus" (mainly hormone metabolism), "signal transduction," and "protein metabolism" was observed. This suggests a strong cell re-programming taking place in berry cells up to véraison. During the second phase, a marked negative regulation is evident for categories involved in cell division, such as "organelle organization and biogenesis" and "protein biosynthesis," and "photosynthesis." This behaviour is coherent with the slowing down of cell replication and the loss of photosynthetic capacity. On the other hand, the classes "amino acid and derivative metabolism," "carbohydrate metabolism," "cell wall organization and biogenesis," "development," "lipid metabolism," "primary" and "secondary metabolism" display a positive regulation. This indicates the prevalence after véraison of metabolic processes involved in cell wall loosening and synthesis, sugar accumulation and synthesis and transport of metabolites responsible of grape colour and flavour. Finally, the class "response to stimulus" appeared induced in both phases, suggesting that throughout the entire berry development process the cell devotes considerable effort to face different kinds of stimuli, presumably including osmotic, oxidative and biotic stress.

Figure 5
figure 5

Functional categories distribution before and after-véraison. Induced and repressed functional categories during the first stage (from TP A to TP B, green histograms) derive from clusters 1, 2, 3 and 5, 6, 7, respectively, while categories induced and repressed during the second stage (from TP B to TP C, violet histograms) derive from clusters 3, 4, 5 and 1, 7, 8, respectively. The relative number of induced (Ni) or repressed (Nr) genes for a specific functional category was calculated with respect to the total number of induced or repressed genes in each stage. The estimate of induction or repression within each functional category was then calculated as follows: estimate of induction/repression = (Ni - Nr)/(Ni + Nr). Positive values represent an overall induction and negative values an overall repression.

Gene regulation of grape berry ripening

Our study of the berry ripening process identified 125 transcription factors, 65 genes involved in hormone metabolism and response and 92 involved in signal transduction, corresponding to 8.5%, 4.4% and 6.2% of the regulated core set, respectively. These figures and similar data recently obtained for tomato and Citrus fruit [16, 17], suggest that a consistent fraction of the modulated fruit transcriptome is devoted to the control of the developmental program. In this section we present our main contribution to the knowledge of gene regulation during berry ripening in grapevine. A restricted list of selected genes involved in the process and discussed in the text is reported in Table 1 with the specific AffyID, whereas the complete list is provided in part A of Additional file 7.

Table 1 Selection of genes putatively involved in the regulation of berry ripening.

Hormones metabolism and signalling

Amongst the genes related to hormone metabolism in the core set, those related to auxin and ethylene were the most represented, followed by those related to abscisic acid and brassinosteroids. Few genes related to cytokinines, gibberellins or jasmonic acid were found. These observations are consistent with our knowledge of the relative importance of these signalling pathways during grape berry development [18, 19].

In fruits from both climacteric and non-climacteric species, auxins are known to be more abundant immediately after ovary fertilization and to mediate important signals for the onset of fruit development [20, 21]. In grape, it has been generally accepted that indole-3-acetic acid (IAA) content reaches its maximal level just after anthesis and then declines to very low levels in the ripe fruit [22], even if a recent study on Vitis vinifera cv Cabernet Sauvignon did not confirm it [23]. The expression of genes involved in cell wall metabolism (GRIP4), anthocyanin synthesis (CHS and UFGT) and sugar storage (GIN1) and the accumulation of the correlated metabolites were all shown to be retarded by auxin treatments at véraison [19].

In the present study, we have identified a gene homologous to the Arabidopsis amidase AtAMI1, which in vitro synthesizes IAA from indole-3-acetamide [24], and its descending profile during ripening agrees well with the decline in IAA levels observed in Vitis labrusca berries [22]. We have also found a gene homologous to an Arabidopsis IAA-amino acid synthetase which may contribute to IAA intracellular homeostasis via amino acid conjugation of excess IAA [25]. Two auxin carriers (an AUX1-like and a PIN1-like), putatively mediating auxin efflux, were also expressed before véraison.

We detected numerous auxin-regulated genes belonging to the Aux/IAA, Small Auxin-Up RNA and GH3 gene families. These families are characterized by the presence of an Auxin Responsive Element in the promoter region of their members, recognized by specific Auxin Response Factors (ARFs). Two ARFs, ARF5 and ARF18 and an auxin receptor of the ABP family were found expressed during the pre-véraison stage and then steeply repressed during ripening. With the exception of two Aux/IAA proteins, which were induced during ripening, all the other transcripts presented profiles with either a peak of expression around véraison or a decreasing profile after it. The auxin responsive genes seem therefore timely co-ordinated with the cellular auxin level, reflecting its prevalent role in the first stages of berry development. We have also detected a gene (1612001_s_at) homologous to a Capsicum chinense GH3 gene, which has been proposed as point of convergence between auxin and ethylene signals in non-climacteric fruit development [26]. While in pepper this GH3 transcript is up-regulated during ripening due to its ethylene-inducibility, our analyses showed a steep repression of the grape homologue after TP B.

The role of ethylene during development and ripening of some non-climacteric fruits was suggested since the 1970s and has recently been well documented [2729]. In grape, a small and transient increase of endogenous ethylene production occurs just before véraison, in concomitance with an increase in 1-aminocyclopropane-1-carboxylic acid (ACC) oxidase activity [29]. Our results well suit these reports, as we observed a peak of expression of ACC oxidase around véraison, while ACC synthase and a putative ethylene-forming-enzyme dioxygenase, detected with similar profile also by others [10], were expressed until véraison and then repressed.

We were not able to identify any receptor or kinase involved in ethylene signal transduction in grape, within the modulated gene set. Instead, we detected four transcription factors of the Ethylene Responsive Factors (ERFs) and AP2/EREBP families. The transcript homologous to tomato LeERF1 and a putative AP2/EREBP transcription factor were induced during berry ripening, while the homologous of ERF3b and another putative AP2/EREBP transcription factor were repressed. Considering that in tomato, tobacco and Arabidopsis, ERF1 genes are induced by ethylene and wounding and positively regulate downstream genes involved in biotic and abiotic stresses [30], the induced transcription factors that we have identified could be involved in ethylene mediated responses to plant stresses in grape as well [31, 32]. The Vitis homologues to osmotin, PR-1 and PR-4, downstream genes known to be regulated by ERFs in the species above mentioned, were indeed found to be induced in our study, suggesting that they could be under the same regulatory mechanism. The homologous of tobacco nitrilase NtNIT4B [33], another gene related to the transcriptional control of ethylene responsive defence genes, was found to be induced during ripening in our analysis as well. We could not thus exclude that an additional role of ethylene during grape berry development is related to biotic stress response.

Sequence annotation revealed the modulation of some transcripts homologous to the tomato genes ER6 and ER33, whose functions are still unknown but are reported to be induced by ethylene during tomato ripening [34]. In our experiment, however, these genes did not show strong modulation around véraison and they were all repressed during ripening. On the other hand, a transcript homologous to tomato E8, was found induced in grape, in agreement with tomato and strawberry gene expression data [35, 36]. Although E8 resembles ACC synthase structure, it apparently exerts a negative control on ethylene biosynthesis [35] and its expression seems affected by the homeotic transcription factor LeMADS-RIN [37]. E8 similar modulation in both climacteric and non-climacteric fruits raises the possibility of a common regulatory mechanism for ethylene homeostasis control in fleshy fruit ripening.

The phytohormone abscisic acid (ABA) regulates processes such as floral transition, embryo maturation, seed development and tolerance to abiotic and biotic stress [38]. In grape berries, ABA begins to accumulate at véraison and is thus considered a good candidate for triggering berry ripening [18]. The present analysis found three ABA-related transcripts induced after véraison: one is homologous to the pea nine-cis-epoxycarotenoid dioxygenase 4 and thus putatively involved in ABA synthesis; the other two are homologous to two ABA responsive proteins. Recently, a Vitis dehydrin, DHN1 [63] and pear [64], is the response to oxidative stress. In recent years a dual role for reactive oxygen species (ROS) in plants has been identified: toxic by-products of the aerobic metabolism and important regulators of growth, development and defence reactions. A complex network of genes finely tuning ROS concentration as the result of their production and scavenging has been documented [65].

The occurrence of oxidative stress during grape berry development has been rather controversial: catalase was identified among the abundant proteins in the ripe berry [57], but at the transcriptional level the typical oxidative stress markers seemed absent or negatively regulated [10]. The present study clearly indicated that oxidative processes take place also during berry ripening and that an enzyme-mediated scavenging system is activated. Berry H2O2 content increased significantly in correspondence of the véraison stage, reaching its maximum one-two weeks after, and then decreasing at a slower pace towards harvest time (Fig. 6). At least 32 transcripts of the modulated core set appeared involved in the enzymatic detoxification from H2O2 and the other ROS species which accumulate in the cell starting from véraison (Fig. 7). Some of them, such as ascorbate peroxidase, glutathione peroxidase and peroxiredoxins, catalyze the direct reduction of hydrogen peroxide to water; the others (thioredoxins, glutaredoxins, glutathione-S-transferases and metallothioneins) regulate the balance of the oxidized and reduced forms of the antioxidants ascorbic acid and glutathione. Although the regulation of the detoxifying enzymes must be very complex, isozyme specific, and occurring at different levels (transcriptional- and post-transcriptional regulation, subcellular compartimentalization, etc) as previously shown for some of them [61], most of the enzymes shown in Figure 7 appeared to be unchanged from TP A to TP B and be strongly modulated from TP B to TP C. Further studies are needed to address how ROS molecules are taking part to these regulatory circuits and what is the main reason of ROS accumulation at véraison. The superoxide dismutase (SOD) and catalase (CAT) transcripts which also codify for two important detoxifying enzymes were not significantly modulated in our study. The CAT gene (1618705_s_at) was highly expressed in all the three TPs, the SOD encoding transcripts (5 probes) showed very different expression levels but no one fell into the modulated core set.

Figure 6
figure 6

H 2 O 2 content during berry ripening in 2005 (blue) and 2006 (green) samples. Data are means ± SE of three replicates. The weeks post flowering (wpf) corresponding to the samples used for the hybridization experiments are enclosed in boxes. Véraison time is delimited within dotted lines.

Figure 7
figure 7

Genes involved in the response to oxidative stress modulated during ripening. List of the ROS-scavenging genes found in the modulated core set and comparative analysis of their expression in the three sampling TPs. Log2 ratios of the expression values observed in TPs A, B and C (B/A and C/B) are visualized by a colour scale, where red indicates an increase and green a decrease. For each functional group the corresponding enzymatic reaction is reported. * refer to probesets probably corresponding to the same transcript.

The Nudix gene family may also be involved in the oxidative stress response, as three transcripts were found modulated (one in cluster 3 and two in cluster 8). In Arabidopsis, Nudix hydrolases appear to be involved in the sanitization of the oxidized nucleotide pool accumulating during oxidative stress [66].

Ten genes involved in programmed cell death (PCD) appeared to be modulated. In particular, three transcripts with putative function of PCD suppression are grouped in cluster 1, while a putative death associated protein is induced after véraison (cluster 4). In addition, listed in the protein metabolism category there are two cysteine proteases (cluster 4) and a cystatin-like gene (cluster 5). Our data are insufficient to draw any conclusion, however they may suggest the occurrence of a PCD process in grape berry during ripening, as highlighted for strawberry [20].

Sugar/acid ratio and water balance

Our results support the model of sugar phloem unloading via an apoplastic pathway by means of plasma membrane sucrose and hexoses transporters as proposed by [67] and recently confirmed by [68, 75]. Two isoforms of 4-coumarate-CoA ligase with opposite profiles were detected (clusters 3 and 8) and their pattern of expression is consistent with an involvement into the anthocyanin and lignin synthesis pathways, respectively.

Eleven members of the ATP-binding cassette (ABC) and multidrug and toxic compound extrusion (MATE) transporters families which are involved in several cellular transport processes (reviewed in [76]), were modulated. Among them, the three transporters positively modulated during berry development (1610275_at, 1607763_at, 1616929_at) represent possible candidates for secondary metabolites transport.

Gene expression profiles highly influenced by seasonal variation

The influence on gene expression by environmental conditions is well reported in literature [77] and was evident in our PCA analysis (Fig. 2). The comparison of the three sampling years highlighted a large number of genes modulated only during one or two seasons, and thus excluded by the core set of ripening-specific genes: 938 genes in 2003, 2530 in 2005 and 2143 in 2006. Two examples are the isoform 2 of the phenylpropanoid biosynthetic genes chalcone synthase and chalcone isomerase whose expression profiles are depicted in Fig. 8E and 8F; such discrepancy in their modulation is indeed expected for non-ripening specific isoforms. Although these genes are probably highly affected by changes in the environment their GO functional categories distribution was not significantly different to that of the core set genes (data not shown). This result suggests that, on a long time frame, the plant reacts to the seasonal variations by adjusting the whole metabolism, and not just a part of it, to maintain homeostasis.

Figure 8
figure 8

Real time RT-PCR validation of the expression profiles of six genes highly influenced by the season. A: 1615199_at, Cryptochrome 1; B: 1616872_at, Pseudo response regulator 9; C: 1614764_at, VvHT2, hexose transporter 2; D: 1606663_at, VvChs3, chalcone synthase isoform 3; E: 1607732_at, VvChs2: chalcone synthase isoform 2; F: 1620424_at, VvChi2, chalcone isomerase isofom 2. 2003 profiles are represented in red, 2005 in blue and 2006 in green. Relative expression data of the array hybridization experiments were centered on the average of the log2 values of each season. Expression profiles measured by RT-PCR experiments (insets) were first centered on the mean Ct value calculated on the three seasons for each gene and then log2 transformed. RT-PCR data are reported as means ± SE of three technical replicates.

The group of genes modulated in all three seasons with different transcriptional profiles, deserves in our opinion a special discussion as they are likely involved into the ripening process but in the meantime highly influenced by the climatic conditions. These 223 genes represent about 13% of the modulated gene set (1700 transcripts). Among them, at least four transcripts are putatively involved in light perception and signalling: cryptochrome 1 (1615199_at) and pseudo-response regulator 9 (APRR9, 1616872_at) showed opposite behaviours in 2003 vs. 2005 and 2006 (Fig. 8A and 8B); pseudo-response-regulator 7 (APRR7, 1608006_at) displayed a "V" profile in 2003 and 2005 and the opposite pattern during 2006; finally LONG HYPOCOTYL 5 (HY5, 1609930_at) showed a different profile from TP A to TP B in 2003 and 2005 vs. 2006 while it was repressed in all three seasons from TP B to TP C. Interestingly, APRR7 and APRR9 seem involved in the temperature responsiveness of the Arabidopsis circadian clock, synchronizing photo- and thermo-cycles [78] and HY5 is a positive regulator of light signalling which acts downstream of several photoreceptors and mediates the response to different types of light (red/far red, blue and UV-B [79, 80]).

Out of the three previously characterized grape hexose transporters present on the chip, two (Vvht1 and Vvht2) seemed influenced by the season. Vvht1 (1616083_at) showed a "V" profile in all the three years, but with different slopes, suggesting a strong environmental conditioning along with the reported cultivar specificity [10, 81, 82]. Unlike Vvht1, Vvht2 is considered a ripening specific isoform [10, 82, 83]; however, the present work highlighted a certain degree of variability in its induction time in the three years (1614764_at, Fig. 8C). A similar behaviour was observed for another ripening specific gene, namely the isoform 3 of the chalcone synthase family [75, 84] (Fig. 8D).

To ensure that the observed differences among seasons and among gene isoforms were not technical artefacts, the expression profile of six genes was validated by real time RT-PCR (Fig. 8, insets). In all but one case (Vvht2 year 2003) there was complete agreement with the array data.

Conclusion

The time-course gene expression analysis of Pinot Noir berry development, based on a systematic and genome-wide approach, has identified the occurrence of two well distinct phases along the process. The pre-véraison phase represents a reprogramming stage of the cellular metabolism, characterized by the expression of numerous genes involved in hormonal signalling and transcriptional regulation. The post-véraison phase is characterized by the onset of a ripening-specialized metabolism responsible for the phenotypic traits of the ripe berry. A similar switch in fruit metabolism has been recently reported also in Citrus [16]. Five functional categories are overrepresented in this ripening-specialized metabolism: cell wall, carbohydrate and secondary metabolisms, stress response and photosynthesis; the latter being obviously switched off. An oxidative burst, previously not detected in grapevine and characterized by a rapid accumulation of H2O2 starting from véraison, was demonstrated. A concurrent modulation of the enzymatic ROS scavenging network was also highlighted, suggesting a possible role of ROS species in the regulation of berry ripening. We have found a very large number of transcription factors and transcripts related to hormonal metabolism and signal transduction that appear to be regulated during berry development. These represent 19% of the modulated gene set, corroborating the widely-accepted hypothesis that fruit development is under tight transcriptional control. These data provide the basis for subsequent correlation analysis aimed at highlighting gene co-regulation and metabolic networks [85]. Our strategy to perform the experiment during three seasons allowed us to restrict the dataset of modulated genes to those more likely to be ripening-specific. Nonetheless, we were able to separate ripening specific isoforms within gene families and to identify ripening related genes which appeared strongly regulated also by the seasonal weather conditions.

In perspective, the wealth of information we present here will provide a new extended platform to study the complex process of grapevine berry development and more in general of non-climacteric fruits.

Methods

Plant Material and Experimental Design

Ten grape clusters of Vitis vinifera cv. Pinot Noir were collected weekly between 8 a.m and 9 a.m, from flowering to over-ripening during 2003, 2005 and 2006 seasons at the IASMA study site 'ai Molini' (San Michele all'Adige-TN Italy). To be more representative of the canopy five clusters were collected from the north side (shady) and the other five from the south side (sunny). Berries were rapidly detached with their petioles and divided randomly into groups of 50 that were either immediately frozen for subsequent transcriptional analysis or subjected to biochemical analyses, including must acid and sugar content, average berry weight determination and anthocyanin concentration [68]. Based on these parameters, the peak of acidity just before véraison was estimated and two additional time-points (two weeks before and three weeks after véraison) were chosen for gene expression analysis. The three TPs correspond to the 33, 34 and 36 of the modified E-L system [12]. For each time-point three biological replicates were obtained by performing an independent RNA extraction from three sub-pools of 8 berries derived from the group of 50 berries. Consequently nine arrays were utilized for each season.

RNA Preparation, Array Hybridization, Data Analysis

Total RNA was extracted from frozen deseeded berries according to [86] and further purified using RNeasy spin columns (Qiagen). RNA quality and quantity were assessed by gel-electrophoresis and by absorbance measurements. Ten μg of each sample were sent to the IFOM (Milan-I) Affymetrix platform facility for probe synthesis, GeneChip® Vitis arrays hybridization, washing, staining and scanning with GeneChip Scanner 3000, according to the 'Affymetrix GeneChip® Expression Analysis Technical Manual'. Data referred to 2003, 2005 and 2006 were analyzed independently in order to obtain one set of differentially expressed genes for each season. Bioconductor packages [87] in R v.2.1.0 [88] were used for the statistical analyses. Data were pre-processed using the GCRMA method [89], which performs background correction, quantile normalization and summarization, and returns log2 converted expression values. Biological replicates quality was assessed by means of R-squared coefficient, which represents the fraction of variance explained by a linear model. For variation assessment of the gene expression values between all 27 data sets, data from the three seasons were scaled and Principal Components Analysis (PCA) was performed. In order to identify genes modulated during berry development, a filtering step based on the inter-quantile range method (IQR = 0.25) was used. This restricted dataset, enriched in differentially expressed genes, was suited to run a multiclass comparison method of Significance Analysis of Microarrays (SAM), with false discovery rate =0.15% [13]. SAM output was further restricted to genes with fold-change greater than 2 in at least one of the two comparisons (TP A vs. TP B and TP B vs. TP C). The final datasets relative to the three seasons were intersected obtaining a dataset of 1700 common modulated genes. Pearson's correlation coefficients between genes profiles relative each pair of seasons were then calculated and a threshold of 0.5 was fixed to obtain a berry ripening-modulated core dataset of 1477 genes. Expression profiles clustering was carried out by the k-means method (k = 12) with Pearson's correlation distance using the T-MeV software [90]. The number of clusters was manually reduced to eight, which represents the minimum number of expression profiles considering three time-points.

Affymetrix Vitis Array

The GeneChip® Vitis vinifera genome array (Affymetrix) consists of 16,436 probesets: 14,496 derived from V. vinifera transcripts and 1,940 derived from other Vitis species or hybrids transcripts. It can interrogate 12,908 GenBank accessions of V. vinifera and 1,547 of other Vitis species or hybrids. Sequences used in the design of the Vitis GeneChip® were selected from GenBank, dbEST, and RefSeq. The sequence clusters were created from the UniGene database (Build 7, Oct 2003). V. vinifera sequences represented on the chip correspond to 10,042 TIGR Tentative Consensus and 1,940 Singletons (Release 4, Sept 2004), while 102 GenBank accessions are not present in the TIGR database. Overall chip redundancy is estimated to be 16.6%.

Annotation

The consensus sequence of each Vitis GeneChip® probeset (available at [91] was analyzed to identify the coding sequence (cds) by means of an in-house developed method, which combines the results of two cds predictors: Estscan [92] and FrameFinder (at ESTate, [93]). 13,923 cds were obtainded and used to interrogate the Uniprot database (Sept. '05) by means of 'blastp', while the remaining 2,513 sequences were used to interrogate the same database by means of 'blastx' [94]. Blast results (E-value <e-10) with GO associated terms ([14]) were analyzed by the in-house developed program 'GORetriever' for annotation and 12,242 sequences were annotated with high confidence (> 80%). The automatic annotation results for the 1700 modulated genes were manually inspected and integrated with GO 'biological process' terms supported by literature evidences.

Genes were grouped into 17 functional categories based on GO 'biological process' terms by means of the in-house developed program 'GOSlimer' [see Additional file 4 for the list of the GO categories and Additional file 3 for genes annotation]. Functional categories distribution in the modulated and chip sequences were compared by means of Chi square and Fisher statistical tests (p-value < 0.001).

Determination of the H2O2 content

H2O2 content was measured by a fluorimetric assay based on the substrate 10-acetyl-3,7-dihydroxyphenoxazine which is oxidized in the presence of H2O2 and peroxidase, using the Amplex Red Hydrogen Peroxide/Peroxidase Assay (Molecular Probes, Eugene, USA) following the manufacturer's instructions. Ten frozen berries for each time point considered were ground in liquid nitrogen and 0.2 g of fine powder were then solubilized, for each replicate, with 0.5 ml of 50 mM phosphate buffer (pH 7.4) and kept 5 min on ice. After centrifugation at 20,000 g for 15 min, the cleared supernatant was extracted with an equal volume of 2:1 (v/v) chloroform:methanol mixture and centrifuged at 12,000 g for 5 min. To perform the H2O2 assay 50 μl of the aqueous phase were added to 50 μl of the reagent working solution. After 30 min of incubation at 25°C, relative fluorescence (excitation at 544 nm and emission at 590 nm) was measured in a black 96 wells plate. Absolute quantification was determined by the use of a H2O2 standard curve.

Real time RT-PCR

First strand cDNA synthesis was performed on one mRNA sample of the three biological replicates used for the microarray experiments using the SuperScript™ III Reverse Transcriptase kit (Invitrogen) according to the manufacturer instructions. Primers [see Additional file 8 for sequences] and cDNA were mixed with the Platinum® SYBR® Green qPCR SuperMix-UDG (Invitrogen) and the reaction was carried out on an ABI PRISM 7000 Sequence Detection System (Applied Biosystems). Cycling conditions were: 50°C for 2 minutes, 95°C for 2 minutes, then 40 cycles of 95°C for 15 seconds and 60°C for 1 minute. Raw data were analyzed with ABI PRISM 7000 SDS software to extract Ct values and with the LinReg software to calculate the reaction efficiency [95]. Relative expression of each gene (target) was then calculated according to the equation by [96] using actin for normalization (reference) and centered on the mean Ct calculated on the three seasons and on the three time points (control): RelExp = E(target)exp[ΔCt(target)(sample-control)]/E(reference)exp[ΔCt(reference)(sample-control)].

Data Availability

All microarray expression data produced by this work are available at EBI ArrayExpress [97] under the series entry E-MEXP-1282.