Chapter 3: Results - Shodhganga

Page created by Keith Burgess
 
CONTINUE READING
Chapter 3: Results - Shodhganga
Chapter 3:

             Results
Chapter 3: Results - Shodhganga
Chapter 3: Results

3.1. Morphological study of flowers

3.1. 1. Microscopic study

The microscopic observation of dissected matured flowers showed no structural differences
in first three whorls. All flowers of the two investigated rice lines have one lemma, one palea,
two lodicules, and six stamens at first, second and third whorls respectively (Table 5).
However, a significant alteration observed in the fourth whorl; when the normal rice (IR36)
flowers possess a single carpel, majority of the mutant rice (Jugal) flowers bear more than
one carpel. Carpels of both Jugal and IR36 flowers are presented on Figure 6. Figure 6a
shows a dissected mature Jugal flower which possesses three carpels, whereas, Figure 6b
shows a mature dissected IR36 flower having a single carpel. The Table 6 represents the
number of flowers with their carpel numbers; among the 200 IR36 flowers that were
investigated, all flowers have single carpels. Whereas, in case of Jugal, only 58 flowers
among the 200 investigated flowers have single carpels; the remaining flowers bear more
than one carpels. Interestingly, Jugal flowers with more than one carpels, showed presence of
two, three even in some cases four carpels also. Among the 200 Jugal flowers investigated,
89 flowers found to have two carpels. Similarly, the number of Jugal flowers with three
carpels and four carpels were 46 and 7 respectively among the 200 investigated Jugal
flowers.

Table 5. Number of floral organs in Jugal and IR36 flowers

                   Flower Lemma Palea Lodicules               Pistil   Carpel

                    Jugal        1        1          2          6       1-4

                    IR36         1        1          2          6        1

                                                                                             32
Chapter 3: Results - Shodhganga
Results

Figure 6. a Dissected Jugal flower with three carpels; b Dissected IR36 flower with single
carpel. The arrows indicate carpels.

Table 6. Details of carpel numbers of Jugal and IR36 flowers

                     Flower Flowers Flowers Flowers Flowers
                     Name     with    with    with    with
                              One     Two    Three   Four
                            Carpel Carpels Carpels Carpels
                      Jugal    58      89      46      07
                      IR36    200       0       0       0

                                                                                             33
Chapter 3: Results - Shodhganga
Results

3.1. 2. Scanning electron microscopy (SEM) study

The Scanning Electron Microscopic (SEM) observation of dissected mature flowers of both
Jugal and IR36 are presented in Figure 7. The image of Figure 7a represents a mature
dissected Jugal flower which showed to bear two carpels. Figure 7b displays a mature and
dissected IR36 flower possessing a single carpel. SEM analysis reveals that both the two
carpels of Jugal flower have developed from the same whorl and they are fused at the base.
SEM study also shows that the shape and the arrangement both the two carpels of Jugal
flower are almost similar to that of the IR36 carpel. Thus the SEM study confirmed the
presence of more than one carpel in Jugal flowers.

Figure 7. a SEM image of Jugal flower; the two arrows indicate presence of two carpels. b
SEM image of IR36 flower. The carpel was indicated by the arrow.

                                                                                            34
Chapter 3: Results - Shodhganga
Results

3.2. Characterization of floral meristem
Microscopic study of floral meristems showed that the floral meristems of Jugal are enlarged
when compared with IR36 floral meristems (Figure 8). The sections of Jugal and IR36 floral
meristems are presented in Figure 9.

                 Figure 8. a Jugal Floral Meristem. b IR36 floral meristem.

Figure 9. a Microtomic section of Jugal floral meristem. b Microtomic section of IR36 floral
meristem.

                                                                                         35
Chapter 3: Results - Shodhganga
Results

3.3. Hybridization program

Forty two traits of both Jugal and IR36 plants were studied in field for two successive years
(2015-16, and 2016-17), which showed significant differences between these varieties
regarding most of the traits studied. While Jugal has long stem, late heading time, few
panicles par plant, and late time of maturity, IR36 has very short stem, early heading time,
much more panicles per plant than Jugal and very early maturity time (Table 7).

Table 7. Characterization of Jugal and IR36 plants

  Sl            Characteristics                      States          Jugal         IR36
  No
  1     Coleoptile: Colour                Colourless/ Green/       Colourless Colourless
                                          Purple

   2    Basal leaf: Sheath colour         Green/ Light Purple/       Green        Green
                                          Purple Lines/
                                          Uniform Purple
   3    Leaf: Intensity of green colour   Light/ Medium/ Dark         Dark        Medium

   4    Leaf: Anthocynin coloration       Absent/ Present            Absent       Absent

   5    Leaf: Distribution of             On tip only/ On              NA          NA
        Anthocyanin coloration            margins only/ In
                                          blotches only/
                                          Uniform
   6    Leaf Sheath: anthocyanin          Absent/ Present            Absent       Absent
        coloration
   7    Leaf: Pubescence of Blade         Absent/ Weak/              Weak         Absent
        Surface                           Medium/ Strong/ Very
                                          Strong
   8    Leaf: Auricles                    Absent/ Present            Absent       Absent

   9    Leaf: Anthocyanin Coloration      Colorless/ Light             NA          NA
        of Auicles                        Purple/ Purple
   10   Leaf: Collar                      Absent/ Present            Present      Absent

   11   Leaf: Anthocyanin coloration      Absent/ Present            Absent        NA
        of collar
   12   Leaf: Ligule                      Absent/ Present            Present      Present

                                                                                            36
Chapter 3: Results - Shodhganga
13   Leaf: Shape of Ligule            Truncate/ Acute/ Spilt   Truncate   Truncate

14   Leaf: Colour of the Ligule       White/ Light Purple/      White      White
                                      Purple
15   Leaf: Length of Blade            Short (45cm)
16   Leaf: Width of Blade             Narrow(2cm)
17   Culm : Attitude                  Erect/ Semi Erect/        Erect      Semi
                                      Open/ Spreading                      Erect

18   Time of Heading (50% of the      Very Early (131
                                      Days)
19   Flag Leaf: Attitude of Blade     Erect/ Semi Erect/        Semi       Semi
     (Early Observation)              Horizontal/ Drooping      Erect      Erect

20   Lemma: Anthocyanin               Absent/ Very week/       Absent     Absent
     coloration on keel               Week/ Medium/
                                      Strong/ Very Strong
21   Lemma: Anthocyanin               Absent/ Weak/             Weak      Absent
     coloration of area below apex    Medium/ Strong/ Very
                                      Strong
22   Lemma: Anthocyanin               Absent/ Weak/            Strong     Absent
     coloration of apex               Medium/ Strong/ Very
                                      Strong
23   Spikelet: Color of Stigma        White/ Light Green/       Purple     White
                                      Yellow/ Light Purple/
                                      Purple
24   Stem: Thickness                  Thin(0.55cm)
25   Stem: Length (excluding          Very Short(150cm)

26   Stem: Anthocyanin coloration     Absent/ Present          Present    Absent
     of nodes
27   Stem: Intensity of Anthocyanin   Weak/ Medium/             Weak        NA

                                                                                     37
Chapter 3: Results - Shodhganga
coloration of nodes              Strong
28   Stem: Anthocyanin coloration     Absent/ Present         Absent     Absent
     of internode
29   Panicle: Length of Main Axis     Very Short (30cm)
30   Flag Leaf: Attitude of Blade     Erect/ Semi Erect/       Semi       Erect
     (Late Observation)               Horizontal/ Deflexed     Erect

31   Panicle: Curvature of main       Straight/ Semi          Straight   Straight
     Axis                             Straight/ Deflexed/
                                      Dropping
32   Panicle: Number per Plant        Few(20)
33   Spikelet: Colour of the tip of   White/ Yellowish/        Brown      White
     Lemma                            Brown/ Red/ Purple/
                                      Black

33   Lemma and Palea: Colour          Straw/                  Gold and   Gold and
                                      Gold and gold/           gold       gold
                                      furrow on straw
                                      background/
                                      Brown spots on straw/
                                      Brown furrows on
                                      straw/
                                      Brown(tawny)/
                                      Reddish to light
                                      purple/
                                      Purple spot/
                                      furrows on straw/
                                      Purple/
                                      Black
34   Panicle: Awns                    Absent/ Present         Absent     Absent

35   Panicle: Presence of Secondary   Absent/ Present         Present    Absent
     Branching
36   Panicle: Secondary Branching     Weak/ Strong/            Weak        NA
                                      Clustered
37   Panicle: Attitude of Branches    Erect/ Erect to Semi     Erect       NA
                                      Erect/ Semi Erect/
                                      Semi Erect to
                                      Spreading/ Spreading
38   Time of maturity (days)          Very Early(160)

                                                                                    38
Chapter 3: Results - Shodhganga
39    Sterile lemma: Color             Straw/ Gold/ Red/          Straw       Straw
                                          Purple
   40    Grain : Weight of 1000 fully     Very low(30gm)
   41    Grain: Length                    Very Short (12.5mm)
   42    Grain : Width                    Very                       Very       Narrow
                                          Narrow(3.5mm)

The crosses between Jugal and IR36 produce F1 progeny only in the combinations where
Jugal plants were selected as male parent donating pollen and IR36 plants were recipient of
those pollens and act as female parent (Figure 10). Interestingly, there were no F1 seeds
produced on crosses between Jugal as female parent and IR36 as male parent. However, Jugal
female plants produce seeds when they were self-pollinated which clearly indicates that they
are not female sterile.

Figure 10. a Seeds of F1 generation in cross between male Jugal and female IR36; b a viable
germinating seed of F1 generation.

                                                                                         39
Chapter 3: Results - Shodhganga
Results

     3.4. Study of sequence diversity of the major genetic loci involved in floral
     organ development

     3.4.1. PCR amplification and sequencing of Jugal floral organ developmental genes

     The details of the floral organ developmental genes selected for this present investigation are
     presented in Table 8, which included both the biological and the molecular function of the
     genes studied. Most of the genes selected for this study are transcription factors and belong to
     the gene groups described in ABCDE model; however, the DL, LOG, and FON1 genes are
     associated with development and maintenance of floral meristem.

     Table 8. Details of the floral organ developmental genes investigated

Sl        Genes        RAP-DP                             Functional Annotation
No                     Gene ID                 Biological Function            Molecular Function
 1         DL        Os03g0215200             Specification of carpel         Transcription factor
 2        LOG        Os01g0588900      Regulation of meristem development      Hydrolase activity
 3        FON1       Os06g0717200      Specification of floral organ number     Serine/threonine
                                                                                 protein kinase
4      OsMADS1       Os03g0215400         Floral meristem determinacy         Transcription factor
5      OsMADS3       Os01g0201700              Carpel development             Transcription factor
6      OsMADS6       Os02g0682200      Specification of floral organ identity Transcription factor
7      OsMADS7       Os08g0531700              flower development             Transcription factor
8      OsMADS8       Os09g0507200              flower development             Transcription factor
9      OsMADS13      Os12g0207000              Ovule development              Transcription factor
10     OsMADS17      Os04g0580700      Specification of floral organ identity Transcription factor
11     OsMADS18      Os07g0605200              flower development             Transcription factor
12     OsMADS21      Os01g0886200              Ovule development              Transcription factor
13     OsMADS34      Os03g0753100              flower development             Transcription factor
14     OsMADS58      Os05g0203800              Carpel development             Transcription factor

                                                                                                  40
Results

3.4.2. Bioinformatical analysis of the derived sequences

The sequences of PCR derived amplicons were first subjected to BLASTN with the reference
genome in ensemble plant database for quality check and the outcome of that is presented in
Table 9.

Table 9. BLAST analysis of the sequences

Sl     Gene          Gene ID          Genomic      Alignment Alignment E-value      Percentage
No     Name                            location      length    score                 identity
 1      DL        Os03g0215200     3:6042148 to      525        521      0.0           99.8
                                   6042672 (+)
 2     FON1       OS06G0717200    6:30470192 to      432         428        0.0        99.8
                                   30470623 (+)
 3      LOG       OS01G0588900    1:22955171 to      372         348        0.0        98.4
                                   22955542 (+)
 4   OsMADS3      OS01G0201700     1:5568017 to      527         511        0.0        99.2
                                   5568543 (+)
 5   OsMADS6      OS02G0682200    2:27883673 to      360         360        0.0       100.0
                                   27884032 (+)
 6   OsMADS7      OS08G0531700    8:26511898 to      467         463        0.0        99.8
                                   26512364 (+)
 7   OsMADS8      OS09G0507200    9:19659409 to      459         447        0.0        99.3
                                   19659867 (+)
 8   OsMADS1      OS03G0215400     3:6061106 to      174          70      7.0E-32      85.1
                                    6061279 (-)
 9   OsMADS13     OS12G0207000     1:5568068 to      181         169      6.1E-91      98.3
                                    5568248 (-)
10 OsMADS17       OS04G0580700    4:29308380 to      154          74      2.5E-34      87.0
                                   29308533 (+)
11 OsMADS18       OS07G0605200    7:24788591 to      122          46      9.8E-18      84.4
                                   24788712 (+)
12 OsMADS21       OS01G0886200    1:38501862 to      152          76      1.6E-35      87.5
                                   38502013 (+)
13 OsMADS34       OS03G0753100    3: 31054588 to     122          46      9.8E-18      84.4
                                   31054709 (-)
14 OsMADS58       OS01G0201700     1:5568065 to      178         166      3.7E-89      98.3
                                    5568242 (-)

In the above mentioned table, genomic location is the location of the query sequence on rice
genome. Alignment length is the full length of the alignment which includes all the gaps
presented either in the query (input) sequence or subject (hit) sequence. Alignment score was
calculated from the alignment, which mainly indicates the number of matches in the
alignment. Percentage identity is the percentage of the query (input) sequence identical to the
subject (hit) sequence.

                                                                                              41
Results

E-value is the probability that the alignment between the query sequence (input) and the
subject sequence (hit) due to chance; i.e. higher the E-value, higher the probability of error.
In this experiment though we sequenced 14 genes, 7 of them (OsMADS1, OsMADS13,
OsMADS17, OsMADS18, OsMADS21, OsMADS34, OsMADS58) were found to have higher
E-value and therefore excluded from further investigation.

The PCR derived sequences of FON1, DL, LOG, OsMADS3, OsMADS6, OsMADS7, and
OsMADS8 were submitted to Gene bank of NCBI with the accession numbers - MH053403,
MH053404, MH053405, MH053406, MH065616, MH065617, and MH065618, respectively
and were further analyzed bioinformatically. At first, the sequences were investigated to find
out genetic variations, which were indicated by sequence variation between Jugal floral organ
developmental gene sequences and the rice reference genome sequences of the same genes.
The outcome of pair-wise alignment between Jugal and the rice reference genome genes
showed that, six among the seven Jugal floral organ developmental genes studied possess
nucleotide sequence variations. Only, sequence of OsMADS6 did not showed any alteration
between Jugal and the reference genome. The total number of variations in each individual
gene sequences were counted, and also whether the variations were transitions (T/C and G/A)
or transvertions (T/G, C/G, T/A, and C/A) were estimated (Table 10). The sequence
variations are presented in Figure 11, where the nucleotide variations were highlighted with
colours.

To find out the consequences of nucleotide variations at amino acid level, the Jugal floral
organ developmental gene sequences were bioinformatically converted into their
corresponding amino acid sequences. The tool used for conversion of nucleotide conversion
was NCBI ORF finder (Wheeler et al. 2003), a web based tool which analyze Open Reading
Frame (ORF) of any sequence that had been input. The derived amino acid sequences were
pair-wise aligned which showed that three (FON1, LOG, and OsMADS8) among the seven
investigated genes have changes into their amino acid sequences (Table 10). In case of
OsMADS6, as there was no variation in nucleotide sequence thus there is no changes in
amino acids. Interestingly, DL, OsMADS3, and OsMADS7 genes have changes in their
nucleotide sequences, but still that did not confer any alterations in amino acid sequence. The
outcome of pair-wise alignment with amino acid sequence variations are presented in Figure
12.

                                                                                            42
Results

Table 10. Variations presented in nucleotide sequences of Jugal floral organ developmental
genes and their corresponding amino acid sequences

Sl     Gene          RAP-DP        Variations of Nucleotide     Variations of Amino acid
No                                         Sequence                     sequences
                     Gene ID
                                      No       Transition       Presence of      No of
                                      of         (Ts) or         variation     variation
                                   variation Transversion
                                                   (Tv)
1       DL       Os03g0215200          1           Ts-1             No              NA
2     FON1       OS06G0717200          1           Ts-1             Yes              1
3      LOG       OS01G0588900          6       Ts-2, Tv-4           Yes              4
4    OsMADS3     OS01G0201700          4           Ts-4             No              NA
5    OsMADS6     OS02G0682200         NA            NA              No              NA
6    OsMADS7     OS08G0531700          1           Ts-1             No              NA
7    OsMADS8     OS09G0507200          3        Ts-1, Tv-2          Yes              1

Figure 11. Variations presented in Jugal floral organ developmental genes. The variations are
highlighted with colours; green colour was used to highlight the reference genome sequences,
while the Jugal sequences were highlighted with red colour. a Alignment of DL; b Alignment
of FON1; c Alignment of LOG; d Alignment of OsMADS3; e Alignment of OsMADS7; f
Alignment of OsMADS8.

                                                                                          43
Results

Figure 12. Variations presented in Jugal floral organ developmental genes corresponding
amino acid sequences. The variations are highlighted with colours; green colour was used to
highlight the amino acids in reference genome sequences, yellow colour was used to Jugal
amino acids which are related to the reference genome amino acids; red colour was used to
highlight Jugal amino acids which are completely different than the reference genome
sequences. a Alignment of FON1; b Alignment of LOG; c Alignment of OsMADS8.

Table 11 contains the open reading frame analysis of the Jugal floral organ developmental
genes, which showed that out of the seven genes investigated only Jugal LOG gene have
changes in its open reading frame when compared with the reference genome genes. Six
among the seven genes studied, both in Jugal and the reference genome, the number of ORFs
present, presence of ORFs in positive (+) or negative (-) stand, the ORF staring nucleotide
number in sequence and the ORF ending nucleotide in sequence, length of the ORF by the
presence of number of nucleotide residues, and the number of amino acids that the ORF
codes are same. In case of LOG gene the starting point of ORF varies in Jugal and the
reference genome sequences. While in Jugal sequence, the ORF stars from the 357th
nucleotide, in the reference genome sequence the 330th nucleotide is the starting nucleotide.
However, both in Jugal and the reference genome LOG gene, the end of ORF are same (the
106th nucleotide). This change in LOG gene ORF starting point resulted alterations in the
length of ORF as well as the corresponding number of amino acid residues. While the
reference genome LOG gene ORF contains 225 nucleotides which encodes 74 amino acids,
the Jugal genome LOG gene ORF contains 252 amino acids and encodes 83 amino acids.

                                                                                          44
Results

Table 11. Open Reading Frame (ORF) analysis of Jugal and the reference genome floral
organ developmental genes by NCBI ORF Finder

 Sl       Gene       Genome       Number   Strand   Start   End       Length Amino
 No                                 of                of     Of       of ORF  Acid
                                   ORF              ORF     ORF         (bp) Residues
  1        DL        Reference      2        +       178    309         132     43
                       Jugal        2        +       178    309         132     43
  2      FON1        Reference      2        +       142    432         291     96
                       Jugal        2        +       142    432         291     96
  3       LOG        Reference      1        -       330    106         225     74
                       Jugal        1        -       357    106         252     83
  4    OsMADS3       Reference      3        +        96    419         324    107
                       Jugal        3        +        96    419         324    107
  5    OsMADS6       Reference      1        +        94    276         183     60
                       Jugal        1        +        94    276         183     60
  6                  Reference      5        -       123     1          123     40
       OsMADS7
                       Jugal        4        -       123     1          123     40
  7                  Reference      3        +       156    308         153     50
       OsMADS8
                       Jugal        3        +       156    308         153     50

3.4.3. Validation of variations

The Jugal gene sequences were further compared and analysed with the data of Rice SNP-
SEEK (Figure 13).

                    Figure 13: Screenshot of Rice SNP-SEEK database

                                                                                    45
Results

The Table 12 displays the locations of the Jugal floral organ developmental genes on the
selected five reference genomes. The Jugal floral organ developmental genes sequences along
with their corresponding gene sequences from the five reference genomes were multiple
sequence aligned (MSA). The MSA data showed that, although, all the seven Jugal genes
investigated possess variations, only three genes possess unique variations. The Jugal LOG,
OsMADS3, and OsMADS8 genes have unique variations in their nucleotide sequence. The
number of variations and unique variations are presented in Table 13. FON1, DL, OsMADS6,
OsMADS7 genes were also showed variation in multiple sequence analysis; however, those
variations are not unique, as these variations are not only present in the Jugal genome but also
present in other genome gene sequences investigated. The MSA data are represented in
Figure 14 and Figure 15, where the nucleotide variations were highlighted with colours.

Table 12. Details of the five reference genome sequences on SNP-Seek database
corresponding to Jugal floral organ developmental gene sequences

Sl    Gene        Nipponbare          93–11            DJ 123            IR 64         Kasalath
No
 1     DL        chr03:6042148-    chr03:6211078-   scaffold_35:58   scaffold_365:2   chr03:589065
                     6042672           6211602      1285-581809       57499-258023     2-5891176
2     FON1      chr06:30470192-   chr06:30891262-   scaffold_95:48   scaffold_90:48   chr06:290668
                    30470623          30891693       2612-483043     5579-486010      04-29067235
3      LOG      chr01:22955171-   chr01:24717061-   scaffold_76:53   scaffold_286:2   chr01:213437
                    22955542          24717435       9124-539498      89110-289481    84-21344158
4    OsMADS3     chr01:5568017-    chr01:5703633-   scaffold_357:1   scaffold_81:38   chr01:527858
                     5568543           5704159      86246-186772       0374-380900     3-5279109
5    OsMADS6    chr02:27883673-   chr02:28402000-   scaffold_122:2   scaffold_276:2   chr02:269515
                    27884032          28402359      41261-241620      50977-251336    56-26951915
6    OsMADS7    chr08:26511898-   chr08:29013349-   scaffold_130:5   scaffold_348:1   chr08:250821
                    26512364          29013814        2429-52894      89405-189870    16-25082581
7    OsMADS8    chr09:19659409-   chr09:18988348-   scaffold_235:1   scaffold_144:2   chr09:189003
                    19659867          18988806      20837-121295      18517-218975    44-18900802

                                                                                                  46
Results

Table 13. Study of uniqueness of variations of Jugal sequences

                     Sl       Gene           No of         No of unique
                     No                   variations in    variations in
                                             Jugal            Jugal
                     1        DL                1                0
                     2      FON1                2                0
                     3       LOG                7                3
                     4     OsMADS3              4                3
                     5     OsMADS6              1                0
                     6     OsMADS7              1                0
                     7     OsMADS8              3                3

Figure 14. Results of Multiple sequence alignment (MSA) of Nipponbare, DJ123, 9311,
IR64, Kasalath, and Jugal sequences. The variations which present only in Jugal sequences
are highlighted with red colour; variations present in rice genomes other than Jugal are
highlighted with yellow colour, the other variations of the alignment are highlighted by green
colour. a MSA of DL gene; b MSA of FON1 gene; c MSA of LOG gene.

                                                                                           47
Results

Figure 15. Results of Multiple sequence alignment (MSA) of Nipponbare, DJ123, 9311,
IR64, Kasalath, and Jugal sequences. The variations which present only in Jugal sequences
are highlighted with red colour; variations present in rice genomes other than Jugal are
highlighted with yellow colour, the other variations of the alignment are highlighted by green
colour. a MSA of OsMADS3 gene; b MSA of OsMADS6 gene; c MSA of OsMADS7 gene; d
MSA of OsMADS8 gene.

                                                                                           48
Results

3.5. Whole genome sequencing of Jugal

Whole genome sequencing of Jugal was done by next generation sequencing technology.

3.5.1. Raw data statistics

Sequencing the whole genome of Jugal rice line has generated two files (VB-156-2_1.fastq
and VB-156-2_2.fastq). A total 47.59 Million reads were obtained of which 39.31 Million are
high quality reads and 8.28 Million reads are of low quality (Table 14). The total number of
reads with non-ATGC bases are 1,59,142 (0.33%). Total no of bases are 4,806,304,372
among which total no of high quality bases are 4,418,550468 (91.93%) and the total number
of non-ATGC bases are 8,60,683 (0.01%).

Table 14. Raw Data Quality Summery

         Total No. of reads     Total High Quality (HQ)       Total Low Quality
                                         reads                      reads
             47,587,172                39,311,826                 8,275,346

                        Figure 16. Summary of the sequenced reads

                                                                                         49
Results

3.5.2. Pre-processing and mapping of reads

Whole genome sequencing of var. Jugal generated total 4.09 GB of high quality sequence
data. Total 4.76X107 paired-end raw reads with 101bp length were obtained with an average
depth of coverage of 10X. After quality check 4.58X107 (96.2%) cleaned and high quality
reads were identified and studied for variant analysis and other downstream analysis (Figure
16). Around 4.08X107 (89.02 %) reads were mapped properly onto the reference genome and
out of which 3.39X107 reads were mapped uniquely which were further used for variant
discovery analysis. The aligned reads were converted into BAM file and submitted to the
Sequence Read Archive (SRA) at NCBI with the accession number SRP131720. Analysis of
mapped reads showed that maximum reads (3,966,575) were mapped to chromosome 1,
followed by chromosome 3 (3,550,428) and chromosome 2 (3,358,620); Chromosome 9 has
the minimum number of mapped reads (2,002,568) (Table 15). The unmapped reads were
further de novo assembled which was described later.

Table 15. List of chromosome number with uniquely mapped reads into the reference
genome

        Sl
        No          Chromosome            Oryza sativa japonica Uniquely paired
                     (Accession)          cv. Nipponbare (bp) mapped reads

         1         Chromosome 1
                                                43270923              3966575
                   (NC_029256.1)
         2         Chromosome 2
                                                35937250              3358620
                   (NC_029257.1)
         3         Chromosome 3
                                                36413819              3550428
                   (NC_029258.1)
         4         Chromosome 4
                                                35502694              3007257
                   (NC_029259.1)
         5         Chromosome 5
                                                29958434              2816056
                   (NC_029260.1)
         6         Chromosome 6
                                                31248787              2829818
                   (NC_029261.1)
         7         Chromosome 7
                                                29697621              2637964
                   (NC_029262.1)
         8         Chromosome 8
                                                28443022              2593109
                   (NC_029263.1)
         9         Chromosome 9
                                                23012720              2002568
                   (NC_029264.1)
         10       Chromosome 10                 23207287              2208380

                                                                                         50
(NC_029265.1)

        11        Chromosome 11
                                               29021106             2433673
                  (NC_029266.1)
        12        Chromosome 12
                                               27531856             2290523
                  (NC_029267.1)

Table 16. Chromosome wise variant distribution and density per 100 kb

Sl   Chromosome          Oryza sativa      Total Variants        SNPs           InDels
No    (Accession)        japonica cv.       (Density per     (Density per     (Density per
                       Nipponbare (bp)        100 kb)          100 kb)          100 kb)
1    Chromosome 1         43270923            129,870          110,196          19,674
     (NC_029256.1)                            (300.1)          (254.7)           (45.5)
2    Chromosome 2          35937250           107,230           91,546          15,684
     (NC_029257.1)                             (298.4)          (254.7)          (43.6)
3    Chromosome 3          36413819           109,464           93,481          15,983
     (NC_029258.1)                             (300.6)          (256.7)          (43.9)
4    Chromosome 4          35502694            84,725           73,253          11,472
     (NC_029259.1)                             (238.6)          (206.3)          (32.3)
5    Chromosome 5          29958434            84,282           72,643          11,639
     (NC_029260.1)                             (281.3)          (242.5)          (38.9)
6    Chromosome 6          31248787            96,356           83,055          13,301
     (NC_029261.1)                             (308.4)          (265.8)          (42.6)
7    Chromosome 7          29697621            86,599           74,765          11,834
     (NC_029262.1)                            (291.6)           (251.8)          (39.8)
8    Chromosome 8          28443022            85,179           73,370          11,809
     (NC_029263.1)                             (299.5)           (258)           (41.5)
9    Chromosome 9          23012720            70,611           60,850           9,761
     (NC_029264.1)                             (306.8)          (264.4)          (42.4)
10   Chromosome 10         23207287            75,563           65,582           9,981
     (NC_029265.1)                             (325.6)          (282.6)           (43)
11   Chromosome 11         29021106            90,428           78,460          11,968
     (NC_029266.1)                             (311.6)          (270.4)          (41.2)
12   Chromosome 12         27531856            75,075           64,922          10,153
     (NC_029267.1)                             (272.7)          (235.8)          (36.9)
13   Mitochondrion          490520               66                50              16
         genome                                 (13.5)           (10.2)           (3.3)
     (NC_011033.1)
14    Other genomic            -                971              860              111
         regions
15        Total                -             1,096,419         943,033          153,386

                                                                                          51
Results

3.5.3. Variant (SNP/InDels) Discovery

Total 3,574,389 variants (3,226,099 SNPs and 348,290 InDels) were detected initially using
default parameters. Several filtering parameters were used to remove the false positive
variants and finally 1,096,419 variants were obtained which contain 943,033 SNPs and
153,386 InDels. Out of the 153,386 InDels, 75,051 were insertion and 78,335 were deletions.
Moreover, a total of 962,268 and 134,151 variants were identified as homozygous and
heterozygous respectively. Chromosome 1 has the maximum number of variants i.e. 129,870,
followed by Chromosome 3 (109,464 variants) and Chromosome 2 (107,230 variants); least
number of variants were found in chromosome 9 i.e. 70,611 (Figure 17 and Figure 18).
Density distribution of all the chromosome were calculated per 100 kb, chromosome 11 has
the maximum density i.e. 311.6, followed by 308.4 and 306.8 in chromosome 6 and
chromosome 9, respectively (Table 16). Chromosome 4 has the lowest density i.e. 238.6.
Structural variant analysis detected 12,143 structural variants (SV) which included 896 intra-
chromosomal translocation (ITX), 3,895 inter-chromosomal translocation (CTX), 6,754
deletion (DEL), 459 inversion (INV), 87 unknown (UN) and 52 insertion (INS) (Figure 19).

3.5.4. Analysis of Variants (SNPs and InDels)

The 1,096,419 variants included 1,200 multiallelic sites and 600 multiallelic SNP sites.
Among the 943,033 SNPs detected, 668,660 are transitions (T/C and G/A) and 274,373 are
transversion (T/G, C/G, T/A, and C/A). The ratio of transition to transversion is 2.44. The
frequency of transversions is comparatively lower than transitions. In transition, maximum
SNPs are C to T i.e. 181,812 count and A to G is minimum in count i.e. 152,353. Whereas in
case of transversion, T to A is maximum in count i.e. 39,127 and C to G has 25,035 which is
minimum in count (Figure 20). In InDels, the insertions variants are ranged from 1bp to 29bp
and deletions are ranged in between -1bp to -44bp. In both insertion and deletion, maximum
changes are in 1bp, followed by 2bp and 3bp (Figure 21).

                                                                                           52
Results

Figure 17. Chromosome-wise distribution of SNPs and InDels

Figure 18. Circular plot of variants across 12 rice chromosome

                                                                      53
Results

            Figure 19. Distribution of structural variants

  Figure 20. Transitions/transversion (Ts/Tv) distribution in SNPs

Figure 21. Length variation found in insertion and deletion in InDels

                                                                             54
Results

Table 17: Summary of variants in the selected 23 flowering genes

                   Sl     Gene Name         Gene_ID        Total No of
                   No                                       variants
                   1       OsMADS1        Os03g0215400         23
                    2      OsMADS2        Os01g0883100             7
                    3      OsMADS3        Os01g0201700         20
                    4      OsMADS4        Os05g0423400             7
                    5      OsMADS5        Os06g0162800         17
                    6      OsMADS6        Os02g0682200         23
                    7      OsMADS7        Os08g0531700         16
                    8      OsMADS8        Os09g0507200         21
                    9     OsMADS13        Os12g0207000         15
                   10     OsMADS14        Os03g0752800         26
                   11     OsMADS15        Os07g0108900         24
                   12     OsMADS16        Os06g0712700         13
                   13     OsMADS17        Os04g0580700         22
                   14     OsMADS18        Os07g0605200         18
                   15     OsMADS21        Os01g0886200             5
                   16     OsMADS34        Os03g0753100         20
                   17     OsMADS58        Os05g0203800         20
                   18        FON1         Os06g0717200             9
                   19        FON2         Os11g0595400         13
                   20         DL          Os03g0215200         32
                   21         LOG         Os01g0588900         11
                   22         SNB         Os07g0235800         25
                   23        OSH1         Os03g0727000         32

                                                                              55
Results

3.5.5. Variants Annotation

Annotation of the variants against the reference genome was done of which 821,472 variants
were found in intergenic region and 123,739 variants were in exon region. In addition to this,
40,408 missense variants, 508 nonsense, and 34,924 silent variants were detected. Ratio of
Missense/Silent ratio was 1.157 (Figure 22). Out of 1,096,419 variants, 24,920 were non-
synonymous SNPs which found in 11,529 genes. SNPs per gene in non-synonymous SNPs
were ranged from 1 to 49.

           Figure 22. Annotation and distribution of Variants (SNPs and InDels)

3.5.6. De novo assembly of unmapped reads

Total 1,601,994 paired end unmapped reads were used for de novo assembly of var. Jugal.
After assembly, total 16,585 contigs were generated with minimum read length between 56bp
with kmer of 55bp. Contigs with sequence length less than 200bp were removed. Finally,
16,159 contigs were remained with N50 of 977bp. Maximum contigs have length in between
200 to 299 i.e. 6653, followed by 2430 and 1343 in base pair ranged 300 to 399 and 400 to
499, respectively. Out of 16,159 contigs, 9,075 contigs were not showed any similarity
against NR database, while 7,084 contigs matched with known genes. Maximum hits were
found with Oryza sativa Japonica and Oryza sativa Indica i.e. 1,478 and 1,298, respectively
(Figure 23). Annotation and gene ontology of contigs were performed and sub categorized
mapped genes into cellular component, molecular function and biological process

                                                                                           56
Results

(Figure 24). Variants were also mined against de novo assembly of unmapped reads. Total
2,772 variants were detected which contains 2,636 SNPs and 136 InDels. Out of 2,772
variants, 2,542 and 230 were heterozygous and homozygous, respectively. Out of 2,636
SNPs, 1,670 and 966 were transitions (Ts) and transversion (Tv), respectively with Ts/Tv
ratio of 1.73. In case of InDels, 87 were insertions and 49 were deletions. In insertion and
deletion both were ranged from 1 to 5 and -1 to -5.

3.5.7. Analysis of the variants in the floral organ developmental genes
Analysis of the variations of the 23 floral organ developmental genes shows that the selected
genes in Jugal contained 419 SNPs. The detail information of the genes studied was provided
in Table 18 and the summery of variants in the studied genes were presented in Table 17.
Among the 23 floral organ developmental genes studied, DL and OSH1 genes found to have
highest number of variants (32 variants); whereas, MADS21 gene has lowest number of
variants (5 variants).

      Figure 23. Top hit species distribution of de novo assembly of unmapped contigs

                                                                                          57
Results

                            Figure 24. Gene ontology of contigs

3.6. Bioinformatical analysis

For detail analysis of floral organ development in Jugal, twenty three genes involved in
different floral organ development pathways were selected. Table 18 displayed details of the
genes selected for this study which include name of the genes, their accession numbers in
RAP-DB database (Sakai et al. 2013), function of the genes, accession number of the protein
products of each gene in UniProt database and reference of the genes.

For OsMADS58 gene, two isoforms of protein were presented in the UniProt database; these
isoforms are produced by alternative splicing of the OsMADS58 gene. Therefore, for
OsMADS58 both the two isoforms are considered as reference; the isoform1
(identifier: Q2V0P1-1) is considered as reference genome isoform-1, whereas, isoform2
(identifier: Q2V0P1-2) is considered as reference genome isoform-2 in all the investigations.

                                                                                            58
Results

Table 18. Details of the floral organ developmental genes

Sl     Gene          RAP-DB              Function             UniProt         Reference
No     Names       Accession No                             Accession No
1      FON1        Os06g0717200        Enlargement of         Q5Z9N5        Suzaki et al.
                                       floral meristem                          2004
2       FON2       Os11g0595400        Enlargement of        A0MH06         Suzaki et al.
                                       floral meristem                          2006
3        DL        Os03g0215200             Carpel            Q76EJ0       Yamaguchi et al.
                                         development                            2004
4       LOG        Os01g0588900        Maintenance of         Q5ZC82       Kurakawa et al.
                                       floral meristem                          2007
5       OSH1       Os03g0727000        Maintenance of       A0A0P0W324     Sato et al. 1996
                                         reproductive
                                           meristem
6        SNB       Os07g0235800        Maintenance of        B9FWB1        Lee and An 2011
                                       floral meristem
7     OsMADS1      Os03g0215400          MADS-box             Q10PZ9         Agrawal et al.
                                     transcription factor                        2005
8     OsMADS2      Os01g0883100          MADS-box             Q40702       Prasad et al. 2003
                                     transcription factor
9     OsMADS3      Os01g0201700          MADS-box             S5CT07         Li et al. 2011
                                     transcription factor
10    OsMADS4      Os05g0423400          MADS-box             Q40703        Kang and An
                                     transcription factor                        2005
11    OsMADS5      Os06g0162800          MADS-box             Q0DEB8        Kang and An
                                     transcription factor                        1997
12    OsMADS6      Os02g0682200          MADS-box             Q6EU39        Tao et al. 2018
                                     transcription factor
13    OsMADS7      Os08g0531700          MADS-box             Q0J466       Kang et al. 1997
                                     transcription factor
14    OsMADS8      Os09g0507200          MADS-box             Q9SAR1       Kang et al. 1997
                                     transcription factor
15   OsMADS13      Os12g0207000          MADS-box            Q2QW53        Dreni et al. 2007
                                     transcription factor
16   OsMADS14      Os03g0752800          MADS-box           A0A0P0W361      Pelucchi et al.
                                     transcription factor                       2002
17   OsMADS15      Os07g0108900          MADS-box             Q6Q9I2        Pelucchi et al.
                                     transcription factor                       2002
18   OsMADS16      Os06g0712700          MADS-box             Q944S9        Yun et al. 2013
                                     transcription factor
19   OsMADS17      Os04g0580700          MADS-box            Q7XUN2          Ohmori et al.

                                                                                         59
transcription factor                        2009
20   OsMADS18      Os07g0605200          MADS-box              Q0D4T4        Fornara et al.
                                     transcription factor                        2004
21   OsMADS21      Os01g0886200          MADS-box              Q8RU31       Dreni et al. 2013
                                     transcription factor
22   OsMADS34      Os03g0753100          MADS-box              Q6Q9H6        Gao et al. 2010
                                     transcription factor
23   OsMADS58      Os05g0203800          MADS-box             Q2V0P1-1 &    Yamaguchi et al.
                                     transcription factor      Q2V0P1-2         2006

The derived amino acid sequences were aligned with the reference genome amino acid
sequences which showed that nineteen among the twenty three investigated amino acid
sequences of Jugal, are similar to the reference genome amino acid sequences. However, four
translated gene sequences (OsMADS3, OsMADS17, OsMADS58, and DL) of Jugal showed
alterations when compared with the reference genome amino acid sequences.

The OsMADS3 of the reference genome consists of 90 amino acid residues and Jugal
OsMADS3 has 154 amino acid residues. Thus Jugal OsMADS3 has 64 additional amino
acids when compared with the reference genome OsMADS3. The pair-wise alignment of
Jugal and the reference genome OsMADS3 are presented in Figure 25.

Figure 25. Pair-wise alignment of Jugal and the reference genome OsMADS3 amino acid
sequence; the variations are highlighted by yellow colour.

The reference genome OsMADS17 protein possesses 249 amino acids, whereas Jugal
OsMADS17 consists of 254 amino acids. Thus the Jugal OsMADS17 has 5 more amino acid
residues in comparison to the reference genome (Figure 26).

                                                                                         60
Results

Figure 26. Pair-wise alignment of Jugal and the reference genome OsMADS17 amino acid
sequences; the variations are highlighted by yellow colour.

Among the two isoforms of the reference genome OsMADS58 protein, the isoform-1 has 272
amino acids and the isoform-2 bears 233 amino acids. The Jugal OsMADS58 has 233 amino
acids which is the same number of amino acids that the reference genome OsMADS58
isoform-2 possess but 39 amino acids less than that of the reference genome OsMADS58
isoform-1 possess. The multiple sequence alignment of both the two reference genome
OsMADS58 isoforms along with the Jugal OsMADS58 amino acid sequences are presented
in Figure 27.

Figure 27. Pair-wise alignment of Jugal and the reference genome OsMADS58 amino acid
sequence; the variations are highlighted by yellow colour.

                                                                                    61
Results

The reference genome DL protein has 194 amino acids; while the translated gene sequence of
Jugal DL has 196 amino acids, i.e. there were two additional amino acids are present in the
Jugal DL sequence. In Jugal DL, the 24th amino acid is a leucine and the 25th amino acid is a
glutamine, which are not present in the reference genome DL (Figure 28).

Figure 28. Pair-wise alignment of Jugal and the reference genome DL amino acid sequence;
the variations are highlighted by red colour.

The physiochemical properties of OsMADS3, OsMADS17, OsMADS58, and DL proteins
were estimated by ProtParam (Figure 29). The estimated physiochemical properties of the
selected proteins are presented in Table 19. The parameters estimated include total number of
amino acids, molecular weight, theoretical pI, aliphatic index, and grand average of
hydropathicity (GRAVY).

                            Figure 29. Screenshot of ProtParam

                                                                                          62
Results

 Table 19. Physiochemial properties of the proteins

Sl    Protein      Genome     Number Molecular Theoretical Aliphatic     Grand
No                               of   weight       pI       index      average of
                               amino                                 hydropathicity
                               acids
1    OsMADS3  Reference          90   9462.96     8.96      77.11        0.000
                Jugal           154  16636.77     9.51      71.69        -0.455
2    OsMADS17 Reference         249  28183.94     9.08      76.39        -0.739
                Jugal           254  28802.60     8.97      74.88        -0.765
              Isoform-1         272  30830.68     8.60      67.79        -0.824
3    OsMADS58 Isoform-2         233  25868.16     9.02      72.45        -0.744
                Jugal           233  25868.16     9.02      72.45        -0.744
4       DL    Reference         194   21663.9     8.98      62.27        -0.547
                Jugal           196  21905.19      8.9      63.62        -0.540

 Domain prediction of the OsMADS3, OsMADS17, OsMADS58, and DL proteins were done
 by using InterPro (Figure 30).

                                  Figure 30. Screenshot of InterPro

 Domain prediction by InterPro showed that there are no changes in domain number in all the
 four Jugal proteins in comparison to the reference genome proteins (Table 20). However,
 there are some alterations of start and end point of domains. Both Jugal and the reference
 genome OsMADS3 have a single domain which is a MADS-box domain. In the reference
 genome OsMADS3, the MADS-box domain starts in 41th amino acid and ends on 90th amino
 acid. The Jugal OsMADS3 also possesses a single domain (MADS-box domain) and its
 starting position is also similar to that of the reference genome OsMADS3 i.e. the 41th amino

                                                                                            63
Results

acid. However, the Jugal MADS-box domain ends at 115th amino acid, while the reference
genome MADS-box ends at 90th amino acid. Thus the Jugal OsMADS3 MADS-box domain
has 25 more amino acids when compared with the reference genome OsMADS3.

Both Jugal and the reference genome OsMADS17 bear two domains, a MADS-box domain
and a K-box domain. In the reference genome OsMADS17 protein, the MADS-box domain
starts at 1st amino acid and ends at 77th amino acid. While in Jugal OsMADS17, the MADS-
box starts at 6th amino acid and ends at 82th amino acid. Thus though the number of amino
acids in the Jugal OsMADS17 MADS-box domain same with the reference genome
OsMADS17, there is a shift of 5 amino acids in both the start and end of the domain. The
second domain present in OsMADS17 is a K-domain which starts in the reference genome at
87th amino acid and ends at 178th amino acid. Whereas in Jugal, the start and end of the K-
domain is the 92th and the 183th amino acid respectively, which is also a shift of 5 amino
acids.

There are no changes in domains of both the two isoforms of reference genome OsMADS58
(isoform-1 and isoform-2) and Jugal OsMADS58. All the three OsMADS58 proteins possess
two domains, a MADS-box domain and a K-box domain. In all the three OsMADS58
proteins, the MADS-box domain starts from the 43th amino acid and ends in the 117th amino
acid. Whereas, the K-box domain starts from the 128th amino acid and ends in the 219th
amino acid. Thus despite the differences in their length, there are no differences in the
domains of the reference genome OsMADS58 isoform-1, isoform-2 and Jugal OsMADS58.

The DL protein of both the reference genome and Jugal has a single domain which is a
HMG-box domain. The domain of the reference genome DL starts from the 92th amino acid
and ends at the 158th amino acid. While Jugal DL HMG-box starts in the 94th amino acid and
ends in the 160th amino acid. Thus, there is a shift of 2 amino acids on the Jugal DL domain.

                                                                                            64
Results

Table 20. Domain prediction by InterPro

     Sl      Protein      Genome       No of  Name of Location Name of Location
     No       name                    Domains Domain           Domain
                                                1                2
      1    OsMADS3       Reference       1    MADS      41-90   N.A.     N.A.
                                               BOX
                              Jugal      1    MADS     41-115   N.A.     N.A.
                                               BOX
      2    OsMADS17 Reference            2    MADS       1-77  K BOX    87-178
                                               BOX
                              Jugal      2    MADS       6-82  K BOX    92-183
                                               BOX
                         Isoform-1       2    MADS     43-117  K BOX 128-219
      3    OsMADS58                            BOX
                         Isoform-2       2    MADS     43-117  K BOX 128-219
                                               BOX
                              Jugal      2    MADS     43-117  K BOX 128-219
                                               BOX
      4        DL        Reference       1     HMG     92-158   N.A.     N.A.
                                               BOX
                              Jugal      1     HMG     94-160   N.A.     N.A.
                                               BOX

Protein Structures were predicted by RaptorX (Figure 31). RaptorX is a Web Portal for
Protein    Structure    and     Function   Prediction;    RaptorX     Structure       Prediction
(http://raptorx.uchicago.edu/StructurePrediction/predict/) is a server that predicts three-
dimensional structures of proteins.

          Figure 31. Screenshot of RaptorX server for protein structure prediction.

                                                                                             65
Results

The RaptorX server provides several parameters for interpretation of the results as well as
estimation of the quality of the predicted structure (Ma et al. 2013). The H, E, and C are
assigned for secondary structure prediction and represent Helix, Beta-sheet and Loop,
respectively. The P-value used to evaluates the relative quality of a predicted structure;
usually, smaller the P-value, the higher quality of the model. For alpha proteins, P-value less
than 10^-3 is a good indicator; whereas, for beta proteins, P-value less than 10^-4 is a good
indicator. uGDT(GDT) measures the absolute model quality; uGDT is the unnormalized
GDT (Global Distance Test) score; whereas, GDT is calculated as uGDT divided by the
protein (or domain) length and multiplied by 100. For a protein with >100 residues,
uGDT>50 is a good indicator. A predicted structure with both good P-value and uGDT(GDT)
is very likely to be of high quality. However, a model with good P-value but poor uGDT
value may be of low quality. Whereas, a model with a good uGDT, but poor P-value often
considered as of good quality.

The structures of OsMADS3, OsMADS17, OsMADS58, and DL proteins were predicted by
Raptor X and the predicted models are presented in Figure 32, Figure 33, Figure 34, and
Figure 35, respectively. The predicted structure of Jugal OsMADS3 protein is presented in
Figure 32a and the predicted structure of the reference genome OsMADS3 is presented in
32b. Figure 33a shows Jugal OsMADS17 protein, while the reference genome OsMADS17
protein presented in Figure 33b. The reference genome OsMADS58 isoform-1, OsMADS58
isofom-2 and Jugal OsMADS58 are presented in Figure 34a, Figure 34b, and Figure 34c
respectively. The Figure 35a shows Jugal DL and Figure 35b represents the reference genome
DL. Each predicted protein structures in the Figure 32, Figure 33, Figure 34, and Figure 35
include 4 parameters, which are - the total number of domains presented in the protein,
information about the secondary structure (the percentage of the Helix (H), Beta-sheet (E),
and Loop (C)), overall p-value, and overall µGDT(GDT).

                                                                                            66
Results

Figure 32. Predicted structure of OsMADS3; a Jugal; b the reference genome.

Figure 33. Predicted structure of OsMADS17; a Jugal; b the reference genome.

                                                                                67
Results

Figure 34. Predicted structure of OsMADS58; a the reference genome isoform-1; b the
reference genome isoform-2; c Jugal.

           Figure 35. Predicted structure of DL; a Jugal; b the reference genome.

                                                                                         68
Results

Validation of the predicated structures was done by RAMPAGE (Figure 36).

                           Figure 36. Screenshot of RAMPAGE

RAMPAGE uses several parameters for interpretation of the output of RAMPAGE which
includes – the number of amino acids in the favoured region in the Ramachandran Plot will
be around 98%, the number of amino acids in the allowed region in the Ramachandran Plot
will be around 2%; RAMPAGE also shows the number of amino acids in the outlier region in
Ramachandran Plot. The Figure 37a and Figure 37b represent validation of Jugal OsMADS3
and the reference genome OsMADS3, respectively. Figure 38a shows validation of Jugal
OsMADS17 and Figure 38b shows validation of the reference genome OsMADS17. Figure
39a, Figure 39b, and Figure 39c represent validation of OsMADS58 isoform-1, OsMADS58
isofom-2 and Jugal OsMADS58 respectively. Validation of Jugal DL is presented in Figure
40a and validation of the reference genome DL is presented in Figure 40b.

                                                                                       69
Results

Figure 37. Structure validation OsMADS3 protein; a Jugal; b the reference genome.

Figure 38. Structure validation OsMADS17 protein; a Jugal; b the reference genome.

                                                                                     70
Results

Figure 39. Structure validation OsMADS58 protein; a the reference genome isoform-1; b the
reference genome isoform-2; c Jugal.

        Figure 40. Structure validation DL protein; a Jugal; b the reference genome.

                                                                                            71
Results

Alignment of the predicted protein structures was done by RaptorX Structure Alignment
Server (Figure 41).

                Figure 41. Screenshot of RaptorX Structure Alignment Server

RaptorX structure alignment server uses several parameters to evaluate the quality of a
aligned protein structure which included Lali, RMSD, uGDT(GDT), and TMscore. The
length of alignment abbreviated as Lali represents the length of core. The core of aligned
protein structure is the portion of the structures in an alignment which aligned perfectly, and
represented as fully-aligned columns. RMSD is root-mean-square deviation and calculated
only on the core residues; the RMSD value was used for calculation of uGDT. uGDT is the
unnormalized GDT (Global Distance Test) score; whereas, GDT is calculated as uGDT
divided by the shorter (shortest) protein length and multiplied by 100. For protein structure
alignment, TMScore is the average TMscore of all the pair-wise alignments; the range of
TMscore is between 0 to 1. If TMscore >0.6, it is very likely (90% of chance) that two
proteins share a similar fold. When TMscore
Results

The alignment of the reference genome OsMADS3 and Jugal OsMADS3 proteins structures
are presented in Figure 42. Figure 42a shows the alignment of the two OsMADS3 proteins of
both the reference genome and Jugal. The Lali, RMSD, uGDT(GDT), TMscore values of the
alignment are presented in Figure 42b. The amino acids of the core of the alignment are
presented in Figure 42c, where the top row of the alignment shows the reference genome
OsMADS3 amino acid sequences and the bottom row shows amino acid sequences of Jugal
OsMADS3. Residues in the alignment are coloured according to their chemical nature – the
red colour indicates the hydrophobic amino acids, blue indicates acidic amino acids, the
magenta colour corresponds to basic amino acids, and the green colour indicates hydroxyl
and amine amino acids. The length of the alignment (Lali) is 49, which indicates that the 90
amino acid residues long the reference genome OsMADS3 and the 154 amino acid long Jugal
OsMADS3 produce protein structures which have a region of similar protein structures of 49
amino acids long. The 49 amino acids long structure of both the reference genome and Jugal
OsMADS3 protein is also called as core of the alignment and indicated with an arrow in the
Figure 42a.

  Figure 42. Alignment of OsMADS3 protein structure of Jugal and the reference genome.

                                                                                         73
Results

The aligned protein structures of OsMADS17 are presented in Figure 43. Figure 43a contains
the alignment of the reference genome OsMADS17 and Jugal OsMADS17 protein structures,
where the core of the alignment is showed by an arrow. Figure 43b shows the Lali, RMSD,
uGDT(GDT), and TMscore values of the alignment. The amino acids of the core of the
alignment are presented in Figure 43c. The Lali of the alignment of OsMADS17 is 102, this
indicates that among the 249 amino acid long the reference genome OsMADS17 protein and
254 amino acids long Jugal OsMADS17 protein, structures produced by 102 amino acids are
identical. The 102 amino acids that included in the core of the alignment are 78th-179th amino
acids of the reference genome OsMADS17 and 83th-184th amino acids of Jugal OsMADS17.
In Figure 43c, the top row of the alignment shows Jugal OsMADS17 amino acid sequences,
while the bottom row shows amino acid sequences of the reference genome OsMADS17.

 Figure 43. Alignment of OsMADS17 protein structure of Jugal and the reference genome.

                                                                                           74
Results

The alignment of the reference genome OsMADS58 isoform-1, OsMADS58 isoform-2 and
Jugal OsMADS58 are presented in Figure 44a and the core of the alignment was indicated
with arrow. Figure 44b shows Lali, RMSD, uGDT(GDT), and TMscore values of the
alignment. Figure 44c shows alignment of the amino acids.

Figure 44. Alignment of OsMADS58 protein structure of Jugal, the reference genome
isoform-1, and the reference genome isoform-2.

                                                                                     75
Results

The alignment of the reference genome DL and Jugal DL are presented in Figure 45. The
alignment of protein structure is presented in Figure 45a, where the core of the alignment was
indicated by an arrow. The parameters for interpretation of the alignment data are represented
in Figure 45b, which shows that Lali of the alignment was 102. Figure 44c shows that 93th-
194th amino acids of the reference genome DL and 95th-196th amino acids of Jugal DL
possess similar protein structure and constitute the core of alignment. In Figure 45c, the top
row of the alignment showed Jugal DL amino acid sequences, while the bottom row showed
amino acid sequences of the reference genome DL.

      Figure 45. Alignment of DL protein structure of Jugal and the reference genome.

                                                                                           76
Results

3.6. Expression study of floral organ developmental genes
3.6.1. Expression analysis by Semi-quantitative PCR

The PCR derived amplicons were resolved in 1.5% agarose gel (Figure 46a and 46b); the
variations in expression of the selected genes were estimated as variations in band intensities.
The band intensities were measured by ImageJ (Rueden et al. 2017). The Figure 46a
represents expression of DL, FON1, and OsMADS3 genes from both Jugal and IR36; whereas
expression of OsMADS13, OsMADS58, and UBIQUITIN genes are presented in Figure 46b.
The band intensities of Jugal DL and OsMADS3 genes are higher when compared with the
IR36 DL and OsMADS3 genes; whereas, for FON1 gene, there were no detectable bands
observed (Figure 7a). Figure 46b shows that the expression of Jugal OsMADS13 gene is quite
higher when compared with IR36 OsMADS13; although the expression of OsMADS58 gene
both from Jugal and IR36 are very low and merely visible, still bands for Jugal OsMADS58
has batter band intensities than that of IR36 OsMADS58. The expression of Jugal
UBIQUITIN is higher in comparison to IR36 UBIQUITIN as indicated by band intensities.
Figure 46c shows ImageJ analysis of DL, FON1, and OsMADS3 genes and 46d shows
OsMADS13, OsMADS58, and UBIQUITIN genes. The values of each band that were
generated by ImageJ are presented at the base of every band in Figure 46c and Figure 46d.

Figure 46. Semi-quantitative PCR derived amplicons resolved in 1.5% agarose gel. a DL,
FON1, and OsMADS3 genes of Jugal and IR36. b OsMADS13, OsMADS58, and UBIQUITIN
genes of Jugal and IR36. c Study of DL, FON1, and OsMADS3 genes of Jugal and IR36 by
ImageJ. d Study of OsMADS13, OsMADS58, and UBIQUITIN genes of Jugal and IR36 by
ImageJ.

                                                                                             77
Results

3.6.2. Expression analysis by real time PCR

3.6.2.1. Expression analysis of OsMADS3

The expression of OsMADS3 is presented in Figure 47 which shows that the OsMADS3 gene
expression slowly increased from Sp5 to Sp6 and Sp7 developmental stages in IR36.
However, although expression of OsMADS3 was considerably low at initial developmental
stage (Sp5) in the mutant rice (Jugal), in later developmental stages, the OsMADS3 level has
increased and its expression was almost same as the normal rice at Sp6 and Sp7 stages.

                Figure 47. Expression of OsMADS3 in IR36 and Jugal flowers

3.6.2.2. Expression analysis of OsMADS13

The expression of OsMADS13 is presented in Figure 48. In IR36 flower, the OsMADS13
expression level slowly decreases in Sp6 when compared with the Sp5 and then a very little
increase in expression was observed in Sp7. However, in Jugal flower OsMADS13 expression
was considerably high at Sp5 then a sudden decrease in expression was observed at Sp6 and
at Sp7 its expression increased drastically.

                                                                                          78
Results

              Figure 48. Expression of OsMADS13 in IR36 and Jugal flowers

3.6.2.3. Expression analysis of OsMADS21

Figure 49 represents expression of OsMADS21 gene. The OsMADS21 gene showed a slow
but continuous decrease in expression from Sp5 to Sp7 stage in IR36. At Sp5 stage, in Jugal
flower the expression of OsMADS21 was almost similar as that of IR36, it then shows a
decrease in expression at Sp6 and at Sp7 its expression level again increased and expressed
much higher when compared with IR36.

              Figure 49. Expression of OsMADS21 in IR36 and Jugal flowers

                                                                                         79
Results

3.6.2.4. Expression analysis of OsMADS58

The expression of OsMADS58 is the most interesting among the five selected genes and
represented in Figure 50. In IR36 its expression was higher at Sp6 than both Sp5 and Sp7.
But in case of Jugal, level of expression was almost similar of IR36 at Sp5 stage; after that a
sudden decrease in expression was observed at Sp6 stage then its expression again started to
increase and at Sp7 OsMADS58 expression was much higher when compared with the IR36.

               Figure 50. Expression of OsMADS58 in IR36 and Jugal flowers

3.6.2.5. Expression analysis of DL

The expression pattern of DL gene was almost similar in both Jugal and IR36 at early stage of
floral organ development i.e. at Sp5 stage and Sp6 stage (Figure 51). However, after Sp6
stage expression of DL reduced in the normal rice (IR36) but in case of the mutant rice
(Jugal) the expression started to increase and at Sp7 the expression level was much higher in
the mutant line.

                   Figure 51. Expression of DL in IR36 and Jugal flowers

                                                                                            80
You can also read