Annotations

A major challenge of the sunflower genome project has been handling the genome's large size and highly repetitive content. Below is a description of our custom annotation procedures and the results. You may skip to the results or download sections using the menu to the left.

Annotation results

Below are some basic descriptive statistics for the HA412HO bronze genome annotations.

Basic gene feature statistics

Feature name              Count
Total genes               44144
Protein-coding genes      44144
Protein-coding mRNAs      44144
Genes with a Pfam domain  36,969 (82.7%)
Exons                     202500
CDSs                      195630
five_prime_UTRs           19447
introns                   316712
start_codons              44144
stop_codons               44144
three_prime_UTRs          16244
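
Counts like those above can be tallied directly from an annotation file. The following is a minimal sketch, assuming the annotations are available as tab-delimited, nine-column GFF3-style records (the file format is an assumption, not something stated on this page); the function name `count_feature_types` and the example records are hypothetical.

```python
from collections import Counter


def count_feature_types(gff3_lines):
    """Tally the feature type (column 3) of GFF3-style records.

    Assumes tab-delimited, nine-column records; this layout is an
    assumption for illustration, not the project's documented format.
    """
    counts = Counter()
    for line in gff3_lines:
        line = line.rstrip("\n")
        # Skip blank lines, comments, and '##' directives.
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        # Ignore anything that is not a full nine-column record.
        if len(fields) < 9:
            continue
        counts[fields[2]] += 1
    return counts


# Made-up records for illustration only (not real annotation data).
example = [
    "##gff-version 3",
    "chr1\tsketch\tgene\t100\t900\t.\t+\t.\tID=g1",
    "chr1\tsketch\tmRNA\t100\t900\t.\t+\t.\tID=m1;Parent=g1",
    "chr1\tsketch\texon\t100\t400\t.\t+\t.\tParent=m1",
    "chr1\tsketch\texon\t600\t900\t.\t+\t.\tParent=m1",
]
feature_counts = count_feature_types(example)
```

Because the function accepts any iterable of lines, an open file handle can be passed in place of the example list.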

Basic transposon statistics

  • Class I
    • LTR Retrotransposons
      • Gypsy (RLG)
        Total Count  Count w/protein matches  Number of families  Length Minimum  Length Mean  Length Maximum
        20185        14695                    420                 1612            8583.76      24982
      • Copia (RLC)
        Total Count  Count w/protein matches  Number of families  Length Minimum  Length Mean  Length Maximum
        12981        10738                    288                 1658            7729.07      24973
      • Unclassified (RLX)
        Total Count  Count w/protein matches  Number of families  Length Minimum  Length Mean  Length Maximum
        10978        2714                     323                 1610            5158.72      24971
    • non-LTR Retrotransposons
      Total Count  Count w/protein matches  Number of families  Length Minimum  Length Mean  Length Maximum
      140          NA                       NA                  344             2047.63      2618
    • TRIM Retrotransposons
      Total Count  Count w/protein matches  Number of families  Length Minimum  Length Mean  Length Maximum
      7286         NA                       NA                  357             931.32       1987
  • Class II
    • Subclass I
      • Terminal inverted repeat (TIR) transposons:
        • hAT (DTA)
          Total Count  Count w/protein matches  Number of families  Length Minimum  Length Mean  Length Maximum
          143          NA                       NA                  133             1136.34      8572
        • Tc1-Mariner (DTT)
          Total Count  Count w/protein matches  Number of families  Length Minimum  Length Mean  Length Maximum
          1966         NA                       NA                  119             2063.27      11114
        • Mutator (DTM)
          Total Count  Count w/protein matches  Number of families  Length Minimum  Length Mean  Length Maximum
          914          NA                       NA                  123             1047.22      9972
        • Unclassified (DTX)
          Total Count  Count w/protein matches  Number of families  Length Minimum  Length Mean  Length Maximum
          14726        NA                       NA                  113             1866.45      11233
    • Subclass II
      • Helitrons (DHH)
        Total Count  Count w/protein matches  Number of families  Length Minimum  Length Mean  Length Maximum
        4308         NA                       NA                  202             7896.45      19995
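
The per-superfamily columns above (total count, minimum, mean, and maximum length) are simple summary statistics over element lengths. As a rough sketch of how such a summary could be computed, assuming element lengths have already been grouped by superfamily code; the function name `summarize_lengths` and the toy input are hypothetical and do not reproduce the project's pipeline or data:

```python
def summarize_lengths(lengths_by_superfamily):
    """Return count, min, mean, and max length per superfamily code.

    `lengths_by_superfamily` maps a code such as "RLG" to a list of
    element lengths. Groups with no elements are skipped.
    """
    summary = {}
    for code, lengths in lengths_by_superfamily.items():
        if not lengths:
            continue
        summary[code] = {
            "count": len(lengths),
            "min": min(lengths),
            # Mean rounded to two decimals, matching the table layout.
            "mean": round(sum(lengths) / len(lengths), 2),
            "max": max(lengths),
        }
    return summary


# Toy lengths for illustration only (not real annotation data).
toy = {"RLG": [2000, 5000, 11000], "DTA": [150, 450], "DHH": []}
toy_summary = summarize_lengths(toy)
```

Counts of protein matches and numbers of families would need additional inputs (similarity search results and family assignments) and are left out of this sketch.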