A major challenge of the sunflower genome project has been handling the large, repetitive genome. Below we describe our custom annotation procedures and their results.
Annotation results
Below are some basic descriptive statistics for the HA412HO bronze genome annotations.
Basic gene feature statistics
Feature name | Count |
---|---|
Total genes | 44144 |
Protein-coding genes | 44144 |
Protein-coding mRNAs | 44144 |
Genes with a Pfam domain | 36,969 (82.7%) |
Exons | 202500 |
CDSs | 195630 |
Five-prime UTRs | 19447 |
Introns | 316712 |
Start codons | 44144 |
Stop codons | 44144 |
Three-prime UTRs | 16244 |
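Counts like those in the table above are commonly tallied from the feature-type column of an annotation file. The following is a minimal sketch, assuming the annotations are available in GFF3 format (the source does not state the file format); the function name `count_feature_types` and the toy records are hypothetical and are not taken from the HA412HO annotation.

```python
from collections import Counter


def count_feature_types(lines):
    """Tally the feature type (GFF3 column 3) over an iterable of lines."""
    counts = Counter()
    for line in lines:
        line = line.rstrip("\n")
        # An embedded FASTA section, if present, ends the feature records.
        if line.startswith("##FASTA"):
            break
        # Skip blank lines, directives, and comments.
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        # A GFF3 feature record has exactly nine tab-separated columns.
        if len(fields) != 9:
            continue
        counts[fields[2]] += 1
    return counts


# Hypothetical toy records for illustration only (one gene, two exons).
EXAMPLE_RECORDS = [
    ("chr_demo", "demo", "gene", "1", "900", ".", "+", ".", "ID=gene1"),
    ("chr_demo", "demo", "mRNA", "1", "900", ".", "+", ".", "ID=mrna1;Parent=gene1"),
    ("chr_demo", "demo", "exon", "1", "300", ".", "+", ".", "Parent=mrna1"),
    ("chr_demo", "demo", "exon", "600", "900", ".", "+", ".", "Parent=mrna1"),
    ("chr_demo", "demo", "CDS", "1", "300", ".", "+", "0", "Parent=mrna1"),
    ("chr_demo", "demo", "CDS", "600", "900", ".", "+", "0", "Parent=mrna1"),
]
example_lines = ["##gff-version 3"] + ["\t".join(r) for r in EXAMPLE_RECORDS]

counts = count_feature_types(example_lines)
for feature_type, n in sorted(counts.items()):
    print(f"{feature_type}\t{n}")
```

For a real file, the same function could be given an open file handle, for example `count_feature_types(open(path))`, where `path` points to whichever annotation file was downloaded.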
Basic transposon statistics
- Class I
  - LTR Retrotransposons
    - Gypsy (RLG)

      Total Count | Count w/protein matches | Number of families | Length Minimum | Length Mean | Length Maximum |
      ---|---|---|---|---|---|
      20185 | 14695 | 420 | 1612 | 8583.76 | 24982 |

    - Copia (RLC)

      Total Count | Count w/protein matches | Number of families | Length Minimum | Length Mean | Length Maximum |
      ---|---|---|---|---|---|
      12981 | 10738 | 288 | 1658 | 7729.07 | 24973 |

    - Unclassified (RLX)

      Total Count | Count w/protein matches | Number of families | Length Minimum | Length Mean | Length Maximum |
      ---|---|---|---|---|---|
      10978 | 2714 | 323 | 1610 | 5158.72 | 24971 |

  - non-LTR Retrotransposons

    Total Count | Count w/protein matches | Number of families | Length Minimum | Length Mean | Length Maximum |
    ---|---|---|---|---|---|
    140 | NA | NA | 344 | 2047.63 | 2618 |

  - TRIM Retrotransposons

    Total Count | Count w/protein matches | Number of families | Length Minimum | Length Mean | Length Maximum |
    ---|---|---|---|---|---|
    7286 | NA | NA | 357 | 931.32 | 1987 |

- Class II
  - Subclass I
    - Terminal-inverted repeat (TIR) transposons:
      - hAT (DTA)

        Total Count | Count w/protein matches | Number of families | Length Minimum | Length Mean | Length Maximum |
        ---|---|---|---|---|---|
        143 | NA | NA | 133 | 1136.34 | 8572 |

      - Tc1-Mariner (DTT)

        Total Count | Count w/protein matches | Number of families | Length Minimum | Length Mean | Length Maximum |
        ---|---|---|---|---|---|
        1966 | NA | NA | 119 | 2063.27 | 11114 |

      - Mutator (DTM)

        Total Count | Count w/protein matches | Number of families | Length Minimum | Length Mean | Length Maximum |
        ---|---|---|---|---|---|
        914 | NA | NA | 123 | 1047.22 | 9972 |

      - Unclassified (DTX)

        Total Count | Count w/protein matches | Number of families | Length Minimum | Length Mean | Length Maximum |
        ---|---|---|---|---|---|
        14726 | NA | NA | 113 | 1866.45 | 11233 |

  - Subclass II
    - Helitrons (DHH)

      Total Count | Count w/protein matches | Number of families | Length Minimum | Length Mean | Length Maximum |
      ---|---|---|---|---|---|
      4308 | NA | NA | 202 | 7896.45 | 19995 |
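Because a mean length is the total length divided by the element count, the count and mean columns can be multiplied to recover an approximate total sequence length per superfamily. The sketch below does this for three of the superfamilies listed above. It assumes the length columns are in base pairs, which the source does not state, and the results are only estimates because the published means are rounded to two decimals. The names `SUPERFAMILIES` and `estimated_total_length` are illustrative.

```python
# (total count, mean length) pairs copied from the transposon statistics above.
# Assumption: lengths are in base pairs (units are not given in the source).
SUPERFAMILIES = {
    "Copia (RLC)": (12981, 7729.07),
    "Unclassified TIR (DTX)": (14726, 1866.45),
    "Helitrons (DHH)": (4308, 7896.45),
}


def estimated_total_length(count, mean_length):
    """Approximate summed element length as count * mean length."""
    return count * mean_length


totals = {
    name: estimated_total_length(count, mean)
    for name, (count, mean) in SUPERFAMILIES.items()
}

# Print superfamilies from largest to smallest estimated total, in megabases.
for name, total in sorted(totals.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: ~{total / 1e6:.1f} Mb")
```

Ranking by estimated total length rather than by count can change the order: the unclassified TIR elements are more numerous than Copia elements but much shorter on average, so they account for less sequence under this estimate.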