News

By:SES | Posted on: 15 Jul 2017

Shown above is a screen capture from the genome browser displaying expression levels of genes and transposons in a small window on chromosome 1. The transposon annotations in the genome browser and, available for download on this site, were generated with a tool called Tephra which is discussed on this page.

The primary challenge with the assembly and annotation of plant genomes is dealing with repetitive elements such as transposons. The sunflower genome has been particularly challenging due to nature of transposons. That is, most LTR retrotransposons in sunflower are very young, having accumulated since the origin of the species. While much is know about LTR retrotransposons, less attention has been given to other transposon types, in part due to a lack of published tools and methods.

The command line toolkit I have developed, called Tephra, aims to characterize all transposon types in the genome, as well as the structure and diversity of these elements in order to better infer their evolutionary history.

The usage of this tool follows Unix convention where you have a single command with subcommands that perform different functions (similar to samtools). To give an idea of the different utilities provided there is overview of the commands and their description below.

SubcommandDescription
allRun all subcommands and generate annotations for all transposon types
classifyltrsClassify LTR retrotransposons into superfamilies and families
classifytirsClassify TIR transposons into superfamilies
findhelitronsFind Helitons in a genome assembly
findltrsFind LTR retrotransposons in a genome assembly
findnonltrsFind non-LTR retrotransposons in a genome assembly
findtirsFind TIR transposons in a genome assembly
findtrimsFind TRIM retrotransposons in a genome assembly
illrecombCharacterize the distribution of illegitimate recombination in a genome
ltrageCalculate the age distribution of LTR retrotransposons
maskrefMask a reference genome with transposons
reannotateTransfer annotations from a reference set of repeats to Tephra annotations
sololtrFind solo-LTRs in a genome assembly
tirageCalculate the age distribution of TIR transposons

This tool is open source and is available on github, where you can find information about installation and usage.