scripts and intermediate files used to annotate TEs in Jiao et al.
scripts and intermediate files used to annotate TEs in Jiao et al. 2016
LTR Retrotransposons
scripts in ltr
Software needed:
ncbi blast+
genometools, (download), need to pass 64bit=yes with-hmer=yes threads=yes
to make, make install for ltrdigest hmm searches in parallel. I also had to pass cairo=no
as well because I didn't have the right cairo libraries and it wouldn't compile otherwise
silix, (download), need to compile with --enable-mpi
and --enable-verbose
hmmer (genometools with download and compile hmmer2 if you run make with-hmmer=yes
)
Files needed, can be downloaded by get_tRNA_hmm_dbs.sh
in ltr
directory:
download hmms of TE protein coding domains from gydb in directory gydb_hmms
, will be used to identify protein coding domains of TE models
-need to fix a hmm with name ty1/copia because this is used as a filename by ltrdigest. to remove the forward slash: sed -i "s#ty1/copia#ty1-copia#g" gydb_hmms/GyDB_collection/profiles/AP_ty1copia.hmm
download tRNAs of all eukaryotes
SINEs
Scripts in sine/
Software needed:
SINE-Finder, download (This is a supplemental file at The Plant Cell; need to make executable, and rename to sine_finder.py)
LINEs
Scripts in line/
Software needed:
MGEScanNonLTR I use the version generated for Galaxy here.
TIR including MITEs
Scripts in tir/
Software needed:
mTEA, genometools (see above, already installed for ltr annotation)
Helitrons
Scripts in helitron/
Software needed:
Software needed: