LR50 is a pipeline for assessing the quality of insect transcriptomes.It requires Linux platform.
There are 4 modules in the package of software:- classification.pl
- blast_parsing.pl
- run_inparanoid.sh
LR50 is a pipeline for assessing the quality of insect transcriptomes.It requires Linux platform.
There are 4 modules in the package of software:Please follow the instructions to install the packages.
$ tar -xzvf LR50-1.0.tar.gz
$ calculating_LR50.pl -t < Transcriptomes.nucleotide.fa > -g < GenomeReferenceSpecie.protein.fa > \
-d < Database for transcriptome annotation >
Transcriptomes.nucleotide.fa: fasta formate.
Transcriptome nucleotide sequences which you want to assess.
GenomeReferenceSpecise.protein.fa: fasta formate.
Genome reference specie (GRS) is a close related specie whose genome had been sequenced and well annotated.
Database for transcriptome annotation: protein database.
NR database or other database is used to annotate the transcriptome.
When program finished, the results are saved as two files, which are placed in the directory “LR50_out”:
- summary.xls
- Translation_pep.fasta
NOTE: Two steps in this pipeline are time consuming. One is the annotation, and another is inparanoid.
To save time, you need set more cpu cores to accelerate the first step. For inparanoid, you need change this line in
the inparanoid.pl file:
$blastall = "blastall"; #Add -aN to use N processors
to
$blastall = "blastall -a4"; #Add -aN to use N processors
to add the blast processors cores.