A Simple TBtools Manual (English)

[For version 0.665]

CJ (ccj0410@gmail.com)
South China Agricultural University

Overview

Rapid development of high-throughput sequencing (HTS) techniques has led biology into the “big-data” era. Data analysis with bioinformatics software or pipelines that rely on programming and a command-line environment is challenging and time-consuming for most wet-lab biologists, who generally prefer tools with a user-friendly interface.
Thus, we present TBtools (a Toolkit for Biologists integrating various biological data handling tools), stand-alone software with a user-friendly interface. It has powerful data handling engines for both bulk sequence processing and interactive data visualization. It includes a large collection of functions that facilitate simple, routine, but laborious work on biological data, such as bulk sequence extraction, gene set enrichment analysis, Venn diagram preparation, heatmap illustration, and comparative sequence visualization.

A Glance at TBtools’ Functions




Download and Installation

TBtools is platform-independent software that runs on any operating system with Java Runtime Environment 1.6 or newer. It is freely available to non-commercial users at
https://github.com/CJ-Chen/TBtools/releases

For users on any operating system (Windows, Mac, Linux, ...):
1. Download TBtools-crossplatform_XXX.rar.
2. Unpack the rar file to obtain an executable jar file.
3. Optional. To use the BLAST wrapper functions, install the BLAST package and add its bin directory to the PATH environment variable.
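
Whether the BLAST bin directory is visible on PATH can be checked before starting TBtools. The following is a minimal sketch, not part of TBtools; it assumes the installed package is NCBI BLAST+ and that program names such as blastn, blastp and makeblastdb are the relevant ones (which programs TBtools actually calls is not stated in this manual).

```python
import shutil

# Assumption: program names from NCBI BLAST+; adjust to the package you installed.
BLAST_PROGRAMS = ("blastn", "blastp", "makeblastdb")


def find_blast_programs(programs=BLAST_PROGRAMS):
    """Map each program name to its full path, or None if it is not on PATH."""
    return {name: shutil.which(name) for name in programs}


if __name__ == "__main__":
    for name, path in find_blast_programs().items():
        print(f"{name}: {path or 'NOT FOUND (add the BLAST bin directory to PATH)'}")
```

If a program is reported as not found, the bin directory has probably not been added to PATH, or the terminal needs to be restarted for the change to take effect.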

For users on Windows, a better choice is:
1. Download the TBtools_windows-32-bits-XXX.rar or TBtools_windows-64-bits-XXX.rar file.
2. Unpack the rar file to obtain an exe file.
3. Double-click the exe file, click next, and wait for the installation to finish.

Getting started with TBtools

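For users who start TBtools from a jar file, there are two ways:
1. Double-click the jar file. If this does not work, try the next way.
2. Open a terminal (CMD or PowerShell on Windows, a shell such as Bash on Mac or Linux) and type the following command, where -Xmx2G sets the maximum Java heap size to 2 GB and PathtoTBtools.jar stands for the path to the jar file: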

java -Xmx2G -jar PathtoTBtools.jar

A command example under Windows


For Windows users who have installed TBtools from an exe file, double-click the TBtools icon and the main panel of TBtools will pop up.

In the main panel of TBtools, there is a main menubar and several buttons:
1. Click “Version” to check whether the current TBtools is the latest version.
2. Click “Citation” to see how to cite TBtools.

Usage of Key Functions

Bulk Sequence Extraction

Go to it:
Main menubar -> Sequence Toolkits -> Fasta Tools -> Amazing Fasta Extractor
Input:
1. A target sequence file in Fasta format (ref https://en.wikipedia.org/wiki/FASTA), e.g.

>Unigene1 high expressing gene
ACGATCAGCTCAGCGACGATCGACTAGCTACGATCAGCTAGCTACGATCGACTAGCTAGCTACGA
ACGATCAGCTCAGCGACGATCGACTAGCTACGATCAGCTA
>Unigene2 low expressing gene
ACTCAGCTCAGCGACGATCAGCTCAGCGACGATCGACTAGCTACGACGACTAGCTACGA……
….

2. A set of gene identifiers or regions, e.g.

##### Lines prefixed with # will be ignored
##### Examples for One Gene ###########
Unigene_1
Unigene_2
### ChrID StartPos EndPos
Chr_1 100000 102000
### GeneID ChrID StartPos EndPos #########
FinalGeneID Chr_1 100000 120000

Output:
Complete sequences or regions of sequences specified by users
Detailed Usage:
1. Drag a target sequence file into the text-field, or set it by clicking the “…” button
2. Click the “initialize” button to build a FA-index (if the index has already been built, TBtools will skip this step)
3. Set the path of an output file
4. Set a list of IDs or sequence regions
5. Click “Start”

  • Optional. If users select “Just Show Dialog”, the extracted sequences are shown directly in a dialog. In this case, setting an output file is not required.
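
The three kinds of request lines shown above (an ID alone, “ChrID StartPos EndPos”, and “GeneID ChrID StartPos EndPos”) can be illustrated with a short script. This is only a sketch of the idea, not TBtools’ implementation: it loads all sequences into memory instead of using an index, it assumes coordinates are 1-based and inclusive, and the “ChrID:Start-End” output name is a made-up convention.

```python
def read_fasta(lines):
    """Parse FASTA lines into {sequence_id: sequence}; the ID is the first word after '>'."""
    seqs, current = {}, None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            current = line[1:].split()[0]
            seqs[current] = []
        elif current is not None:
            seqs[current].append(line)
    return {seq_id: "".join(parts) for seq_id, parts in seqs.items()}


def extract(seqs, request_lines):
    """Yield (name, sequence) for each ID or region line; lines starting with '#' are skipped."""
    for line in request_lines:
        fields = line.split()
        if not fields or fields[0].startswith("#"):
            continue
        if len(fields) == 1:      # SeqID
            name, seq_id, start, end = fields[0], fields[0], None, None
        elif len(fields) == 3:    # ChrID StartPos EndPos
            seq_id, start, end = fields[0], int(fields[1]), int(fields[2])
            name = f"{seq_id}:{start}-{end}"  # hypothetical naming scheme
        elif len(fields) == 4:    # GeneID ChrID StartPos EndPos
            name, seq_id = fields[0], fields[1]
            start, end = int(fields[2]), int(fields[3])
        else:
            raise ValueError(f"Unrecognized line: {line!r}")
        seq = seqs[seq_id]
        if start is not None:
            # Assumption: coordinates are 1-based and inclusive.
            seq = seq[start - 1:end]
        yield name, seq


fasta = [
    ">Unigene1 high expressing gene", "ACGATCAGCT", "CAGCGACG",
    ">Unigene2 low expressing gene", "ACTCAGCTCA",
]
requests = ["# a comment line", "Unigene2", "Unigene1 3 6", "MyGene Unigene1 1 4"]
extracted = list(extract(read_fasta(fasta), requests))
for name, seq in extracted:
    print(f">{name}\n{seq}")
```

For genome-sized files an index (as built by the “initialize” step) avoids reading the whole file into memory; the sketch trades that efficiency for brevity.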


Sequence Extraction from Genome According to Gene Structure Annotation File (.gff3/gtf)

Go to it:
Main menubar -> Sequence Toolkits -> Gff3/GTF Manipulator -> Gtf/Gff3 Sequences Extractor
Input:
1. A target sequence set of genome in Fasta format
2. A corresponding gene structure annotation file in gff3/gtf format (ref https://en.wikipedia.org/wiki/General_feature_format)
Output:
A file storing sequences of a user-specified feature (CDS, exon, mRNA, gene, UTR, promoter, etc.)
Detailed Usage:
1. Set a gff3/gtf file
2. Click the “initialize” button and TBtools will list the available features for users to select
3. Select a target feature, e.g. “CDS”
4. Select an ID to group sequence segments of the specified feature, e.g. “Parent”
5. Set a genome sequence file
6. Set an output file

  • Optional. To extract sequences upstream or downstream of the specified feature, e.g. 2000 bp upstream of the CDS (often referred to as the “promoter region”), enter “2000” in the corresponding text-field.
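
The idea of selecting a feature and grouping its segments by an ID such as “Parent” can be sketched as follows. This is an illustration under assumptions, not TBtools’ code: the input is GFF3 (tab-separated, nine columns, key=value attributes), each segment has a single Parent value, segments of a group are joined in ascending coordinate order, and minus-strand groups are reverse-complemented.

```python
from collections import defaultdict

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def parse_attributes(column9):
    """Parse a GFF3 attribute column such as 'ID=cds1;Parent=mRNA1' into a dict."""
    pairs = (item.split("=", 1) for item in column9.strip().split(";") if "=" in item)
    return {key.strip(): value.strip() for key, value in pairs}


def extract_feature(genome, gff3_lines, feature="CDS", group_by="Parent"):
    """Join all segments of `feature` that share the same `group_by` attribute value."""
    segments = defaultdict(list)
    for line in gff3_lines:
        if line.startswith("#") or not line.strip():
            continue
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9 or cols[2] != feature:
            continue
        group = parse_attributes(cols[8]).get(group_by)
        if group is not None:
            segments[group].append((cols[0], int(cols[3]), int(cols[4]), cols[6]))
    result = {}
    for group, segs in segments.items():
        segs.sort(key=lambda seg: seg[1])
        # GFF3 coordinates are 1-based and inclusive.
        seq = "".join(genome[chrom][start - 1:end] for chrom, start, end, _ in segs)
        if segs[0][3] == "-":
            seq = seq.translate(COMPLEMENT)[::-1]  # reverse complement
        result[group] = seq
    return result


genome = {"Chr_1": "AAACCCGGGTTTACGTACGT"}
gff3 = [
    "##gff-version 3",
    "\t".join(["Chr_1", ".", "CDS", "7", "9", ".", "+", "0", "ID=cds2;Parent=mRNA1"]),
    "\t".join(["Chr_1", ".", "CDS", "1", "3", ".", "+", "0", "ID=cds1;Parent=mRNA1"]),
    "\t".join(["Chr_1", ".", "CDS", "10", "13", ".", "-", "0", "ID=cds3;Parent=mRNA2"]),
    "\t".join(["Chr_1", ".", "exon", "1", "13", ".", "+", ".", "ID=exon1;Parent=mRNA1"]),
]
cds = extract_feature(genome, gff3)
print(cds)
```

GTF files use a different attribute syntax (key "value"; pairs), so the attribute parser here would need to be replaced for GTF input.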


BLAST Wrapper and Visualization

Go to it:
Main menubar -> Blast -> Blast Waper -> Several Sequences To a Big File [Commonly Used]
Input:
1. A set of query sequences. Fasta format is required only when there are multiple sequences.
2. A target sequence file, e.g. transcriptome, genome.
Output:
BLAST result in a user-specified format (xml, table, pairwise)
Detailed Usage:
1. Paste a set of query sequences, or drag and drop a sequence file into the text-area
2. Set a target sequence file in fasta format
3. Set the path of an output file to store the BLAST result. Clicking the “Temp” button generates an intermediate file, which TBtools deletes automatically on exit.
4. Click “Start”
5. When the BLAST process has finished, users can click the “Visualize” button to invoke the TBtools BLAST result visualization functions (only valid for XML outfmt).

  • Optional. Detailed parameters can be changed in the top-right panel and under “Other Options”. In most cases the defaults work well.


Example of visualization of BLAST results

Gene Ontology Enrichment Analysis

Go to it:
Main menubar -> GO and KEGG -> GO Enrichment
Input:
1. A go-basic.obo file downloaded from http://purl.obolibrary.org/obo/go/go-basic.obo
2. A GO annotation background, formatted as below. The first column holds gene identifiers. If multiple genes are annotated to the same GO term, commas can be used to separate them, e.g. “Unigene1,Unigene2 GO:0005509”. The second column holds GO numbers. If one gene is annotated to several GO terms, commas can be used to separate them, e.g. “Unigene1 GO:0008483,GO:0030170”.


3. A gene set of interest for enrichment analysis.
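
The two-column annotation background described in item 2 can be read into a gene-to-terms mapping with a few lines of code. This is a sketch only; it assumes the two columns are separated by whitespace (for example a tab) and silently skips lines that do not have exactly two columns.

```python
from collections import defaultdict


def read_annotation(lines):
    """Return {gene_id: set of terms} from two-column lines (genes, terms)."""
    gene_to_terms = defaultdict(set)
    for line in lines:
        fields = line.split()
        if len(fields) != 2:
            continue  # skip blank or malformed lines
        genes, terms = fields[0].split(","), fields[1].split(",")
        for gene in genes:
            gene_to_terms[gene].update(terms)
    return dict(gene_to_terms)


background = read_annotation([
    "Unigene1,Unigene2\tGO:0005509",
    "Unigene1\tGO:0008483,GO:0030170",
])
for gene, terms in sorted(background.items()):
    print(gene, ",".join(sorted(terms)))
```

Both layouts (several genes per term, or several terms per gene) end up in the same mapping, so a file may mix them freely.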
Output:
Eight files will be generated. An example is shown below. The GO annotation background is parsed according to the Gene Ontology information stored in the go-basic.obo file. Users new to enrichment analysis are recommended to use the .final.xls result directly. GO enrichment analysis is conducted on three categories: Biological Process, Cellular Component, and Molecular Function. Files suffixed with “.sorted.padjust” can be used for further analyses.

Detailed Usage:
1. Set the latest go-basic.obo file
2. Set a GO annotation background
3. Set a gene set of interest
4. Set an output directory. Users can also set a prefix for naming the output files.

  • Optional. Clicking “Download go-basis.obo …” invokes TBtools to download the latest go-basic.obo file.
  • Optional. GO-slim can be used in enrichment analysis.
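
This manual does not state which statistical test TBtools applies. As background only, enrichment p-values are commonly computed with a one-sided hypergeometric test, which asks how likely it is to see at least k annotated genes in the gene set by chance. The sketch below shows that common approach, not necessarily TBtools’ method; all parameter names are illustrative.

```python
from math import comb  # requires Python 3.8 or newer


def hypergeom_pvalue(k, n, K, N):
    """P(X >= k) when drawing n genes from N background genes, K of which carry the term.

    k: genes in the gene set of interest annotated to the term
    n: size of the gene set of interest
    K: genes in the background annotated to the term
    N: size of the background
    """
    upper = min(n, K)
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favourable / comb(N, n)


# Toy numbers for illustration: 10 background genes, 3 carry the term,
# the gene set has 4 genes and 2 of them carry the term.
p = hypergeom_pvalue(k=2, n=4, K=3, N=10)
print(round(p, 4))
```

Because one test is run per GO term, the raw p-values are normally corrected for multiple testing before interpretation.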



KEGG Pathway Enrichment Analysis

Go to it:
Main menubar -> GO and KEGG -> KEGG Enrichment
Input:
1. A file storing KEGG pathway ontology information. Users can prepare it using the “Make One Backend File From Web” button.
2. A KEGG annotation background, formatted as follows. The first column holds gene identifiers. If multiple genes are annotated to the same KO number, commas can be used to separate them, e.g. “Unigene1,Unigene2 K12243”. The second column holds KO numbers. If one gene is annotated to several KO numbers, commas can be used to separate them, e.g. “Unigene1 K12322,K23421”.
3. A gene set of interest for enrichment analysis
Output:
Two files will be generated. Use the .final.xls output file, which has been filtered with “minimum gene number>=5 and p-value<=0.05”.
Detailed Usage:
1. Set a file storing KEGG pathway ontology information.
2. Set a KEGG annotation background
3. Set a gene set of interest
4. Set an output file.

  • Optional. Select a target pathway category, e.g. “Plants”, “Animals”, “hsa”, and click “Make …” to invoke TBtools to prepare a file storing KEGG pathway ontology information.
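
The filter applied to the .final.xls file (“minimum gene number>=5 and p-value<=0.05”) amounts to a simple row selection, sketched below. The column names gene_count and p_value are hypothetical (the real column headers of the TBtools output are not given in this manual), and the rows are made-up values for illustration, not real results.

```python
def filter_enrichment(rows, min_genes=5, max_pvalue=0.05):
    """Keep rows with at least `min_genes` genes and a p-value of at most `max_pvalue`."""
    return [
        row for row in rows
        if row["gene_count"] >= min_genes and row["p_value"] <= max_pvalue
    ]


# Made-up rows for illustration only.
rows = [
    {"term": "pathway_A", "gene_count": 12, "p_value": 0.001},
    {"term": "pathway_B", "gene_count": 3, "p_value": 0.001},  # too few genes
    {"term": "pathway_C", "gene_count": 8, "p_value": 0.20},   # p-value too high
    {"term": "pathway_D", "gene_count": 5, "p_value": 0.05},   # exactly on both thresholds
]
kept = filter_enrichment(rows)
print([row["term"] for row in kept])
```

The same function can be applied to the unfiltered output with looser or stricter thresholds when the defaults are not suitable.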


Funny Enrichment Barplot

Go to it:
Main menubar -> GO and KEGG -> Funny Enrichment Bar Plot
Input:
An enrichment analysis result. Enrichment result files from TBtools are preferred. The input file must contain at least two columns: one storing “Term” labels and another storing “p-value”, “q-value” or “FDR”.
Output:
An interactive bar plot
Detailed Usage:
1. Set an enrichment analysis result file.
2. Select the type of the enrichment analysis result: “GO”, “KEGG” or “Other”. The first two options are presets for TBtools enrichment results. For enrichment results from other programs, select “Other”; the “Term Col Name” and “pValue….” text-fields might then need to be modified.

  • Optional. Select a barplot type: “Normal”, “TextOnLeft” or “BarOnLeft”.


Example of output:


Wonderful Venn

Go to it:
Main menubar -> Graphics -> Venn and Upset Plot -> Wonderful Venn (Up to Six Sets)
Input:
Two to six sets of identifiers
Output:
An interactive Venn diagram
Detailed Usage:
1. Paste each set of identifiers or drag and drop files storing identifiers.
2. Click “Start”

  • Optional. Change labels of each set
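
Each region of a Venn diagram holds the identifiers that occur in exactly one combination of sets. The following sketch computes those regions for any number of labelled sets; it is an illustration of the underlying set arithmetic, not TBtools’ code, and the set labels and gene names are invented.

```python
from collections import defaultdict


def venn_regions(named_sets):
    """Map each combination of set labels to the identifiers found in exactly those sets."""
    regions = defaultdict(set)
    all_ids = set().union(*named_sets.values())
    for identifier in all_ids:
        members = tuple(label for label, ids in named_sets.items() if identifier in ids)
        regions[members].add(identifier)
    return dict(regions)


# Invented example sets.
sets = {
    "A": {"g1", "g2", "g3"},
    "B": {"g2", "g3", "g4"},
    "C": {"g3", "g5"},
}
regions = venn_regions(sets)
for labels, ids in sorted(regions.items()):
    print("&".join(labels), len(ids), sorted(ids))
```

The region sizes are the numbers drawn inside the diagram; combinations that contain no identifiers are simply absent from the result.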


Example Output:


Upset Plot

Go to it:
Main menubar -> Graphics -> Venn and Upset Plot -> Upset Plot
Input:
As many sets of identifiers as users need
Output:
An interactive Upset Plot
Detailed Usage:
1. Click the “Add” button as many times as needed to add panels
2. Paste each set of identifiers or drag and drop files storing identifiers.

  • Optional. Change labels of each set
  • Optional. Control the order of the vertical bars of the Upset plot by changing the ranking selections.

Example output:

Heatmap Illustrator

Go to it:
Main menubar -> Graphics -> Amazing Heatmap -> The Amazing Simple Heatmap
Input:
1. A value matrix. The matrix should contain column names and row names, e.g.

  • Optional. A tab-delimited file containing row annotation information, e.g.


  • Optional. A tab-delimited file containing column annotation information, e.g.

Output:
An interactive heatmap
Detailed Usage:
1. Paste the value matrix, or drag and drop a file storing it.
2. Click “Start”

  • Optional. Set row annotation
  • Optional. Set column annotation
  • Optional. Paste a preset newick string

Example output:

Batch Biological Sequence Visualization

Go to it:
Main menubar -> Graphics -> BioSequence Structure Illustrator -> Amazing Optional Gene Viewer
Input:
All Inputs are Optional.
A set of gene identifiers to filter the gene structure information or a newick tree string.
An XML file from MEME suite analysis result (MEME or MAST).
A gff/gtf file.
A file storing domain information in protein coordinate, e.g. analysis result from NCBI-CDD search.
A file storing domain information in mRNA coordinate, e.g. miRNA target sites.
A file storing renaming information of all sequences to be shown.

Output:
An interactive plot that can simultaneously present a phylogenetic tree, motif/domain patterns, gene structures, and miRNA target sites.
Detailed Usage:
All are optional.

  • Optional. Paste a newick tree string or a set of gene identifiers
  • Optional. Set an XML file storing motif patterns
  • Optional. Set a gff/gtf file
  • Optional. Set a file storing domain information in protein coordinate
  • Optional. Set a file storing domain information in mRNA coordinate
  • Optional. Set a file storing renaming information

Example output: