sc-RNA-seq || 10x Genomics-Loupe Cell Browser 极简教程

原文<Single cell RNA-seq 10x Genomics hands-on exercise>
-----By Gil Stelzer, June 2018

Loupe Cell Browser is a program created by 10x Genomics for visualizing Cell Ranger output.

The 10x GenomicsCell Ranger is a pipeline that processes raw sequencing data(using the cellranger count program). This includes demultiplexing the libraries based on sample indices and converting the barcode and read data to FASTQ files. There upon, alignment is performed (using STAR and the relevant reference sequence), followed by filtering and unique molecular identifier (UMI) counting. Lastly, aggregation (using cellranger aggr) of several samples is accomplished by taking outputs from multiple runs, normalizing these runs to the same sequencing depth, recomputing the gene-barcode matrices and analyzing on the combined data.

QC report (outs\web_summary.html)

In the first part of the exercise, we will view a cellranger-generated QC (quality control) report


Valid barcodes - should be above %75
  • Reads Mapped Confidently to Transcriptome – **if lower than 30% then the problem could be wrong ** reference transcriptome used for mapping, a reference transcriptome with overlapping genes, poor library quality, poor sequencing quality or reads shorter than the recommended minimum.
  • Sequencing saturation - describes how many reads are duplicates, based on UMI tags. If the sequencing is 90% saturated, it means that for every 10 reads that are sequenced, 9 are UMI duplicates. If the sequencing saturation is below 80% then it might be beneficial to sequence the sample more deeply to get additional UMI counts.
  • Q30 for R1 (Barcode and UMI) and I7 (Sample index) should be above 90%. Q30 for R2 (RNA read \ transcript) is usually less than 90% but above 70%.
  • Fraction reads in cells - fraction of barcoded confidently mapped reads with cell-associated barcodes, derived from total cell associated UMI’s / total UMI’s. The higher this percentage, the lower (premature) cell lysis has occurred and the cleaner the data will be, i.e. less ambient RNA.
Problem during library creation such as ambient RNA and more.
Analysis

The analysis tab displays t-SNE plots, the left plot shows UMI distribution, whilst the right plot can be viewed also in the Loupe Cell Browser (which enables interactivity)

These graphs allow you plan the future experiment with the same type of samples. The plot on the right demonstrates that we could reduce sequencing depth and receive similar results.

Loupe Cell Browser

In the second part of the exercise we will demonstrate how to use the Loupe Cell Browser on an acute myeloid leukemia (AML) dataset. Open the Loupe Cell Browser found on your computer (local program). Click on** 'Load Tutorial' **from the Help menu in order to load the AMLTutorial file.

This dataset contains the results of a cellranger aggr run over three samples: two healthy control samples of frozen human bone marrow mononuclear cells, and a pre-transplant sample from a patient with acute myeloid leukemia (AML). This dataset was generated in collaboration with the Fred Hutchinson Cancer Research Center, and referenced in the Nature Communications publication, "Zheng et al, Massively parallel digital transcriptional profiling of single cells" (2017; doi:10.1038/ncomms14049).

Cell Ranger produces the** two-dimensional scatter plot** appearing in the center (marked in a red ellipse) by applying t-SNE dimensionality reduction on the most significant gene vectors using principal component analysis (using the first 10 PC’s). Each point represents a single barcode, the vast majority of which represent a single cell. By default, the Categories mode and **Graph-Based clustering **are selected (purple ellipse).

Briefly, the Graph-Based clustering algorithm uses successive steps in order to cluster, starting by building a nearest-neighbor graph followed by Louvain Modularity Optimization and then cluster-merging by hierarchical clustering.

    1. How many cells does cluster 5 include?

Right click cluster 13 and select “Edit color”. Change the color to RGB (140, 20, 170).

    1. What percentage of cells does cluster 13 constitute (hover over the cell count)?

Select K-Means clustering from the clustering selection menu.

    1. Which cluster (depicted as a certain color as determined by K-Means=10) is split \ scattered the most in two dimensions?
    1. Change the default K-value to 6. Cluster number 10 (using K=10) is now merged into a larger cluster called 5 (using K=6). With which cluster(s) was it merged from K=10?
      Select the LibraryID clustering
    1. Are the normal samples good biological replicates? Explain your answer.
      Select the AMLStatus clustering

In this view, the samples are split into Normal vs. Patient.
We can see that there is an overlap in the gene expression between the patient and normal in some of the clusters.

Use the split view button to view the samples in split panels
Please note that the normal sample appears in the left panel.

Select the K-Means clustering. Change the number of clusters to 10.

    1. Do all the K-Means clusters appear in both normal and patient panels in the same proportion? In order to view this easily you can hide all clusters and then show each one by clicking their checkbox.

The bottom heatmap panel displays differential gene log2 fold change (average expression within the cluster cells compared to all the rest of the cells). Cells are clustered based on the selected clustering algorithm (K-Means in this case). Clusters appear in the rows and genes appear in the columns.

Looking at the heatmap, select the highest expressed gene in cluster 7 (using the scale bar on the heatmap panel). Once clicked (the colored square), the gene expression mode will be displayed –

    1. Which gene did you select as the highest expressed gene for cluster 7?
    1. Is there a difference in this gene’s expression in both treatments?
      Return to the categories mode and select the gene table view

Select cluster 10 and sort the genes by descending P-Value

Click the top scoring gene (most significant with lowest P-Value, symbol on left) and view in which cells the gene is expressed
Download the “b_cells.csv” file from “/course_2018/single_cell_RNA_seq_exercise”
Import this gene list into Loupe by navigating to the downloaded location


By default, “Gene Exp Max” is selected

When no gene is selected from the list, then cells, which express any of the genes in the list, are colored according the maximally expressed gene for a given cell
Select Ms4a1 to view its expression.
In order to better view all cells expressing this gene, first unsplit the view (by selecting the Split view in the categories mode) and then select the “set dark background” option.

Split the view according to LibraryID and return to the gene expression mode.

    1. Is there a difference in the expression of this gene between the samples?
      Create a new gene list and name it T-cells

Search for the TCRE gene, which is a marker for T-cells

Use GeneCards or any other system to find the official symbol for this gene and search for it in Loupe

    1. What is the official symbol for Tcre?
    1. Is there a difference in this gene’s expression in both samples?
      Search for Cd3g which is also a marker for T-cells
      In order to see the co-expression of the two genes in the “T-cells” list in the same cells select “Gene Exp Min” (non-intuitive labeling)
    1. Did the amount of highlighted cells increase or decrease?
      Change the scalar selection back to “Gene Exp Max” and filter for cells with a log2 fold change greater or equal to 1

Create a new category called Mononuclear cells(or any other name) and a cluster called T-cells
Notice that the mode changed to categories and that only the T-cells that are marked in blue appear on the right panel

Click split view (twice) until the T-cells are isolated in their own panel

In order to find the proportion of cells in each sample (normal1, normal2 and patient) we will count how many T-cells appear in each using the LibraryID category. Click the “Split View” button to split according to T-cells vs all other cells (left panel now shows T-cells and the right one shows all the rest). When you select LibraryID you will see that the T-cells are a mixture of them.

Uncheck the check-boxes for AMLNormal2 and AMLPatient so now the left panel will only display T-cells belonging to AMLNormal1.


Use the Rectangular selection tool to select all cells in the left panel containing T-cells from the normal1 sample.

Add the new cluster to the Monouclear cells category and name it “AMLNormal1 T-cells” (or any other name)
Click split view twice until you have 3 panels
Select the LibraryID category again and this time check only AMLNormal2. Use the Rectangular selection tool again to mark “AMLNormal2 T-cells”.
Click split view twice until you have 4 panels
The top left panel now contain only AMLPatient so rename it to “AMLPatient T-cells”

    1. How many T-cells are included in each of the original samples?
    1. Export a plot of the panels using the “Export Plot to Image” tool (when naming the file include the file type – e.g. jpg)

Click the “Significant Genes” button and then “Globally distinguishing”

    1. Export a plot of the heatmap using the “Export Heatmap to Image” tool (when naming the file include the file type – e.g. jpg)

Since the heatmap expression pattern for AMLNormal2 resembles AMLPatient more than when comparing AMLNormal1 to AMLPatient, we will compare only the latter two. Uncheck AMLNormal2 T-cells, click the “Significant Genes” button and then “Locally distinguishing” which will compare only the checked samples.

  1. Select the “Gene Table” view and select the “Top 100 Genes” then click the “Export Table to CSV”

参考:
What is Loupe Cell Browser
10X单细胞测序分析软件:Cell ranger
使用Loupe Cell Browser查看10X单细胞转录组分析结果
专门分析10x genomic公司的单细胞转录组数据的软件套件

©著作权归作者所有,转载或内容合作请联系作者
  • 序言:七十年代末,一起剥皮案震惊了整个滨河市,随后出现的几起案子,更是在滨河造成了极大的恐慌,老刑警刘岩,带你破解...
    沈念sama阅读 204,189评论 6 478
  • 序言:滨河连续发生了三起死亡事件,死亡现场离奇诡异,居然都是意外死亡,警方通过查阅死者的电脑和手机,发现死者居然都...
    沈念sama阅读 85,577评论 2 381
  • 文/潘晓璐 我一进店门,熙熙楼的掌柜王于贵愁眉苦脸地迎上来,“玉大人,你说我怎么就摊上这事。” “怎么了?”我有些...
    开封第一讲书人阅读 150,857评论 0 337
  • 文/不坏的土叔 我叫张陵,是天一观的道长。 经常有香客问我,道长,这世上最难降的妖魔是什么? 我笑而不...
    开封第一讲书人阅读 54,703评论 1 276
  • 正文 为了忘掉前任,我火速办了婚礼,结果婚礼上,老公的妹妹穿的比我还像新娘。我一直安慰自己,他们只是感情好,可当我...
    茶点故事阅读 63,705评论 5 366
  • 文/花漫 我一把揭开白布。 她就那样静静地躺着,像睡着了一般。 火红的嫁衣衬着肌肤如雪。 梳的纹丝不乱的头发上,一...
    开封第一讲书人阅读 48,620评论 1 281
  • 那天,我揣着相机与录音,去河边找鬼。 笑死,一个胖子当着我的面吹牛,可吹牛的内容都是我干的。 我是一名探鬼主播,决...
    沈念sama阅读 37,995评论 3 396
  • 文/苍兰香墨 我猛地睁开眼,长吁一口气:“原来是场噩梦啊……” “哼!你这毒妇竟也来了?” 一声冷哼从身侧响起,我...
    开封第一讲书人阅读 36,656评论 0 258
  • 序言:老挝万荣一对情侣失踪,失踪者是张志新(化名)和其女友刘颖,没想到半个月后,有当地人在树林里发现了一具尸体,经...
    沈念sama阅读 40,898评论 1 298
  • 正文 独居荒郊野岭守林人离奇死亡,尸身上长有42处带血的脓包…… 初始之章·张勋 以下内容为张勋视角 年9月15日...
    茶点故事阅读 35,639评论 2 321
  • 正文 我和宋清朗相恋三年,在试婚纱的时候发现自己被绿了。 大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
    茶点故事阅读 37,720评论 1 330
  • 序言:一个原本活蹦乱跳的男人离奇死亡,死状恐怖,灵堂内的尸体忽然破棺而出,到底是诈尸还是另有隐情,我是刑警宁泽,带...
    沈念sama阅读 33,395评论 4 319
  • 正文 年R本政府宣布,位于F岛的核电站,受9级特大地震影响,放射性物质发生泄漏。R本人自食恶果不足惜,却给世界环境...
    茶点故事阅读 38,982评论 3 307
  • 文/蒙蒙 一、第九天 我趴在偏房一处隐蔽的房顶上张望。 院中可真热闹,春花似锦、人声如沸。这庄子的主人今日做“春日...
    开封第一讲书人阅读 29,953评论 0 19
  • 文/苍兰香墨 我抬头看了看天上的太阳。三九已至,却和暖如春,着一层夹袄步出监牢的瞬间,已是汗流浃背。 一阵脚步声响...
    开封第一讲书人阅读 31,195评论 1 260
  • 我被黑心中介骗来泰国打工, 没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留,地道东北人。 一个月前我还...
    沈念sama阅读 44,907评论 2 349
  • 正文 我出身青楼,却偏偏与公主长得像,于是被迫代替她去往敌国和亲。 传闻我的和亲对象是个残疾皇子,可洞房花烛夜当晚...
    茶点故事阅读 42,472评论 2 342

推荐阅读更多精彩内容