CITE-Seq
{{Short description|Cellular biology lab technique}}
CITE-Seq (Cellular Indexing of Transcriptomes and Epitopes by Sequencing) is a method for performing RNA sequencing along with gaining quantitative and qualitative information on surface proteins with available antibodies on a single cell level.{{Cite journal|last1=Mercatelli|first1=Daniele|last2=Balboni|first2=Nicola|last3=De Giorgio|first3=Francesca|last4=Aleo|first4=Emanuela|last5=Garone|first5=Caterina|last6=Giorgi|first6=Fedrico M.|date=2021-05-06|title=The Transcriptome of SH-SY5Y at Single-Cell Resolution: A CITE-Seq Data Analysis Workflow|journal=Methods and Protocols|volume=4|issue=2|pages=28|doi=10.3390/mps4020028|pmid=34066513|pmc=8163004|issn=2409-9279|doi-access=free }} So far, the method has been demonstrated to work with only a few proteins per cell. As such, it provides an additional layer of information for the same cell by combining both proteomics and transcriptomics data. For phenotyping, this method has been shown to be as accurate as flow cytometry (a gold standard) by the groups that developed it.{{Cite journal|last1=Stoeckius|first1=Marlon|last2=Hafemeister|first2=Christoph|last3=Stephenson|first3=William|last4=Houck-Loomis|first4=Brian|last5=Chattopadhyay|first5=Pratip K|last6=Swerdlow|first6=Harold|last7=Satija|first7=Rahul|last8=Smibert|first8=Peter|date=2017-07-31|title=Simultaneous epitope and transcriptome measurement in single cells|journal=Nature Methods|volume=14|issue=9|pages=865–868|doi=10.1038/nmeth.4380|issn=1548-7091|pmc=5669064|pmid=28759029}} It is currently one of the main methods, along with REAP-Seq, to evaluate both gene expression and protein levels simultaneously in different species.
The method was established by the New York Genome Center in collaboration with the [https://satijalab.org/ Satija lab]., while a similar approach was earlier shown by [https://patents.google.com/patent/US11156611B2/ AbVitro Inc.].
Applications
Concurrent measurement of both protein and transcript levels opens up opportunities to use CITE-Seq in various biological areas, some of which were touched upon by the developers. For instance, it may be used to characterize tumor heterogeneity in different cancers, a major research field.{{Cite journal|last1=Tirosh|first1=Itay|last2=Suvà|first2=Mario L.|s2cid=53969464|date=2018-11-16|title=Deciphering Human Tumor Biology by Single-Cell Expression Profiling|journal=Annual Review of Cancer Biology|volume=3|issue=1|doi=10.1146/annurev-cancerbio-030518-055609|issn=2472-3428|pages=151–166|doi-access=free}} It also permits identifying rare subpopulations of cells as a high-throughput single-cell method and thus detect information otherwise lost with bulk methods. It also may aid in tumor classification - for example, identification of novel subtypes. All of the above are possible due to single-cell output of both protein and transcript data at the same time, also leading to novel information on protein-RNA correlation.
It also has potential in immunology. For example, it can be utilized for immune cell characterization – recent research on T-cells has investigated the ability of T cells to maintain an effector state.{{Cite journal|last1=Gutierrez-Arcelus|first1=Maria|last2=Teslovich|first2=Nikola|last3=Mola|first3=Alex R.|last4=Polidoro|first4=Rafael B.|last5=Nathan|first5=Aparna|last6=Kim|first6=Hyun|last7=Hannes|first7=Susan|last8=Slowikowski|first8=Kamil|last9=Watts|first9=Gerald F. M.|date=2019-02-08|title=Lymphocyte innateness defined by transcriptional states reflects a balance between proliferation and effector functions|journal=Nature Communications|volume=10|issue=1|pages=687|doi=10.1038/s41467-019-08604-4|issn=2041-1723|pmc=6368609|pmid=30737409|bibcode=2019NatCo..10..687G}} Another study by one of CITE-Seq coauthors suggested CITE-Seq as a methods to look at the mechanisms of host-pathogen interactions.{{Cite journal|last1=Chattopadhyay|first1=Pratip K.|last2=Roederer|first2=Mario|last3=Bolton|first3=Diane L.|date=2018-11-06|title=A deadly dance: the choreography of host–pathogen interactions, as revealed by single-cell technologies|journal=Nature Communications|volume=9|issue=1|pages=4638|doi=10.1038/s41467-018-06214-0|pmid=30401874|pmc=6219517|issn=2041-1723|bibcode=2018NatCo...9.4638C}}
Workflow
CITE-seq, like any other sequencing technique, has a wet lab portion, where the actual antibodies are prepared, cells stained, cDNA synthesized and RNA libraries are prepared that are further sequenced, and a dry lab portion for analysis of the sequencing data obtained. The most crucial part in the wet lab experiments is designing the antibody-oligonucleotide conjugates and titrating the amount of each conjugate that needs to be present in the pool to achieve a desired read-out and quantification.
= Wet lab workflow =
The first step involves preparation of the antibody-oligo conjugates also known as Antibody-Derived Tags (ADTs). ADT preparation involves labeling an antibody directed against a cell surface protein of interest with oligonucleotides for barcoding the antibody.
Once you have the ADTs, the next step is to bind the cells with the desired ADT pool. The scRNA-seq libraries can be prepared using Drop-seq, 10X Genomics or ddSeq methods. In brief, ADT labelled cells are encapsulated within a droplet as single cells with DNA-barcoded microbeads.{{Cite journal|last1=Macosko|first1=Evan Z.|last2=Basu|first2=Anindita|last3=Satija|first3=Rahul|last4=Nemesh|first4=James|last5=Shekhar|first5=Karthik|last6=Goldman|first6=Melissa|last7=Tirosh|first7=Itay|last8=Bialas|first8=Allison R.|last9=Kamitaki|first9=Nolan|date=May 2015|title=Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets|journal=Cell|volume=161|issue=5|pages=1202–1214|doi=10.1016/j.cell.2015.05.002|pmid=26000488|issn=0092-8674|pmc=4481139}}
Within a droplet, the cells are next lysed to release both bound ADTs as well as mRNA. These then are converted to cDNA. Each DNA sequence on a microbead has a unique barcode thus indexing cDNA with cell barcodes. cDNA is prepared from both ADTs and cellular mRNAs.
In the next step, based on the developer's guidelines, cDNA is PCR-amplified and ADT cDNA and mRNA cDNA are separated based on size (generally, ADT-derived cDNAs are < 180bp and mRNA-derived cDNAs are > 300bp).{{Cite web|url=https://cite-seq.com/|title=CITE-seq|website=CITE-seq|access-date=2019-02-27}} Each of the separated cDNA molecules is independently amplified and purified to prepare sequencing libraries. Finally, the independent libraries are pooled together and sequenced. Thus, proteomics and transcriptomics data can be obtained from a single sequencing run.
= Dry lab workflow =
Analysis of single-cell sequencing presents many challenges, such as determining the best way to normalize the data.{{Citation|last=Gao|first=Shan|chapter=Data Analysis in Single-Cell Transcriptome Sequencing|date=2018|pages=311–326|publisher=Springer New York|isbn=9781493977161|doi=10.1007/978-1-4939-7717-8_18|pmid=29536451|title=Computational Systems Biology|volume=1754|series=Methods in Molecular Biology}} Due to a new level of complications that arise from sequencing of both proteins and transcripts at a single-cell level, the developers of CITE-Seq and their collaborators are maintaining several tools to help with data analysis.
scRNA-Seq data analysis based on the developer's guidelines:{{Cite journal|last1=Liu|first1=Serena|last2=Trapnell|first2=Cole|date=2016-02-17|title=Single-cell transcriptome sequencing: recent advances and remaining challenges|journal=F1000Research|doi=10.12688/f1000research.7223.1|issn=2046-1402|pmid=26949524|pmc=4758375|volume=5|page=182 |doi-access=free }} The initial analysis steps are the same as in a standard scRNA-Seq experiment. Firstly, reads need to be aligned to a reference genome of a species of interest and cells with very low number of transcripts mapped to the reference are removed. Finally, a normalized count matrix with gene expression values is obtained.
ADT data analysis{{Citation|last=Roelli|first=Patrick|title=Small script that allows to count TAGS from a CITE-seq experiment: Hoohm/CITE-seq-Count|date=2019-02-23|url=https://github.com/Hoohm/CITE-seq-Count|access-date=2019-02-27}}{{Cite web|url=https://satijalab.org/seurat/|title=Seurat|website=satijalab.org|access-date=2019-02-27}} (based on the developer's guidelines): CITE-seq-Count is a Python package from CITE-Seq developers that can be used to obtain raw counts. Seurat package from Satija lab further allows combining of the protein and RNA counts and performing clustering on both measurements, as well as doing differential expression analysis between cell clusters of interest. ADT quantification needs to take into account the differences between the antibodies. Additionally, filtering may be required to reduce noise, similarly to scRNA-Seq analysis. But in contrast to RNA data, due to higher amounts of protein in a cell, there is less dropout.
The analyses may result in identification of novel cell clusters through such methods as PCA or tSNE, crucial genes responsible for a specific cell function and other new knowledge specific to a question of interest. In general, the results obtained with ADT counts substantially increase the amount of information obtained through single cell transcriptomics.
Adaptations of the technique
The applications of antibody-oligonucleotide conjugates have expanded beyond CITE-seq, and can be adapted for sample multiplexing as well as CRISPR screens.
Cell Hashing: New York Genome Center further adapted the use of their antibody-oligonucleotide conjugates to enable sample multiplexing for scRNA-seq. This technique called, Cell Hashing,{{Cite journal|title=Cell "hashing" with barcoded antibodies enables multiplexing and doublet detection for single cell genomics|last1=Stoeckius|first1=Marlon|last2=Zheng|first2=Shiwei|date=2017-12-21|last3=Houck-Loomis|first3=Brian|last4=Hao|first4=Stephanie|last5=Yeung|first5=Bertrand|last6=Smibert|first6=Peter|last7=Satija|first7=Rahul|journal=Genome Biology |volume=19 |issue=1 |page=224 |doi=10.1186/s13059-018-1603-1|biorxiv=10.1101/237693|pmid=30567574 |doi-access=free|pmc=6300015}} uses oligonucleotide-labelled antibodies against ubiquitously expressed cell surface proteins from a particular tissue sample. In this case, an oligonucleotide sequence contains a unique barcode which would be specific to cells from distinct samples. This sample-specific cell tagging allows pooling of the sequencing libraries prepared from different samples on a sequencing platform. Sequencing the antibody tags along with the cellular transcriptome helps identify a sample of origin for each analyzed cell. A unique barcode sequence used on the cell hashing antibody can be designed to be different from an antibody barcode present on the ADTs used in CITE-seq. This makes it possible to couple cell hashing with CITE-seq on a single sequencing run. Cell hashing allows super-loading of the scRNA-seq platform, resulting in a lower cost of sequencing. It also enables detection of artifactual signals from multiplets, a major challenge in scRNA-seq. The cell hashing method has further been used by Gaublomme et al. to multiplex single-nucleus RNA-seq (snRNA-seq) by performing nucleus hashing.{{Cite journal|last1=Gaublomme|first1=Jellert T.|last2=Li|first2=Bo|last3=McCabe|first3=Cristin|last4=Knecht|first4=Abigail|last5=Drokhlyansky|first5=Eugene|last6=Van Wittenberghe|first6=Nicholas|last7=Waldman|first7=Julia|last8=Dionne|first8=Danielle|last9=Nguyen|first9=Lan|date=2018-11-23|title=Nuclei multiplexing with barcoded antibodies for single-nucleus genomics|journal=bioRxiv|doi=10.1101/476036|doi-access=free|hdl=1721.1/125028|hdl-access=free}}
ECCITE-seq: Expanded CRISPR-compatible Cellular Indexing of Transcriptomes and Epitopes by sequencing or ECCITE-seq was developed to apply the use of CITE-seq to characterize multiple modalities from a single cell. By modifying the basic CITE-seq protocol to a 5' tag-based scRNA-seq assay, it can detect transcriptome, immune receptor clonotypes, surface markers, sample identity and single guide RNAs (sgRNAs) from each single cell.{{Cite journal|last1=Mimitou|first1=Eleni|last2=Cheng|first2=Anthony|last3=Montalbano|first3=Antonino|last4=Hao|first4=Stephanie|last5=Stoeckius|first5=Marlon|last6=Legut|first6=Mateusz|last7=Roush|first7=Timothy|last8=Herrera|first8=Alberto|last9=Papalexi|first9=Efthymia|date=2018-11-08|title=Expanding the CITE-seq tool-kit: Detection of proteins, transcriptomes, clonotypes and CRISPR perturbations with multiplexing, in a single assay|journal=bioRxiv|doi=10.1101/466466|doi-access=free}} The ability of ECCITE-seq to detect sgRNA molecules and measure their effect on gene expression levels opens a prospect of applying this technique in CRISPR screens.
Advantages and Limitations of CITE-seq
Advantages: CITE-seq enables simultaneous analysis of the transcriptome as well as the proteome of single cells. Previous efforts of coupling index-sorting measurements from single cell sorts with scRNA-seq were limited to running a small sample size and were not compatible with multiplexing and massive parallel high-throughput sequencing. CITE-seq has been shown to be compatible with high-throughput microfluidic platforms like 10X Genomics and Drop-seq. It is also adaptable to micro/nano-well platforms. Coupling it with cell hashing enables the application of CITE-seq on bulk samples and sample multiplexing. These techniques work to reduce an overall cost of high-throughput sequencing on multiple samples. Lastly, CITE-seq can be adapted to detect small molecules, RNA interference, CRISPR, and other gene editing techniques.
Limitations: One of the limitations of CITE-Seq is a loss of location information. Due to the way the cells are treated, the spatial distribution of cells within a sample, as well as proteins within a cell is not known.{{Cite journal|last1=An|first1=Xingyue|last2=Varadarajan|first2=Navin|date=March 2018|title=Single-cell technologies for profiling T cells to enable monitoring of immunotherapies|journal=Current Opinion in Chemical Engineering|volume=19|pages=142–152|doi=10.1016/j.coche.2018.01.003|issn=2211-3398|pmc=6530921|pmid=31131208}} In addition, this method shares the challenges of scRNA-Seq, such as high amount of noise and possible challenges in detecting lowly expressed genes. In terms of phenotyping, optimization of the assay and antibodies also presents a potential problem if proteins of interest are not included in the currently available panels.{{Cite journal|last1=Baron|first1=Maayan|last2=Yanai|first2=Itai|date=2017-08-24|title=New skin for the old RNA-Seq ceremony: the age of single-cell multi-omics|journal=Genome Biology|volume=18|issue=1|pages=159|doi=10.1186/s13059-017-1300-5|pmid=28837001|pmc=5571565|issn=1474-760X |doi-access=free }} Moreover, right now CITE-Seq is not able to detect intracellular proteins. With the current protocol, there are many challenges that would arise during the permeabilization step, thus limiting the technique to surface markers.
Alternative methods
- REAP-seq: Peterson et al. from Merck developed a technique similar to CITE-seq called RNA Expression and Protein Sequencing assay (REAP-seq). While REAP-seq, similarly to CITE-seq, measures levels of both transcripts and proteins in a single cell, the difference between the two techniques is how the antibody is conjugated to the oligonucleotides. CITE-seq typically links the oligonucleotide to the antibody non-covalently, via streptavidin conjugation to the antibody and biotin conjugation to the oligonucleotide. REAP-seq covalently links the antibody and an aminated DNA barcode{{Cite journal|last1=Peterson|first1=Vanessa M|last2=Zhang|first2=Kelvin Xi|last3=Kumar|first3=Namit|last4=Wong|first4=Jerelyn|last5=Li|first5=Lixia|last6=Wilson|first6=Douglas C|last7=Moore|first7=Renee|last8=McClanahan|first8=Terrill K|last9=Sadekova|first9=Svetlana|date=2017-08-30|title=Multiplexed quantification of proteins and transcripts in single cells|journal=Nature Biotechnology|volume=35|issue=10|pages=936–939|doi=10.1038/nbt.3973|pmid=28854175|s2cid=205285357 |issn=1087-0156}}
- PLAYR: PLAYR or Proximal Ligation Assay for RNA makes use of mass spectrometry to simultaneously analyse the transcriptome and protein levels in single cells. In this technique both the proteins and RNA transcripts are labelled with isotope-conjugated antibodies and isotope-labelled probes, respectively, enabling their detection on a mass spectrometer{{Cite journal|last1=Frei|first1=Andreas P|last2=Bava|first2=Felice-Alessio|last3=Zunder|first3=Eli R|last4=Hsieh|first4=Elena W Y|last5=Chen|first5=Shih-Yu|last6=Nolan|first6=Garry P|last7=Gherardini|first7=Pier Federico|date=2016-01-25|title=Highly multiplexed simultaneous detection of RNAs and proteins in single cells|journal=Nature Methods|volume=13|issue=3|pages=269–275|doi=10.1038/nmeth.3742|issn=1548-7091|pmc=4767631|pmid=26808670}}
References
{{reflist}}