Immune Repertoire
Immune Repertoire
The immune repertoire refers to the complete collection of antigen receptors—T cell receptors (TCRs) and B cell receptors (BCRs)—present in an individual at any given time. Understanding the repertoire provides insights into immune status, disease states, and responses to infection or vaccination.
Overview
The adaptive immune system’s power lies in its ability to recognize virtually any antigen through an enormous diversity of receptors. Each lymphocyte expresses a unique receptor generated by V(D)J recombination, and the collective population of these receptors constitutes the immune repertoire.
What Defines the Repertoire?
| Component | Description |
|---|---|
| Receptor sequences | The unique TCR or BCR sequences present |
| Clonotype abundance | How many cells express each receptor |
| Subset distribution | Distribution across naive, memory, effector populations |
| Temporal dynamics | How the repertoire changes over time |
| Spatial distribution | Variation across tissues and compartments |
Scale of Diversity
Theoretical vs. Realized Diversity
| Parameter | TCR | BCR |
|---|---|---|
| Theoretical diversity | 10¹⁵ - 10¹⁸ | 10¹¹ - 10¹³ (before SHM) |
| Actual clonotypes | ~10⁷ - 10⁸ | ~10⁹ - 10¹⁰ |
| Total cells | ~10¹¹ - 10¹² | ~10¹⁰ - 10¹¹ |
| Naive repertoire | ~10⁷ unique clones | ~10⁸ - 10⁹ unique clones |
| Memory repertoire | ~10⁵ - 10⁶ clones | Variable |
The realized repertoire is vastly smaller than theoretical diversity because:
- Limited number of lymphocytes
- Selection processes eliminate many specificities
- Some receptors cannot fold or function properly
The Repertoire as an Immunological Fingerprint
Each individual’s repertoire is unique, shaped by:
| Factor | Influence |
|---|---|
| Genetics | HLA type, germline V/D/J genes |
| Thymic/BM selection | Central tolerance shapes what survives |
| Exposure history | Infections, vaccines, commensals |
| Age | Thymic involution, accumulated responses |
| Health status | Disease, immunosuppression |
| Environment | Geography, lifestyle |
Repertoire Organization
Defining a Clonotype
A clonotype is typically defined by:
- V gene segment used
- J gene segment used
- CDR3 nucleotide or amino acid sequence
- (For BCR) D gene segment and mutation status
- (For paired analysis) Both chains together
Example TCR Clonotype Notation:
TRBV7-2*01 | CASSLGQAYEQYF | TRBJ2-7*01
(V gene) (CDR3 amino acids) (J gene)
Example BCR Clonotype Notation:
IGHV3-23*01 | CARDRGYSSSWFDPW | IGHJ4*02 | 8.5% mutated
(V gene) (HCDR3 amino acids) (J gene) (SHM level)
Clone Size Distribution
The repertoire follows a characteristic power-law distribution:
| Clone Category | Frequency | Proportion of Repertoire |
|---|---|---|
| Rare clones (1-10 cells) | Many unique sequences | ~80% of diversity |
| Medium clones (10-1000 cells) | Moderate number | ~15% of diversity |
| Expanded clones (>1000 cells) | Few clones | ~5% of diversity |
Clone Size Distribution:
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Rare clones │████████████████████ 80% of unique sequences
Medium clones │████ 15%
Expanded clones │█ 5%
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Most diversity resides in rare clones, but expanded clones contribute significantly to function.
Repertoire Compartments
Different lymphocyte populations have distinct repertoire characteristics:
| Compartment | Diversity | Clone Sizes | CDR3 Features |
|---|---|---|---|
| Naive T cells | Very high | Small, uniform | Random distribution |
| Central memory | Moderate | Variable | Antigen-selected |
| Effector memory | Lower | Often expanded | Antigen-selected |
| Effectors | Lowest | Large clones | Highly selected |
| Tregs | Moderate | Variable | Self-reactive bias |
| Naive B cells | Very high | Small | Unmutated |
| Memory B cells | Moderate | Variable | SHM-diversified |
| Plasma cells | Low | Clone-specific | Highly mutated |
Measuring the Repertoire
The Sampling Challenge
The iceberg problem: Any sample captures only a fraction of the total repertoire.
| Sample Source | % of Total Lymphocytes | Implications |
|---|---|---|
| Peripheral blood | ~2% of total | Most accessible; may miss tissue-resident cells |
| Typical sample | 10⁶ - 10⁷ cells | Captures abundant clones well; rare clones undersampled |
| Sequencing depth | Variable | Determines detection sensitivity |
Strategies to Address:
- Adequate input cell numbers
- Sufficient sequencing depth
- Statistical estimators for unseen diversity
- Rarefaction analysis
Diversity Metrics
Richness (Clone Counting)
Definition: Total number of unique clonotypes observed
| Metric | Description |
|---|---|
| Observed richness | Direct count of unique sequences |
| Chao1 estimator | Estimates true richness from sampling |
| ACE | Abundance-based coverage estimator |
| Rarefaction | Richness at standardized sampling depth |
Limitation: Heavily dependent on sampling depth—deeper sequencing always finds more rare clones.
Diversity Indices
| Index | Formula | Interpretation |
|---|---|---|
| Shannon entropy (H) | H = -Σ pᵢ log(pᵢ) | Accounts for abundance distribution; higher = more diverse |
| Simpson index (D) | D = Σ pᵢ² | Probability of sampling same clone twice; lower = more diverse |
| Inverse Simpson (1/D) | 1/D | Effective number of equally abundant clones |
| Gini coefficient | Area under Lorenz curve | Inequality measure; 0 = equal, 1 = dominated by one clone |
Where pᵢ = proportion of clone i in the sample.
Clonality
Definition: Measure of how dominated the repertoire is by expanded clones
Clonality = 1 - (Shannon entropy / log₂(richness))
| Clonality Value | Interpretation |
|---|---|
| 0 | Maximally diverse; all clones equal |
| ~0.1-0.3 | Typical healthy repertoire |
| ~0.5 | Moderately clonal |
| 1 | Monoclonal; single clone dominates |
Sequencing Approaches
| Method | Chain Info | Throughput | Cost | Best Application |
|---|---|---|---|---|
| Bulk TCR/BCR-seq | Single chain | Very high | Low | Repertoire surveys, biomarkers |
| Single-cell TCR/BCR-seq | Paired chains | Medium | High | Functional studies, antigen specificity |
| Spatial transcriptomics | Variable | Low-medium | High | Tissue architecture |
| Long-read sequencing | Full length | Medium | Medium | Full V region; SHM analysis |
Repertoire Dynamics
During Immune Responses
Acute Infection Timeline:
| Phase | Days | Repertoire Changes |
|---|---|---|
| Recognition | 0-3 | Naive repertoire scanned for reactive clones |
| Expansion | 3-7 | Selected clones expand dramatically (up to 10⁶-fold) |
| Peak effector | 7-14 | Maximum clonal expansion; reduced diversity |
| Contraction | 14-28 | 90-95% of effectors die |
| Memory | 28+ | 5-10% survive as memory; diversity partially restored |
Vaccination:
- Similar expansion/contraction pattern
- Detectable memory repertoire shift for years
- Booster doses recall and further expand specific clones
Age-Related Changes
| Age Group | Repertoire Features |
|---|---|
| Newborn | Very diverse naive repertoire; no memory |
| Childhood | Rapidly expanding memory from exposures |
| Adult | Stable balance of naive and memory |
| Elderly | Reduced diversity; memory-dominated; CMV-driven expansions |
Immunosenescence involves:
- Thymic involution → fewer new naive T cells
- Accumulated memory → oligoclonal expansions
- Reduced diversity → impaired responses to new antigens
- Inflammaging → chronic low-grade activation
Tissue-Specific Repertoires
| Tissue | Repertoire Characteristics |
|---|---|
| Blood | Circulating pool; sampling most accessible |
| Lymph nodes | GC reactions; memory development |
| Spleen | Marginal zone B cells; blood filtration |
| Gut (GALT) | IgA-dominated; commensal-specific |
| Skin | Tissue-resident memory; recirculating cells |
| Tumor | TILs may be clonally expanded; exhausted phenotype |
Clinical Applications
Diagnostic Uses
Clonality Assessment
| Pattern | Interpretation | Example Conditions |
|---|---|---|
| Monoclonal | Single dominant clone | Lymphoma, leukemia |
| Oligoclonal | Few expanded clones | Chronic infection, some autoimmune |
| Polyclonal | Diverse, no dominance | Reactive/benign, healthy |
Clinical Application: Distinguishing malignancy from reactive lymphocytosis.
Minimal Residual Disease (MRD)
- Track tumor-specific clonotype after treatment
- Detect relapse before clinical symptoms
- Sensitivity: 1 in 10⁵ - 10⁶ cells
- Guide therapy decisions (continue vs. stop treatment)
Autoimmune Disease Signatures
| Disease | Repertoire Feature |
|---|---|
| Ankylosing spondylitis | HLA-B27-restricted TCR signatures |
| Celiac disease | Gluten-specific public TCRs |
| Type 1 diabetes | Islet-specific TCR motifs |
| Multiple sclerosis | Myelin-reactive clones |
| Rheumatoid arthritis | Citrulline-specific BCRs |
Application: Biomarkers for diagnosis, monitoring, patient stratification.
Transplant Monitoring
| Application | Repertoire Analysis |
|---|---|
| Immune reconstitution | Diversity recovery after HSCT |
| Rejection prediction | Alloreactive clone detection |
| GVHD monitoring | Donor T cell expansion patterns |
Research Applications
| Area | Repertoire Insight |
|---|---|
| Vaccine development | Characterize protective responses |
| Infection biology | Identify pathogen-specific clones |
| Cancer immunotherapy | Neoantigen-reactive TILs |
| Aging research | Immunosenescence mechanisms |
| Autoimmunity | Autoreactive clone identification |
Public vs. Private Repertoire
Public Sequences
Some TCR/BCR sequences are found across multiple individuals:
| Reason | Explanation |
|---|---|
| Convergent recombination | Same sequence generated independently due to simpler junctions |
| Shared antigen exposure | Common pathogens (CMV, EBV, influenza) |
| HLA-restricted sharing | Similar HLA types select similar TCRs |
| Structural constraints | Some sequences are “preferred” solutions |
Significance: Public sequences can serve as biomarkers across populations.
Private Sequences
Most sequences are unique to an individual:
| Feature | Implication |
|---|---|
| Stochastic V(D)J | Random recombination outcomes |
| Personal history | Unique exposure patterns |
| Same function, different sequence | Convergent recognition with divergent receptors |
Repertoire Analysis Workflow
Typical Pipeline
1. Sample Collection
└── Blood, tissue, or sorted cells
2. Library Preparation
├── RNA extraction
├── Reverse transcription
├── PCR amplification (multiplex or 5'RACE)
├── UMI incorporation (for quantification)
└── Adapter ligation
3. Sequencing
└── Illumina, PacBio, or other platform
4. Bioinformatics
├── Quality filtering
├── UMI deduplication
├── V(D)J assignment (IMGT reference)
├── CDR3 extraction
├── Error correction
└── Clonotype quantification
5. Analysis
├── Diversity metrics
├── Clone tracking
├── Motif analysis
├── Lineage clustering (BCR)
└── Statistical comparisons
Key Considerations
| Factor | Recommendation |
|---|---|
| Batch effects | Process comparison samples together |
| Technical replicates | Assess reproducibility |
| Sequencing depth | Determine adequate depth empirically |
| Normalization | Rarefaction or statistical methods |
| Multiple testing | Correct for many comparisons |
Common Analysis Tools
| Tool | Application |
|---|---|
| MiXCR | V(D)J alignment and assembly |
| IMGT/HighV-QUEST | Reference database and analysis |
| immunarch | R package for repertoire analysis |
| VDJtools | Diversity, clonality, overlap |
| Change-O/Alakazam | BCR lineage and SHM analysis |
| tcrdist3 | TCR distance and clustering |
| GLIPH2 | TCR specificity grouping |
Key Concepts
-
The immune repertoire is the complete collection of antigen receptors, characterized by clonotype identity and abundance
-
Diversity metrics (Shannon entropy, Simpson index, clonality) quantify repertoire complexity beyond simple clone counts
-
Sampling depth critically affects detection—rare clones require deep sequencing
-
Repertoire dynamics reflect immune history—acute responses cause expansion, memory maintains selected clones
-
Clinical applications range from MRD detection to autoimmune diagnosis to transplant monitoring
-
Public sequences shared across individuals may serve as population-level biomarkers
-
Paired chain information is essential for definitive antigen specificity and many diagnostic applications
Related Articles
- T Cell Receptor Structure — TCR architecture
- V(D)J Recombination — How diversity is generated
- Somatic Hypermutation — BCR diversification
- T Cell Development — Thymic selection shapes the T cell repertoire
- B Cell Development — Bone marrow checkpoints shape B cell repertoire
References
-
Robins H. (2013). Immunosequencing: applications of immune repertoire deep sequencing. Current Opinion in Immunology, 25:646-652.
-
Six A, et al. (2013). The past, present and future of immune repertoire biology. Frontiers in Immunology, 4:413.
-
Greiff V, et al. (2017). Systems analysis reveals high genetic and antigen-driven predetermination of antibody repertoires throughout B cell development. Cell Reports, 19:1467-1478.
-
Emerson RO, et al. (2017). Immunosequencing identifies signatures of cytomegalovirus exposure history and HLA-mediated effects on the T cell repertoire. Nature Genetics, 49:659-665.