Immune Repertoire

Immune Repertoire

The immune repertoire refers to the complete collection of antigen receptors—T cell receptors (TCRs) and B cell receptors (BCRs)—present in an individual at any given time. Understanding the repertoire provides insights into immune status, disease states, and responses to infection or vaccination.

Overview

The adaptive immune system’s power lies in its ability to recognize virtually any antigen through an enormous diversity of receptors. Each lymphocyte expresses a unique receptor generated by V(D)J recombination, and the collective population of these receptors constitutes the immune repertoire.

What Defines the Repertoire?

ComponentDescription
Receptor sequencesThe unique TCR or BCR sequences present
Clonotype abundanceHow many cells express each receptor
Subset distributionDistribution across naive, memory, effector populations
Temporal dynamicsHow the repertoire changes over time
Spatial distributionVariation across tissues and compartments

Scale of Diversity

Theoretical vs. Realized Diversity

ParameterTCRBCR
Theoretical diversity10¹⁵ - 10¹⁸10¹¹ - 10¹³ (before SHM)
Actual clonotypes~10⁷ - 10⁸~10⁹ - 10¹⁰
Total cells~10¹¹ - 10¹²~10¹⁰ - 10¹¹
Naive repertoire~10⁷ unique clones~10⁸ - 10⁹ unique clones
Memory repertoire~10⁵ - 10⁶ clonesVariable

The realized repertoire is vastly smaller than theoretical diversity because:

  • Limited number of lymphocytes
  • Selection processes eliminate many specificities
  • Some receptors cannot fold or function properly

The Repertoire as an Immunological Fingerprint

Each individual’s repertoire is unique, shaped by:

FactorInfluence
GeneticsHLA type, germline V/D/J genes
Thymic/BM selectionCentral tolerance shapes what survives
Exposure historyInfections, vaccines, commensals
AgeThymic involution, accumulated responses
Health statusDisease, immunosuppression
EnvironmentGeography, lifestyle

Repertoire Organization

Defining a Clonotype

A clonotype is typically defined by:

  1. V gene segment used
  2. J gene segment used
  3. CDR3 nucleotide or amino acid sequence
  4. (For BCR) D gene segment and mutation status
  5. (For paired analysis) Both chains together

Example TCR Clonotype Notation:

TRBV7-2*01 | CASSLGQAYEQYF | TRBJ2-7*01
   (V gene)    (CDR3 amino acids)   (J gene)

Example BCR Clonotype Notation:

IGHV3-23*01 | CARDRGYSSSWFDPW | IGHJ4*02 | 8.5% mutated
   (V gene)     (HCDR3 amino acids)  (J gene)    (SHM level)

Clone Size Distribution

The repertoire follows a characteristic power-law distribution:

Clone CategoryFrequencyProportion of Repertoire
Rare clones (1-10 cells)Many unique sequences~80% of diversity
Medium clones (10-1000 cells)Moderate number~15% of diversity
Expanded clones (>1000 cells)Few clones~5% of diversity
Clone Size Distribution:
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Rare clones        │████████████████████  80% of unique sequences
Medium clones      │████  15%
Expanded clones    │█  5%
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Most diversity resides in rare clones, but expanded clones contribute significantly to function.

Repertoire Compartments

Different lymphocyte populations have distinct repertoire characteristics:

CompartmentDiversityClone SizesCDR3 Features
Naive T cellsVery highSmall, uniformRandom distribution
Central memoryModerateVariableAntigen-selected
Effector memoryLowerOften expandedAntigen-selected
EffectorsLowestLarge clonesHighly selected
TregsModerateVariableSelf-reactive bias
Naive B cellsVery highSmallUnmutated
Memory B cellsModerateVariableSHM-diversified
Plasma cellsLowClone-specificHighly mutated

Measuring the Repertoire

The Sampling Challenge

The iceberg problem: Any sample captures only a fraction of the total repertoire.

Sample Source% of Total LymphocytesImplications
Peripheral blood~2% of totalMost accessible; may miss tissue-resident cells
Typical sample10⁶ - 10⁷ cellsCaptures abundant clones well; rare clones undersampled
Sequencing depthVariableDetermines detection sensitivity

Strategies to Address:

  • Adequate input cell numbers
  • Sufficient sequencing depth
  • Statistical estimators for unseen diversity
  • Rarefaction analysis

Diversity Metrics

Richness (Clone Counting)

Definition: Total number of unique clonotypes observed

MetricDescription
Observed richnessDirect count of unique sequences
Chao1 estimatorEstimates true richness from sampling
ACEAbundance-based coverage estimator
RarefactionRichness at standardized sampling depth

Limitation: Heavily dependent on sampling depth—deeper sequencing always finds more rare clones.

Diversity Indices

IndexFormulaInterpretation
Shannon entropy (H)H = -Σ pᵢ log(pᵢ)Accounts for abundance distribution; higher = more diverse
Simpson index (D)D = Σ pᵢ²Probability of sampling same clone twice; lower = more diverse
Inverse Simpson (1/D)1/DEffective number of equally abundant clones
Gini coefficientArea under Lorenz curveInequality measure; 0 = equal, 1 = dominated by one clone

Where pᵢ = proportion of clone i in the sample.

Clonality

Definition: Measure of how dominated the repertoire is by expanded clones

Clonality = 1 - (Shannon entropy / log₂(richness))
Clonality ValueInterpretation
0Maximally diverse; all clones equal
~0.1-0.3Typical healthy repertoire
~0.5Moderately clonal
1Monoclonal; single clone dominates

Sequencing Approaches

MethodChain InfoThroughputCostBest Application
Bulk TCR/BCR-seqSingle chainVery highLowRepertoire surveys, biomarkers
Single-cell TCR/BCR-seqPaired chainsMediumHighFunctional studies, antigen specificity
Spatial transcriptomicsVariableLow-mediumHighTissue architecture
Long-read sequencingFull lengthMediumMediumFull V region; SHM analysis

Repertoire Dynamics

During Immune Responses

Acute Infection Timeline:

PhaseDaysRepertoire Changes
Recognition0-3Naive repertoire scanned for reactive clones
Expansion3-7Selected clones expand dramatically (up to 10⁶-fold)
Peak effector7-14Maximum clonal expansion; reduced diversity
Contraction14-2890-95% of effectors die
Memory28+5-10% survive as memory; diversity partially restored

Vaccination:

  • Similar expansion/contraction pattern
  • Detectable memory repertoire shift for years
  • Booster doses recall and further expand specific clones
Age GroupRepertoire Features
NewbornVery diverse naive repertoire; no memory
ChildhoodRapidly expanding memory from exposures
AdultStable balance of naive and memory
ElderlyReduced diversity; memory-dominated; CMV-driven expansions

Immunosenescence involves:

  • Thymic involution → fewer new naive T cells
  • Accumulated memory → oligoclonal expansions
  • Reduced diversity → impaired responses to new antigens
  • Inflammaging → chronic low-grade activation

Tissue-Specific Repertoires

TissueRepertoire Characteristics
BloodCirculating pool; sampling most accessible
Lymph nodesGC reactions; memory development
SpleenMarginal zone B cells; blood filtration
Gut (GALT)IgA-dominated; commensal-specific
SkinTissue-resident memory; recirculating cells
TumorTILs may be clonally expanded; exhausted phenotype

Clinical Applications

Diagnostic Uses

Clonality Assessment

PatternInterpretationExample Conditions
MonoclonalSingle dominant cloneLymphoma, leukemia
OligoclonalFew expanded clonesChronic infection, some autoimmune
PolyclonalDiverse, no dominanceReactive/benign, healthy

Clinical Application: Distinguishing malignancy from reactive lymphocytosis.

Minimal Residual Disease (MRD)

  • Track tumor-specific clonotype after treatment
  • Detect relapse before clinical symptoms
  • Sensitivity: 1 in 10⁵ - 10⁶ cells
  • Guide therapy decisions (continue vs. stop treatment)

Autoimmune Disease Signatures

DiseaseRepertoire Feature
Ankylosing spondylitisHLA-B27-restricted TCR signatures
Celiac diseaseGluten-specific public TCRs
Type 1 diabetesIslet-specific TCR motifs
Multiple sclerosisMyelin-reactive clones
Rheumatoid arthritisCitrulline-specific BCRs

Application: Biomarkers for diagnosis, monitoring, patient stratification.

Transplant Monitoring

ApplicationRepertoire Analysis
Immune reconstitutionDiversity recovery after HSCT
Rejection predictionAlloreactive clone detection
GVHD monitoringDonor T cell expansion patterns

Research Applications

AreaRepertoire Insight
Vaccine developmentCharacterize protective responses
Infection biologyIdentify pathogen-specific clones
Cancer immunotherapyNeoantigen-reactive TILs
Aging researchImmunosenescence mechanisms
AutoimmunityAutoreactive clone identification

Public vs. Private Repertoire

Public Sequences

Some TCR/BCR sequences are found across multiple individuals:

ReasonExplanation
Convergent recombinationSame sequence generated independently due to simpler junctions
Shared antigen exposureCommon pathogens (CMV, EBV, influenza)
HLA-restricted sharingSimilar HLA types select similar TCRs
Structural constraintsSome sequences are “preferred” solutions

Significance: Public sequences can serve as biomarkers across populations.

Private Sequences

Most sequences are unique to an individual:

FeatureImplication
Stochastic V(D)JRandom recombination outcomes
Personal historyUnique exposure patterns
Same function, different sequenceConvergent recognition with divergent receptors

Repertoire Analysis Workflow

Typical Pipeline

1. Sample Collection
   └── Blood, tissue, or sorted cells

2. Library Preparation
   ├── RNA extraction
   ├── Reverse transcription
   ├── PCR amplification (multiplex or 5'RACE)
   ├── UMI incorporation (for quantification)
   └── Adapter ligation

3. Sequencing
   └── Illumina, PacBio, or other platform

4. Bioinformatics
   ├── Quality filtering
   ├── UMI deduplication
   ├── V(D)J assignment (IMGT reference)
   ├── CDR3 extraction
   ├── Error correction
   └── Clonotype quantification

5. Analysis
   ├── Diversity metrics
   ├── Clone tracking
   ├── Motif analysis
   ├── Lineage clustering (BCR)
   └── Statistical comparisons

Key Considerations

FactorRecommendation
Batch effectsProcess comparison samples together
Technical replicatesAssess reproducibility
Sequencing depthDetermine adequate depth empirically
NormalizationRarefaction or statistical methods
Multiple testingCorrect for many comparisons

Common Analysis Tools

ToolApplication
MiXCRV(D)J alignment and assembly
IMGT/HighV-QUESTReference database and analysis
immunarchR package for repertoire analysis
VDJtoolsDiversity, clonality, overlap
Change-O/AlakazamBCR lineage and SHM analysis
tcrdist3TCR distance and clustering
GLIPH2TCR specificity grouping

Key Concepts

  1. The immune repertoire is the complete collection of antigen receptors, characterized by clonotype identity and abundance

  2. Diversity metrics (Shannon entropy, Simpson index, clonality) quantify repertoire complexity beyond simple clone counts

  3. Sampling depth critically affects detection—rare clones require deep sequencing

  4. Repertoire dynamics reflect immune history—acute responses cause expansion, memory maintains selected clones

  5. Clinical applications range from MRD detection to autoimmune diagnosis to transplant monitoring

  6. Public sequences shared across individuals may serve as population-level biomarkers

  7. Paired chain information is essential for definitive antigen specificity and many diagnostic applications

References

  1. Robins H. (2013). Immunosequencing: applications of immune repertoire deep sequencing. Current Opinion in Immunology, 25:646-652.

  2. Six A, et al. (2013). The past, present and future of immune repertoire biology. Frontiers in Immunology, 4:413.

  3. Greiff V, et al. (2017). Systems analysis reveals high genetic and antigen-driven predetermination of antibody repertoires throughout B cell development. Cell Reports, 19:1467-1478.

  4. Emerson RO, et al. (2017). Immunosequencing identifies signatures of cytomegalovirus exposure history and HLA-mediated effects on the T cell repertoire. Nature Genetics, 49:659-665.