TCR_Explore was designed as an open-access web server that analyses and visualises TCR repertoire data without the need for coding expertise. TCR_Explore introduces multiple pipelines using an automated process that includes pairing of αβ or γδ chains, as well as facilitating interrogation of linked flow cytometric index data for immunophenotyping analyses. Additionally, automated summarisation process from a single input file enables the creation of a variety of publication-ready analytical plots.
There are three main sections:
Please contact: Kerry.Mullan1@monash.edu or Nicole.Mifsud@monash.edu to report errors.
Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
Step 1. open the TCR_Explore.Rproj file
Step 2. open the install.packages.TCR_Explore.R file
Step 3. Run each line (top right)
Possible installing prompts
When you see this line: “These packages have more recent versions available. It is recommended to update all of them. Which would you like to update? Enter one or more numbers, or an empty line to skip updates:”, answer with 1 and hit Enter.
On a Mac if you see “Do you want to install from sources the package which needs compilation? (Yes/no/cancel)”, answer with no and hit Enter. This same message will appear as an popup on Windows.
Once all packages are installed, this will be the final section of the installation process.
* installing *source* package ‘TCR_explore.R’ ...
** using staged installation
** R
** inst
** byte-compile and prepare package for lazy loading
** help
*** installing help indices
*** copying figures
** building package indices
** testing if installed package can be loaded from temporary location
** testing if installed package can be loaded from final location
** testing if installed package keeps a record of temporary installation path
* DONE (TCR_explore.R)
Step 4. open the TCR_Explore_v1.0.R and hit RunApp (top-right corner)
Step 1. Convert the .seq to .fasta files (QC -> SEQ to FASTA file merger)
Step 2. Upload the .fasta file to IMGT/Vquest
Step 3. Downloading TCR_Explore QC file
NOTE: If steps 1 and 2 were completed prior to this process and the Junction sheet is missing, the following process can be completed with 'Summary' sheet only. However, without this sheet, one cannot download the TCRdist3 file in the 'TCR analysis -> overview of TCR pairing -> summary table', as the JUNCTION nucleotide sequences will be missing.
Step 4. Fill in the QC file Copy all sequences into the one .csv file, if more than one plate is present add a number in front of the initial well.
The program adds three columns to the end of the file
NA
. These NA
will need to be replaced with either 'pass' or 'fail' in based on the chromatogram quality NA
e.g. High quality sequence (pass), two sequence (fail), cannot resolve frameshift/stop sequence (fail), messy sequences (fail)Step 5. Creating the file needed for paired TCR_Explore
This will render two tables in this section:
Download the paired chain file.
If needed also download the TCRdist output file.
Step 5. Downloading filtered file
A video explain this process is in available.
For the ImmunoSEQ processed data, TCR_Explore will remove rows with missing information (e.g. NA in both V and J genes) NOTE: For other sources, the user will have to manually remove non-functional sequence. Contact Kerry.mullan@monash.edu if you have a specific filtering requirement.
TCR analysis section
Motif analysis section
Diversity and chain interrogation
Group overlap analysis
Upload the file. This can be from our QC section or alternative sources. The other features in the side panel are
summary table
Treemap
Chord diargram
Pie chart
The user can specify the type of summary table to download.
They can either select their own columns (general summary) or downlaod as TCRdist3 .csv output.
For the TCRdist3, there is a need to use our QC process as it matches the IMGT column names.
There is also a need to select if the input data is either alpha-beta (ab) or gamma-delta (gd) for the TCRdist3 column selection.
TCR analysis section
Go to top
The user can specify:
TCR analysis section
Go to top
There are several features the user can specify:
TCR analysis section
Go to top
There are several features the user can specify:
TCR analysis section
Go to top
CDR3 length distribution
Single length motif analysis
Aligned motif analysis
The length distribution presented is by the unique CDR3 sequences.
The user can specify:
Motif analysis section
Go to top
The nucleotide and amino acid plots show the unique sequences of a certain length (e.g. 15)
These are displayed as 'Motif (amino acid)' and 'Motif (nucleotide)'
The 'Motif (amino acid)' can also compare two groups of the same sequence.
Motif analysis section
Go to top
This section can align the sequences using 'muscle' package.
Motif analysis section
Go to top
Chain usage
Diversity of TCR sequence
Diversity and chain interrogation
Go to top
The top panel showcases the Inverse Simpson Diversity index (SDI) table. This table can be downloaded, which may be needed with more complex designs (ANOVA).
The bottom panel showcases the graphical outputs and simple t-test.
Diversity and chain interrogation
Go to top
There are two graphs to this overlap section.
The Heatmap plot can display data of a specific group/individual (Select specific groups=yes; selected group=“E10630.CD8”). The user may wish to display the x by y of AV vs BV.
If 'Select specific groups=no', the user can showcase the multiple individuals on either the x or y axis (see image below)
TCR analysis section
Go to top
The upset plot can highlight if the specific clonotypes overlap.
Please contact: Nicole.Mifsud@monash.edu or Kerry.Mullan@monash.edu to report errors.
The first tab is used to merge the paired TCR file with the .fcs FACS file.
The user needs to type into the “Group of data” (e.g. other) and “Individual of data” (e.g. 780) the group and individual. There is also the option to specify if multiple plates were used. However, if there is only one group and individual, only the header will show (i.e. group and Indiv), but the data will pair correctly (see test-data example)
The merged file is based on a 80 well sorted plate (A1-H10). Columns 11 and 12 are not included, which is based on the experimental setup.
upload the merged index paired TCR data file.
Things to do before uploading the file
Recommended selecting for ab TCR data: Indiv, group,TRBV,CDR3b.Sequence, TRBJ, TRAV, CDR3a.Sequence, TRAJ, AJ, BJ and AJBJ. Do not select flurochrome columns, cloneCount
Creating the files
Note: I would recommend leaving the clonal filter at 0 or 1. I would then copy these columns in excel followed by removing unwanted clones rather than having to redo this step.
User defined variables include:
A high quality chormatogram will display few mismatches (Blue) between the primary and secondary sequences.
Showcasing the heterogeneous sequences in the .ab1 file
Overlap of the primary and secondary sequence.
Checking for heterozygosity in sequence overlap.
Check heterogenatiy of sequences with a 0.33 ratio cut-off as per the 'sangerseqR' package recommendation
Rows with missing sequences are removed from V and J gene columns
If using ImmunoSEQ data, there is an additional filtering step to only keep in-frame sequences
ASN = amino acid data and DNA = DNA data
Inverse Simpson Diversity Index: ∞=infinite diversity and 1=limited diversity