Science Glossary

Last modified 2024-09-16

TermAbbreviationDefinition
Numbers
10x genomicsSingle-cell RNA-seq (scRNA-seq) technology (file output: .fastq).
A, B, C
agentIndividual or group that collects, analyzes, or monitors datasets or projects.
aliquotA subsample of a larger sample that's extracted for analysis.
assayAn investigative (analytic) procedure in laboratory medicine, pharmacology, environmental biology, and molecular biology for qualitatively assessing or quantitatively measuring the presence, amount, or functional activity of a target entity (the analyte).
batchA set of biological samples processed in tandem, such as a sequencing batch, in which multiple libraries or pools are prepared and submitted together for sequencing.
binary base call formatBCL formatContains the base calls and quality values for each cycle. These files are generated by the sequencing process.
cell barcodeA unique identifier for an individual cell from a well, given a by Cell Ranger in the dry lab (see cell hashing). A sequence that uniquely identifies one or more cells that shared a droplet. Each droplet in a well should have a unique cell barcode, but multiple wells can share a cell barcode. Cell barcodes are attached to 10x GEM beads, and become physically attached to gene sequences during reverse transcription.
cell gatingPlacement of gates and regions around populations of cells with common characteristics (see BiteSize Bio).
cell hashingA combination of methods used to tag individual cells from a number of populations (in our case, usually patient samples) by attaching a population-specific label or barcode. This allows multiple populations to be mixed for experimental steps to reduce handling variation between samples. Later, the original populations can be identified based on their population labels/hashes.
cytometry by time of flightCYTOF or CyTOF

Use of mass spectroscopy on single cells to measure metal isotope labels on antibodies.

cytometerinstrument that counts the blood cells in the common blood test.
cytometryMeasurement of cell characteristics.
D, E, F
data hydrationFilling in an object or structured entity with data from various sources.
datasetA collection of files generated in a single acquisition or analysis session. 
decorator serviceMicroservice designed and built by the AIFI software development team to tag files/data with metadata.
FASTQ formatText-based sequencing data file format that stores both raw sequence data and quality scores and is refined via BCL.
flow cellA component of Illumina sequencing. A glass slide containing small fluidic channels (lanes) onto which one or more sequencing libraries or sequencing pools are applied. Each flow cell has one or more lane(s) (depending on the sequencer). Our pipeline work is performed on sequencers with multiple lanes. Lab pilots are performed on a MiSeq, which has a single lane.
flow cytometryTechnique in which a laser is used to detect and measure physical and chemical characteristics of a population of cells or particles.
flow cytometry standardFCSA data file standard developed in 1984 for reading and writing data from flow cytometry experiments (see the Wikipedia entry on this topic).
G, H, I
gating, supervised

The traditional, typically manual process of distinguishing types of cell, such as T cells and B cells. Experts decide on a particular method and series of steps to do the gating. When the process is automated, software runs through that series of steps, mimicking the behavior of a subject matter expert.

gating, unsupervisedAnalysis of data distribution to arrive at the statistically most promising way to cluster groups of cells.
GlobusA commercial research data management service (see the company's website).
h5A scientific data file that's saved in the hierarchical data format (HDF) and contains multidimensional arrays.
hashtag oligoHTO

Application of a piece of single-stranded DNA to a cell-surface-targeted antibody in order to tag all cells in an individual sample for the purpose of identification (see cell hashing). These antibodies should be able to bind to all cell types. HTOs contain a barcode sequence (an HTO barcode) that can be used to identify a population of cells labeled with this antibody after capture and sequencing.

hierarchical stochastic neighbor embeddingHSNETechnique used for analysis of mass cytometry data sets. HSNE constructs a hierarchy of non-linear similarities that can be interactively explored with a stepwise increase in detail up to the single-cell level.
high-performance computingHPC
Human Immune System ExplorerHISEOpen science platform used to share, analyze, and interpret studies of the human immune system in health and disease. 
i7 indexIdentifier for a 10x chromium well channel.
J, K, L
lab, dryRefers to any lab work that does not involve a physical lab.
lab, wetRefers to any lab work involving a physical lab.
laboratory information management system LIMSA software system for efficient, compliant laborabory operations, such as management of samples and resulting data sets.
ledger data serviceLDSThe store for ingested data of all types, including samples, subjects, subject lab results, and survey data.
ligandA molecule that binds to another (usually larger) molecule. 
M, N, O
metadataUser-defined data enabling interpretation of primary data. 
P, Q, R
PCR primerA (usually short) single strand of DNA used to initiate PCR. Primers set the boundaries of the DNA to be copied/amplified. They can be used to add Indexes to the ends of pieces of DNA.
policy groupA group of agents with a shared access policy. 
polymerase chain reaction PCRA method for copying (often called "amplifying") specific DNA sequences.
poolA mixture of biological samples that are then processed together as a single entity.
pool/batchIn the context of cell hashing, a pool or batch consists of one group of samples uniquely labeled by HTO.
pool, sampleA sample pool is a mixture of patient samples, usually after each sample has been separately labeled by cell hashing.
pool, sequencingA sequencing pool is a mixture of sequencing libraries that is applied together to one or more flow cells.
projectWork with a common purpose associated with one or more datasets.
pub/subA publisher–subscriber relationship in the cloud (see What is Cloud Pub/Sub?).
S, T, U
sampleGenetic data attributed to an individual.
Scientific Advisory BoardSABThe advisory board for the Allen Institute.
TEA-seqTrimodal assay for integrated single-cell measurement of transcription, epitopes, and chromatin accessibility.
toolchainSet of programming tools used to perform a complex software development task or to create a software product, which is typically another computer program or a set of related programs.
V, W
wellA physical vessel used by the 10x chromium device to handle and process genetic data.
X, Y, Z