Antibody Sequence Liabilities Assessment

In antibody discovery, identifying promising candidates goes beyond just their binding affinity. It's crucial to also assess their potential for manufacturability, stability, and safety. The developability of an antibody is a critical factor that determines its likelihood of becoming a successful therapeutic. This refers to a set of physicochemical properties that make an antibody amenable to being manufactured at a large scale, formulated for stability, and safe for administration to patients.

A key aspect of developability is the assessment of sequence liabilities — specific amino acid motifs within the antibody sequence that can predispose it to undesirable chemical or physical modifications. These modifications can lead to a range of issues, including:

Loss of potency: Modifications in the antigen-binding regions (CDRs) can reduce or eliminate the antibody's ability to bind to its target.
Reduced stability: Liabilities can compromise the structural integrity of the antibody, leading to aggregation, fragmentation, and a shorter shelf-life.
Safety concerns: Modified or aggregated antibodies can trigger an immune response in patients, a phenomenon known as immunogenicity.

The Antibody Sequence Liabilities block in Platforma.bio screens each antibody sequence for known liability motifs and classifies them by fixability — how tractable each liability is to engineer away. This lets you distinguish candidates that need a quick conservative substitution from those that require significant reengineering or are simply not viable.

Prerequisites

Before starting, ensure you have set up a project and samples from each round of your selection experiment have been imported and processed with a clonotyping block.

How to assess antibody sequence liabilities

This analysis is typically performed after you have processed your raw sequencing data, performed clonotyping, and identified enriched clusters.

Add the analysis block

Start from your project's analysis pipeline.

Click the + Add Block button on the left-hand panel.
In the "Add Block" window, find and select the Antibody Sequence Liabilities block.
Click Add to Project to add it to your pipeline.

Configure the settings

Once the block is added, configure its settings before running the analysis.

Select dataset: Use the dropdown to choose the input dataset — the output of a preceding block that contains your list of clonotypes, such as Redefine Clonotypes.
Predefined liabilities: A collapsible section lists all built-in liability definitions. Expand it to inspect each entry — its pattern, risk level, and fixability class. All predefined liabilities are active by default with the exception of Integrin binding (relevant primarily for therapeutic candidates going in vivo). Uncheck any liabilities you want to exclude.

To run only your custom liabilities and skip the predefined set entirely, uncheck Use predefined liabilities.
Custom liabilities: Add your own motifs using the Add button. Each custom liability requires a name, a regex pattern, a risk level, a fixability class, and the regions it should be applied to. See Custom liabilities below for details.
Click Run to start the analysis.

Interpreting the results

After the block finishes, it generates a table with one row per clonotype and several liability columns.

Score columns

Four columns replace the old single "Liabilities risk" column, each capturing a distinct aspect of the liability picture:

Column	Values	What it means
Is Productive	`Pass` / `Fail`	Fails when a stop codon or out-of-frame sequence is found. These are not engineering problems — the sequence is invalid. Hard exclusion in Lead Selection.
Structural liabilities	`None` / `Present`	Present when a structurally disqualifying liability is found — missing conserved cysteines or extra unpaired cysteines. Cannot be fixed by simple substitution. Hard exclusion in Lead Selection.
Developability risk	`None` / `Low` / `Medium` / `High`	The severity of fixable liabilities only (PTM motifs, fragmentation sites). Independent of structural liabilities. Default Lead Selection filter passes None/Low/Medium; High is filtered by default.
Developability cost	continuous float	Sum of engineering effort across all non-disqualifying liability hits, weighted by fixability class and region importance. Lower = easier to engineer. Used for ranking, not filtering.

The columns are independent: a sequence can have Structural liabilities: Present and Developability risk: High simultaneously if it carries both a missing cysteine and a deamidation motif. Each column surfaces its concern separately.

Per-region columns

The block also outputs per-region liability strings (CDR1 aa liabilities, CDR2 aa liabilities, etc.) and per-region risk levels (CDR1 aa risk, etc.). These show which specific motifs were detected in each region, useful for understanding exactly where a liability sits and how risky it is to engineer around.

Per-region risk reflects fixable liabilities only — structural and disqualifying issues are captured in the global score columns, not the regional risk.

Developability cost weights

The developability cost weights each liability hit by two factors:

Fixability weight — how hard the liability is to fix:

Class	Weight
Easily fixable	1
Fixable	3
Hard to fix	8
Structural	20

Region weight — how much the region contributes to binding:

Region	Weight
CDR3	1.5
CDR1, CDR2	1.2
FR1	1.0
FR2, FR3	0.5
FR4	0.3

A deamidation motif (fixability weight 3) in CDR3 (region weight 1.5) contributes 4.5 to the score. The same motif in FR4 contributes 0.9. This reflects the real engineering risk: fixing a CDR3 liability requires more care and has a higher chance of affinity loss than fixing the same liability in a framework region.

Reported sequence liabilities

The block screens for the following predefined liabilities.

Sequence validity

Liability	Motif	Fixability	Description
Stop codon	`*`	Disqualifying	Protein synthesis terminates prematurely. The sequence does not encode a viable antibody.
Out-of-frame	`_`	Disqualifying	A frameshift mutation produces a completely incorrect amino acid sequence. Not an engineering problem — the sequence is invalid.

Structural liabilities

Liability	Fixability	Description
Missing cysteines	Structural	Conserved disulfide bonds are absent. These bonds hold the antibody's core structure together; missing ones cannot be restored by simple substitution.
Extra cysteines	Hard to fix	An unpaired cysteine will bond with other antibody molecules, causing aggregation. The rogue cysteine must be identified and removed, which often requires structural modeling and risks reshaping the CDR.

Fixable liabilities

Liability	Motif	Risk	Fixability	Description
Deamidation	`N[GS]`	High	Fixable	Asparagine converts to aspartate, introducing a charge change and structural distortion. The GS context accelerates the reaction. Fix: N→Q, but this position is often a CDR contact — expect affinity testing.
Fragmentation	`DP`	High	Fixable	The D–P peptide bond is susceptible to cleavage under acidic conditions. Fix: D→E or P→A, but proline often drives loop conformation — removing it can reshape the binding site.
Isomerization	`D[DGHST]`	High	Fixable	Aspartate isomerizes to isoaspartate, inserting an extra methylene group into the backbone and causing conformational change. Fix: D→E.
N-linked glycosylation	`N[^P][ST]`	High	Fixable	Unintended sugar attachment in the variable region causes binding heterogeneity and manufacturing inconsistency. Fix: mutate any of the three positions to break the motif.
Hydrolysis	`NP`	Medium	Fixable	The N–P bond is prone to hydrolysis. Fix: N→Q or P→A.
Fragmentation	`TS`	Medium	Fixable	β-elimination under alkaline conditions can cleave this bond. Fix: T→S/A or S→A.
Deamidation	`N[AHNT]`	Medium	Easily fixable	Slower deamidation context than N[GS]. Fix: N→Q, usually safe.
Tryptophan oxidation	`W`	Medium	Easily fixable	Exposure to light or reactive oxygen species oxidizes the tryptophan side chain, reducing binding. Fix: W→Y/F — conservative substitution, low affinity risk except at CDR3 C-terminus.
Methionine oxidation	`M`	Medium	Easily fixable	Oxidized methionine can reduce binding affinity and, in Fc positions, shorten serum half-life. Fix: M→L/I — standard conservative substitution.
Deamidation	`[STK]N`	Low	Easily fixable	Weak deamidation context driven by the preceding residue. Very low spontaneous rate. Fix: N→Q if desired.

Integrin binding (off by default)

Liability	Motif	Risk	Fixability	Description
Integrin binding	`RGD\|RYD\|KGD\|NGR\|LDV\|DGE\|GPR`	Low	Easily fixable	These motifs mimic natural integrin ligands and can cause off-target cell adhesion in vivo. Relevant primarily for therapeutic candidates; less critical at discovery stage. Disabled by default.

Custom liabilities

You can define additional liability motifs beyond the predefined set. Custom liabilities are applied to the selected regions alongside predefined ones and participate in all four score columns.

Adding a custom liability in the UI

In the block settings, click Add in the Custom liabilities section.
Fill in the fields:
- Name — a unique label for this liability (must not match any predefined name)
- Pattern — a regex applied to the amino acid sequence (see Pattern syntax below)
- Risk level — Low, Medium, or High
- Fixability — Easy fix, Fixable, or Hard to fix
- Regions — which regions to apply this motif to (default: CDR1, CDR2, CDR3)

Pattern syntax

Patterns are standard regular expressions applied to single-letter amino acid sequences.

Construct	Example	Matches
Literal residue	`W`	Any tryptophan
Character class	`[GS]`	Glycine or serine
Negated class	`[^P]`	Any residue except proline
Sequence	`DP`	Aspartate followed by proline
Alternation	`RGD\|KGD`	RGD or KGD

Importing and exporting custom liabilities

For sharing or reusing a custom liability set across projects, use the Export and Import buttons in the custom liabilities section.

Export downloads your current custom entries as custom-liabilities.json. Import loads a JSON file and replaces the current custom entries, applying the same validation (duplicate names, invalid patterns, empty regions).

The JSON format is an array of objects:

[
  {
    "name": "My motif",
    "pattern": "N[GS]",
    "riskLevel": "High",
    "fixability": "fixable",
    "regions": ["CDR1", "CDR2", "CDR3"]
  },
  {
    "name": "Integrin mimic (custom)",
    "pattern": "RGD|KGD",
    "riskLevel": "Low",
    "fixability": "easily_fixable",
    "regions": ["CDR1", "CDR2", "CDR3", "FR1"]
  }
]

Field reference:

Field	Type	Allowed values
`name`	string	Any unique text; must not match a predefined liability name
`pattern`	string	Valid regex in single-letter amino acid notation
`riskLevel`	string	`"Low"` \| `"Medium"` \| `"High"`
`fixability`	string	`"easily_fixable"` \| `"fixable"` \| `"hard_to_fix"`
`regions`	array	One or more of `"CDR1"` `"CDR2"` `"CDR3"` `"FR1"` `"FR2"` `"FR3"` `"FR4"`

To run only your custom set without the predefined liabilities, uncheck Use predefined liabilities in the settings panel.

Downstream analysis: visualizing liabilities

The liability data can be used in downstream selection and visualization blocks. A common application is mapping liability scores onto the Clonotype Space UMAP to see how they relate to enrichment and sequence diversity.

Go to your Clonotype Space block.
Open the Data Mapping panel on the right.
Use the Grouping / Color dropdown to select Developability cost — this colors each point by its engineering effort score.
Use the Highlight dropdown to select Developability risk and choose the values you want to emphasize, such as None, Medium, and High, to surface candidates across the risk spectrum.

The screenshot below shows a UMAP colored by Developability cost with Developability risk highlighted for None, Medium, and High — making it easy to spot highly enriched clonotypes with low engineering burden.

Prerequisites​

How to assess antibody sequence liabilities​

Add the analysis block​

Configure the settings​

Interpreting the results​

Score columns​

Per-region columns​

Developability cost weights​

Reported sequence liabilities​

Sequence validity​

Structural liabilities​

Fixable liabilities​

Integrin binding (off by default)​

Custom liabilities​

Adding a custom liability in the UI​

Pattern syntax​

Importing and exporting custom liabilities​

Downstream analysis: visualizing liabilities​