Date: 2026-04-03
Target: PD-L1 (02_PDL1) |
Stage: Generate only |
DEMO RUN
Key Findings
1. Pipeline works end-to-end.
Generated 2 binder candidates for PD-L1 in 82.92 seconds on 1× H100; AF2 reward scoring completed successfully.
2. Both samples fail production thresholds — expected at 100 steps with no refinement.
i_PAE × 31 = 25.76 and 27.32 (need ≤ 7.0).
pLDDT = 0.212 and 0.239 (need ≥ 0.9).
3. Sample 0 (n=262) is the better candidate:
lower i_PAE (0.831 vs 0.881),
higher i_pTM (0.147 vs 0.068),
much lower RMSD (11.25 Å vs 45.21 Å).
4. Reward = -i_PAE only.
All other reward weights (con, plddt, dgram_cce, etc.) are 0.0.
Consider multi-objective reward for production runs.
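Finding 4 can be made concrete with a minimal sketch of a weighted multi-objective reward. This is illustrative only: the function name and the non-zero weight values are hypothetical (the demo run used reward = -i_PAE, with all other weights at 0.0), and the metric keys mirror the AF2 outputs reported above.

```python
# Hedged sketch: combine AF2 metrics into one scalar reward.
# The demo run effectively used weights = {"i_pae": -1.0}; the extra
# non-zero weights below are illustrative placeholders, not tuned values.
def multi_objective_reward(metrics, weights=None):
    weights = weights or {
        "i_pae": -1.0,  # lower interface PAE is better, so negative weight
        "plddt": 0.5,   # reward confident structures
        "i_ptm": 0.5,   # reward confident interfaces
    }
    return sum(w * metrics.get(k, 0.0) for k, w in weights.items())

# Per-sample metrics from this run (Key Findings, item 3)
r0 = multi_objective_reward({"i_pae": 0.831, "plddt": 0.212, "i_ptm": 0.147})
r1 = multi_objective_reward({"i_pae": 0.881, "plddt": 0.239, "i_ptm": 0.068})
```

Under this sketch Sample 0 still ranks above Sample 1, consistent with the i_PAE-only ranking in the findings.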
Metrics Comparison
[Figure: key AF2 metrics for both generated binder candidates]
Quality Assessment
[Figure: demo values (blue/orange) vs production pass-rate thresholds (red dashed)]
| Criterion | Sample 0 (n=262) | Sample 1 (n=234) | Threshold | Status |
|---|---|---|---|---|
| i_PAE × 31 (binding) | 25.76 | 27.32 | ≤ 7.0 | FAIL |
| pLDDT (structure) | 0.212 | 0.239 | ≥ 0.9 | FAIL |
| scRMSD (designability) | N/A | N/A | < 1.5 Å | — |

(scRMSD not computed: the evaluate stage was not run.)
Sample Profile
[Figure: normalized radar chart comparing structural quality metrics across both samples]
Timing & Configuration
[Figure: left, generation timing; right, demo config vs recommended production settings]
| Parameter | Demo | Production | Impact |
|---|---|---|---|
| Diffusion steps | 100 | 400 | 4× better convergence |
| Samples | 2 | 32–64 | Cover binder length space |
| Replicas | 1 | 4–16 | More candidates per length |
| Search | best-of-n | beam-search | Smarter exploration |
| Refinement | None | sequence_hallucination | Key to SOTA quality |
| GPUs | 1× H100 | 4–8× H100 | Parallel generation |
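The demo-vs-production settings above can be written out as config dicts to estimate how much more compute the production run needs. The key names are illustrative, mirroring the table rather than any specific pipeline's config schema, and the production values take the low end of each recommended range.

```python
# Hedged sketch: demo vs production settings from the table above.
# Key names are illustrative, not a real pipeline's config schema;
# production values use the low end of each recommended range.
DEMO = {
    "diffusion_steps": 100,
    "samples": 2,
    "replicas": 1,
    "search": "best-of-n",
    "refinement": None,
    "gpus": 1,
}
PRODUCTION = {
    "diffusion_steps": 400,
    "samples": 32,           # table recommends 32-64
    "replicas": 4,           # table recommends 4-16
    "search": "beam-search",
    "refinement": "sequence_hallucination",
    "gpus": 4,               # table recommends 4-8x H100
}

# Rough relative sampling cost (steps x samples x replicas), ignoring
# refinement overhead and assuming cost scales linearly in each factor.
scale_up = (
    PRODUCTION["diffusion_steps"] * PRODUCTION["samples"] * PRODUCTION["replicas"]
) / (DEMO["diffusion_steps"] * DEMO["samples"] * DEMO["replicas"])
```

Even at the low end, this is roughly a 256× increase in raw sampling work, which is why the table also scales GPUs from 1 to 4–8.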
Next Steps
- Production run: 400 steps, 32 samples, beam-search, sequence_hallucination refinement on PD-L1
- Full pipeline: add filter → evaluate → analyze for designability and pass-rate metrics
- Multi-target: run on PD-1, IFNAR2, CD45 to compare difficulty across targets
- Search algorithm sweep: compare best-of-n vs beam-search vs MCTS