Beam Search Sweep Analysis

Key Findings

1. E_fine_selector dominates. Won 7/10 proteins by best single sample and 7/10 by mean reward. Its frequent pruning (6 checkpoints, 3 branches) consistently finds the best binders.

2. D_early_brancher drops to 4th with full data (-0.2792). Its earlier lead was biased by only completing easier proteins. Frequent pruning beats coarse exploration.

3. B_length_explorer is worst (-0.3501 mean reward). 16 lengths with 2 beams each lacks depth. Length diversity doesn't compensate for shallow search.

4. Best binding prediction: IFNAR2 with i_PAE = 0.130 (D_early_brancher). Hardest target: BetV1 at best i_PAE = 0.205.

5. Lookaheads beat finals in all configs - the search is not optimally selecting intermediate candidates. Room for improvement in pruning/scoring.

Configuration Rankings

Left: mean reward. Center: wins by best sample. Right: wins by mean reward.

Rank	Config	Strategy	Mean Reward	Wins (best)	Wins (mean)
1	E_fine_selector	6 checkpoints, 3 branches	-0.2475	7	7
2	A_balanced	Paper default	-0.2571	0	2
3	C_deep_exploiter	2 lengths, 16 beams	-0.2700	1	1
4	D_early_brancher	2 checkpoints, 8 branches	-0.2792	2	0
5	B_length_explorer	16 lengths, 2 beams	-0.3501	0	0

Rank

Config

Strategy

Mean Reward

Wins (best)

Wins (mean)

E_fine_selector

6 checkpoints, 3 branches

-0.2475

A_balanced

Paper default

-0.2571

C_deep_exploiter

2 lengths, 16 beams

-0.2700

D_early_brancher

2 checkpoints, 8 branches

-0.2792

B_length_explorer

16 lengths, 2 beams

-0.3501

Protein Difficulty

Easiest to hardest protein targets. Green < 0.16 (easy), Orange 0.16-0.22 (medium), Red > 0.22 (hard).

Difficulty	Protein	Best i_PAE	Best Config
Easy	IFNAR2	0.130	D_early_brancher
Easy	PD1	0.138	E_fine_selector
Easy	PDL1	0.150	C_deep_exploiter
Easy	CrSAS6	0.156	E_fine_selector
Medium	CD45	0.159	E_fine_selector
Medium	SpCas9	0.168	E_fine_selector
Medium	DerF7	0.175	E_fine_selector
Medium	Claudin1	0.198	C_deep_exploiter
Hard	BetV1	0.205	E_fine_selector
Hard	HER2_AAV	0.222	E_fine_selector

Difficulty

Protein

Best i_PAE

Best Config

Easy

IFNAR2

0.130

D_early_brancher

Easy

PD1

0.138

E_fine_selector

Easy

PDL1

0.150

C_deep_exploiter

Easy

CrSAS6

0.156

E_fine_selector

Medium

CD45

0.159

E_fine_selector

Medium

SpCas9

0.168

E_fine_selector

Medium

DerF7

0.175

E_fine_selector

Medium

Claudin1

0.198

C_deep_exploiter

Hard

BetV1

0.205

E_fine_selector

Hard

HER2_AAV

0.222

E_fine_selector

Next Steps

Add refinement to E_fine_selector: sequence_hallucination is expected to dramatically improve pLDDT and i_PAE

Fix D_early_brancher failures: Reduce n_branch from 8 to 6 to avoid OOM on larger targets

Full pipeline on top candidates: filter → evaluate → analyze for designability (scRMSD, ProteinMPNN)

Hybrid config: Combine E's frequent pruning with D's wide branching (n_branch=6, 4 checkpoints)

Appendix: Configuration Details

Config	nsamples	beam_width	n_branch	Checkpoints	Finals	Total PDBs
A_balanced	4	8	4	[0,100,200,300,400]	32	544
B_length_explorer	16	2	4	[0,100,200,300,400]	32	544
C_deep_exploiter	2	16	4	[0,100,250,400]	32	416
D_early_brancher	4	8	8	[0,200,400]	32	544
E_fine_selector	4	8	3	[0,65,130,200,270,340,400]	32	608

All: 400 diffusion steps, beam-search, AF2 multimer reward (i_pae=-1.0), batch_size=8, seed=5

Appendix: Execution Summary

First pass: 50 launched, 33 completed, 17 failed (189.6 min)
Retry pass: 17 retried, 11 completed, 6 still failed (92.5 min)
Total data: 50/50 experiments complete (all retried successfully)
Failure pattern: D_early_brancher required most retries (6/10). GPU memory from n_branch=8
Total samples: 26,560 (1,600 finals + 24,960 lookaheads)

Appendix: Per-Config Per-Protein Mean Reward

Protein	A_balanced	B_length_exp	C_deep_exp	D_early_br	E_fine_sel
PD1	0.000	-0.238	0.000	-0.161	-0.156
PDL1	-0.184	-0.210	-0.172	-0.191	-0.176
IFNAR2	-0.148	-0.154	-0.150	-0.148	-0.146
CD45	-0.244	-0.579	-0.274	-0.361	-0.365
Claudin1	-0.225	-0.252	-0.226	-0.224	-0.212
CrSAS6	-0.178	-0.266	-0.190	-0.187	-0.178
DerF7	-0.200	-0.262	-0.202	-0.201	-0.192
BetV1	-0.790	-0.650	-0.893	-0.760	-0.598
SpCas9	-0.316	-0.398	-0.306	-0.266	-0.213
HER2_AAV	-0.288	-0.493	-0.287	-0.294	-0.239

Generated by Analyst Team — 2026-04-05 — Full analysis (markdown)

Beam Search Configuration Sweep

Key Findings

Config x Protein Heatmaps

Configuration Rankings

Per-Protein Comparison

Protein Difficulty

Compute Budget

Next Steps