[codex] Add AF3 layout and Boltz-2 parser support #12
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 00959778bf
    logger.warning(
        "No recognised PAE keys in confidences.json/summary_confidences.json; "
        "using default PAE=100 for all residue pairs."
    )
Raise on unknown AF3 confidence schema
Falling back to an all-100 PAE matrix here silently turns missing/invalid AF3 confidence files into apparently valid results, which can produce misleading interface metrics instead of failing fast. This is triggered whenever confidences.json/summary_confidences.json is absent or in an unsupported format (for example, a layout mismatch where _read_json returns {}), and the run is then scored with fabricated PAE values rather than being skipped with an error as before.
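A minimal sketch of the fail-fast behaviour suggested above, not the project's actual parser. The function name `load_pae`, the exception class `UnknownConfidenceSchema`, and the key names in `PAE_KEYS` are hypothetical; only the two file names come from the warning message.

```python
import json
from pathlib import Path

# Hypothetical key names; the real AF3 confidence schema may use others.
PAE_KEYS = ("pae", "predicted_aligned_error")


class UnknownConfidenceSchema(ValueError):
    """Raised when no recognised PAE key is found in the confidence files."""


def load_pae(job_dir: Path) -> list:
    """Return the PAE matrix from the first confidence file that contains one.

    Raises UnknownConfidenceSchema instead of fabricating a default matrix.
    """
    for name in ("confidences.json", "summary_confidences.json"):
        path = job_dir / name
        if not path.is_file():
            continue
        data = json.loads(path.read_text())
        if not isinstance(data, dict):
            continue
        for key in PAE_KEYS:
            if key in data:
                return data[key]
    raise UnknownConfidenceSchema(
        f"No recognised PAE keys in confidence files under {job_dir}"
    )
```

A caller can catch `UnknownConfidenceSchema`, log it, and skip the run, so that no interface metrics are computed from placeholder values.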
Summary
This PR fixes the two parser compatibility gaps reported in issues #3 and #11:
- <input>_model_<rank>.cif, confidence_<input>_model_<rank>.json, optional PAE/pLDDT NPZ files)
- <job>_ranking_scores.csv and prefixed per-sample files
- 6OGE_ABC_DSSO_CDI seed/model plus focused regression tests
- 1.0.2

Closes #3.
Fixes #11.
Notes
Vanilla AlphaFold3 writes one sanitized job directory under the selected --output_dir. Users passing the parent output directory should still run AlphaJudge with --recursive, or pass the job directory directly.

Boltz-2 protein-ligand outputs can include trailing small-molecule tokens in PAE/pLDDT arrays. AlphaJudge now trims larger Boltz-2 arrays to the protein/nucleic residue block it scores.
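The Boltz-2 trimming described above can be sketched as follows. This is an illustration under stated assumptions, not AlphaJudge's implementation: the function name `trim_to_residue_block` is hypothetical, the scored residues are assumed to come first with ligand tokens trailing, and plain lists stand in for the NPZ arrays (with NumPy the same idea would be `pae[:n, :n]` and `plddt[:n]`).

```python
def trim_to_residue_block(pae, plddt, n_residues):
    """Keep only the leading n_residues block of the PAE matrix and pLDDT vector.

    Assumes protein/nucleic residues come first and small-molecule tokens trail.
    Raises ValueError if the arrays are smaller than the residue count.
    """
    if len(plddt) < n_residues or len(pae) < n_residues:
        raise ValueError("confidence arrays are smaller than the residue count")
    if any(len(row) < n_residues for row in pae[:n_residues]):
        raise ValueError("PAE rows are shorter than the residue count")
    # Slice rows first, then columns, to drop the trailing ligand tokens.
    trimmed_pae = [list(row[:n_residues]) for row in pae[:n_residues]]
    trimmed_plddt = list(plddt[:n_residues])
    return trimmed_pae, trimmed_plddt


# Two scored residues followed by one ligand token.
pae = [[0, 1, 9], [1, 0, 9], [9, 9, 0]]
plddt = [90, 80, 10]
trimmed_pae, trimmed_plddt = trim_to_residue_block(pae, plddt, 2)
```

Raising when the arrays are smaller than expected keeps a genuine mismatch from being masked by the trim.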
Validation
Result: 2 passed in 12.54s.

Additional parser smoke checks:
af3 5 seed-19698302_sample-1
boltz2 6OGE_ABC_DSSO_CDI_Boltz2_model_0 (1058, 1058) 1058 0.8897088170051575