OptionDecoder.
RESEARCH · 2026 · 60 SETS · 11 ML MODELS · 1 TRUTH

DÉCODE
the option patterns

A multi-exam decoder for multiple-choice option distribution — 60 sets / 11 ML models / cross-era strict audit.
The truth about "folklore" rules: "3-long-1-short pick shortest" is the only weak rule validated on both exams (+2~5%).

CET-430 sets · 20.95%
CET-636 sets · 27-30%
ML models11
Audits12
LicenseMIT
SECTION 01 · CET-4 SITE

College English Test Band 4
30 sets · 6 ML models

0
%
Realistic
expected hit rate
vs random 18.77% · net +2.18%
0
Sets
0
Questions
0
Options
0
Models
0
Effective rules
0
Eliminated
0
Eras
0
L+R 3-long-1-short
01

Three Eras
2017-2018 · 2019-2020 · 2021-2023

02

v4.1 → v4.6
3-long-1-short heuristic wins

03

Folklore audit
5 eliminated / 8 kept

04

6 audits
99.7% A parser bug · 3-long-1-short real effect

05

Basic stats visualization
letter dist · length bias · position bias

Note: CET-4 site has no figures (the 27 belong to CET-6).

06

Complete code & data
5 Python scripts / 4 reports / 1 source list

OptionDecoder/
├── README.md              # Repo overview
├── data/cet4/             # 6 JSON (rules/models/eras/audit/stats/folklore)
├── src/cet4/              # 5 Python scripts
│   ├── utils/                # fetcher / parser / basic_stats / folklore_audit / final_audit
│   └── v4.1_baseline/       # v4_ml_loo.py — 5-fold cross-era
├── analysis/cet4/         # ml_v4_loo / ml_v4_summary / final_audit
├── reports/cet4/          # README / FINAL_STRATEGY_v46 / AUDIT_REPORT_v46 / ELIMINATED_METHODS
└── plans/cet4/sources.md   # 30 set URLs