Full product evaluation loop
Define realistic scenarios for every major capability, fix failures, and rerun the whole evaluation set.
Use this to test a product as users actually experience it.
- List realistic scenarios that cover the major capabilities.
- Define pass/fail checks or scoring criteria before testing.
- Run each scenario under consistent conditions.
- Record screenshots, logs, outputs, and failure notes.
- Fix root causes for failed scenarios.
- Rerun affected scenarios, then rerun the full set.
Stop when the full scenario set passes against the original criteria.