Gradio

📋 Task Overview

This demo supports four evaluation tasks. Select one to get started:

Task	Description
Pointwise – Image Editing	Rate a single edited image against its source image and the editing instruction. Produces per-aspect scores and a refined request.
Pointwise – T2I Generation	Rate a single generated image against a text-to-image prompt. Produces per-aspect scores and a refined prompt.
Pairwise – Image Editing	Compare two edited images (A vs B) given a source image and editing instruction. Determines which edit is better per aspect.
Pairwise – T2I Generation	Compare two generated images (A vs B) given a text-to-image prompt. Determines which generation is better per aspect.
Prompt Tuning – Image Editing	Generate an edit using Flux (Kontext) from a source image and instruction, then evaluate it. Use the refinement to tune your prompt.

Try the examples below - they're basically begging to be clicked! 🎯

Task Type

Select the evaluation task

Source Image

Edited Image

Editing Instruction

Examples

Task Type	Editing Instruction	Source Image	Edited Image	Image B

Evaluation Result

Terms of use

By using this service, users are required to agree to the following terms: The service is a research preview intended for non-commercial use only. It only provides limited safety measures and may generate offensive content. It must not be used for any illegal, harmful, violent, racist, or sexual purposes. The service may collect user dialogue data for future research.

We deploy our model backend with SGLang. However, there could be congestion during the serving process, leading to delayed responses. If you encounter any issues with the webpage, kindly refresh it.

License

The service is a research preview and is subject to the License of Qwen2, the License of LLaVA-NEXT, and the Terms of Use governing the data generated by OpenAI. Users are required to strictly adhere to the terms outlined in these licenses. Please contact us if you identify any potential violations.

Citation

@article{RationalRewards,
      title={RationalRewards: Empowering Image Synthesis with Rationalized Reward Models},
      author = {Wang, Haozhe and Qu, Chao and Huang, Zuming and Chu, Wei and Lin,Fangzhen and Chen, Wenhu},
      journal={arXiv preprint arXiv:2504.08837},
      year={2025}
}

RationalRewards: Empowering Image Synthesis with Rationalized Reward Models

Model | Code | Dataset | Homepage

📋 Task Overview

Terms of use

License

Citation