Our GitHub page: https://github.com/Q-Future

Please use the Hugging Face (HF) versions of the Q-Future benchmark datasets.

from datasets import load_dataset

ds = load_dataset("q-future/Q-Bench-HF") # or A-Bench-HF, Q-Bench2-HF
ds["dev"][0] # Each record contains images (as PIL.ImageFile), questions, and answers
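
To check what a record actually contains before writing evaluation code, a small helper along the following lines may be useful. This is only a sketch: `describe_record` and `load_dev_split` are hypothetical names, not part of any Q-Future release, and the stand-in record uses made-up field names rather than the real dataset schema.

```python
from typing import Any, Mapping


def describe_record(record: Mapping[str, Any]) -> dict:
    """Map each field name in a record to the name of its Python type."""
    return {name: type(value).__name__ for name, value in record.items()}


def load_dev_split(repo_id: str = "q-future/Q-Bench-HF"):
    """Load the "dev" split; needs the `datasets` package and network access."""
    # Imported lazily so that describe_record can be used offline.
    from datasets import load_dataset

    return load_dataset(repo_id, split="dev")


# Stand-in record with hypothetical field names, used only to show the helper.
example = {"question": "Is the image sharp?", "answer": "No"}
print(describe_record(example))
```

With network access, `describe_record(load_dev_split()[0])` should list the real field names and their types for the first "dev" record.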

Our Spaces

Many thanks for the research GPU grants!

Our Mainstream Models

Although we have released other model variants so that the community can replicate our results, please use the models listed above, as they have proven to perform more stably.