Files

* Update Readme.md

* Update Readme.md

* Update Readme.md

* Update Readme.md

* Update Readme.md

* Update Readme.md

* Update Readme.md

* Update Readme.md

* Update Readme.md

* Update Readme.md

* Update Readme.md

* Update Readme.md

* Update Readme.md

* remove submodule

* add mPLUG MiniGPT4

* Update Readme.md

* Update Readme.md

* Update Readme.md

---------

Co-authored-by: Yuliang Liu <34134635+Yuliang-Liu@users.noreply.github.com>

2023-06-01 09:57:03 +08:00

1.7 KiB

Raw Blame History

OwlEval

We have compiled some examples and their corresponding questions from recent open-source work, and organized them into OwlEval.

Following we will introduce the OwlEval and the data format in this document.

Data Format

questions

questions.jsonl contains case images and information about their corresponding questions

Each row contains the following field:

image: Indicates the name of the picture
question_id: Indicate the question id number, there are 82 questions
question: Represents specific problem information
type：Indicate whether the problem is a single-turn problem or a multi-turn problem

For example:

{"image": "1.jpg", "question_id": 1, "question": "What is funny about this image? Describe it panel by panel.", "type": ["single"]}

answer

This contains the responses of each model for each question, integrated into six jsonl:

llava_13b_answer.jsonl

minigpt4_13b_answer.jsonl

MMreact_answer.jsonl

mPLUG_Owl_7b_answer.jsonl

BLIP2_13b_answer.jsonl

openflamingo_answer.jsonl

For each answer/xxx.jsonl it contains the following information:

image: Indicates the name of the picture
question_id: Indicate the question id number, there are 82 questions
question: Represents specific problem information
answer: Replie given by the model
model_id: The ID of the model the answer is generated by