chore: polish benchmark doc #839
Conversation
Codecov Report
@@           Coverage Diff            @@
##           clip-benchmark     #839   +/-   ##
===============================================
  Coverage           81.58%   81.58%
===============================================
  Files                  21       21
  Lines                1575     1575
===============================================
  Hits                 1285     1285
  Misses                290      290

Flags with carried forward coverage won't be shown.
docs/user-guides/benchmark.rst (outdated)
-Zero-shot retrieval
+Zero-shot Retrieval
 +++++++++++++++++++

 In zero-shot retrieval benchmark, each model is evaluated on the following datasets: `COCO Caption <https://github.com/tylin/coco-caption>`_, `Flickr8k <http://hockenmaier.cs.illinois.edu/8k-pictures.html>`_ and `Flickr30k <https://shannon.cs.illinois.edu/DenotationGraph/>`_.
Suggested change:

-In zero-shot retrieval benchmark, each model is evaluated on the following datasets: `COCO Caption <https://github.com/tylin/coco-caption>`_, `Flickr8k <http://hockenmaier.cs.illinois.edu/8k-pictures.html>`_ and `Flickr30k <https://shannon.cs.illinois.edu/DenotationGraph/>`_.
+In the zero-shot retrieval benchmark, each model is evaluated on the following datasets: `COCO Caption <https://github.com/tylin/coco-caption>`_, `Flickr8k <http://hockenmaier.cs.illinois.edu/8k-pictures.html>`_ and `Flickr30k <https://shannon.cs.illinois.edu/DenotationGraph/>`_.
+The best results are highlighted in bold (higher is better).
docs/user-guides/benchmark.rst (outdated)
@@ -151,7 +154,7 @@
 From the table, we observe that the ViT models outperform the RN models in general.
 More specifically, the ``ViT-H-14::laion2b_s32b_b79k`` model and ``ViT-g-14::laion2b_s12b_b42k`` model achieve the best and second-best results on all zero-shot retrieval tasks.
 For ViT models, the results of the same base model are better on those pre-trained with larger datasets (e.g., ``ViT-B-32::openai`` vs ``ViT-B-32::laion400m_e31`` vs ``ViT-B-32::laion2b-s34b-b79k``).

-Zero-shot classification
+Zero-shot Classification
 ++++++++++++++++++++++++

 In zero-shot classification benchmark, each model is evaluated on the following datasets: `ImageNetV2 <https://github.com/modestyachts/ImageNetV2>`_, `VOC2007 <http://host.robots.ox.ac.uk/pascal/VOC/voc2007/>`_ and 19 `VTAB datasets <https://github.com/google-research/task_adaptation>`_.
Suggested change:

-In zero-shot classification benchmark, each model is evaluated on the following datasets: `ImageNetV2 <https://github.com/modestyachts/ImageNetV2>`_, `VOC2007 <http://host.robots.ox.ac.uk/pascal/VOC/voc2007/>`_ and 19 `VTAB datasets <https://github.com/google-research/task_adaptation>`_.
+In the zero-shot classification benchmark, each model is evaluated on the following datasets: `ImageNetV2 <https://github.com/modestyachts/ImageNetV2>`_, `VOC2007 <http://host.robots.ox.ac.uk/pascal/VOC/voc2007/>`_ and 19 `VTAB datasets <https://github.com/google-research/task_adaptation>`_.
+The best results are highlighted in bold (higher is better).
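As an aside (not part of the diff above), the "best results are highlighted in bold" convention can be expressed in reStructuredText with strong-emphasis markup inside a table cell; a minimal sketch, assuming a ``list-table`` layout and using placeholder scores rather than real benchmark numbers:

 .. list-table:: Illustrative layout only; scores are placeholders, not benchmark results
    :header-rows: 1

    * - Model
      - ImageNetV2 top-1
    * - ``ViT-B-32::openai``
      - 0.56
    * - ``ViT-H-14::laion2b_s32b_b79k``
      - **0.71**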
📝 Docs are deployed on https://ft-polish-clip-benchmark--jina-docs.netlify.app 🎉
…832)

* docs: clip benchmark on zeroshot classification and retrieval tasks
* docs: add label
* docs: introduction
* docs: open clip naming convention
* fix: typo
* docs: retrieval table
* docs: update classification
* chore: test html table
* chore: update css
* chore: test rst
* chore: test rst
* chore: test
* fix: use rst in benchmark
* fix: typo
* fix: rst
* fix: rst
* fix: subtitle
* docs: classification benchmark
* docs: highlight retrieval
* docs: highlight retireval
* docs: highlight classification
* docs: remove redundancy
* docs: add links
* fix: link
* docs: update section
* docs: datasets description
* docs: add datasets description
* docs: format
* docs: footnote
* docs: add QPS
* docs: improve conclusion
* docs: update machine config
* docs: update software version
* chore: polish benchmark doc (#839)
* chore: update benchmark intro
* chore: minor revision
* chore: minor revision
* chore: minor revision
* chore: minor revision
* chore: minor revision
* chore: minor revision

Co-authored-by: felix-wang <[email protected]>
No description provided.