Coco Cn Save

Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks

Project README

COCO-CN

COCO-CN is a bilingual image description dataset enriching MS-COCO with manually written Chinese sentences and tags. The new dataset can be used for multiple tasks including image tagging, captioning and retrieval, all in a cross-lingual setting.

Chinese sentences	COCO-CN train	COCO-CN val	COCO-CN test
human written	:white_check_mark:	:white_check_mark:	:white_check_mark:
human translation	:x:	:x:	:white_check_mark:
machine translation (baidu)	:white_check_mark:	:white_check_mark:	:white_check_mark:

Progress

version 201805: 20,341 images (training / validation / test: 18,341 / 1,000 / 1,000), associated with 22,218 manually written Chinese sentences and 5,000 manually translated sentences. Data is freely available upon request. Please submit your request via Google Form.
Precomputed image features: ResNext-101
COCO-CN-Results-Viewer: A lightweight tool to inspect the results of different image captioning systems on the COCO-CN test set, developed by Emiel van Miltenburg at the Tilburg University.
NUS-WIDE100: An extra test set.

2018-12-16: Code for cross-lingual image tagging and captioning released.
2018-12-20: Code for cross-lingual image retrieval and our image annotation system released.
2019-01-13: The COCO-CN paper accepted as a regular paper by the T-MM journal.
2021-02-03: Release of new annotations (4,573 images and 4,712 manually written sentences) collected via our iCap interactive image captioning System. The images have no overlap with the prevously released dataset.

Citation

If you find COCO-CN useful, please consider citing the following paper:

Xirong Li, Chaoxi Xu, Xiaoxu Wang, Weiyu Lan, Zhengxiong Jia, Gang Yang, Jieping Xu, COCO-CN for Cross-Lingual Image Tagging, Captioning and Retrieval, IEEE Transactions on Multimedia, Volume 21, Number 9, pages 2347-2360, 2019

Open Source Agenda is not affiliated with "Coco Cn" Project. README Source: li-xirong/coco-cn

Stars

167

Open Issues

Last Commit

1 year ago

Repository

li-xirong/coco-cn

License

MIT

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/coco-cn"><img src="https://www.opensourceagenda.com/projects/coco-cn/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022