# IBM_VizWiz

**Repository Path**: mirrors_ibm/IBM_VizWiz

## Basic Information

- **Project Name**: IBM_VizWiz
- **Description**: Resources related to our winning entry to the 2020 VizWiz Grand Challenge.
- **Primary Language**: Unknown
- **License**: Apache-2.0
- **Default Branch**: main
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2020-12-23
- **Last Updated**: 2025-09-21

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# Winning Entry to the 2020 VizWiz Grand Challenge

Image Captioning as an Assistive Technology: Lessons Learned from VizWiz 2020 Challenge.

This repository contains resources related to our work on the [VizWiz 2020 Image Captioning](https://vizwiz.org/tasks-and-datasets/image-captioning/) challenge. You can find details of our work in [this paper](https://arxiv.org/abs/2012.11696), and also in [this blog post](https://www.ibm.com/blogs/research/2020/07/image-captioning-assistive-technology/).

***

## Resources

### Multimodal Assistive Captioner System

![System Pipeline](/images/pipeline.png)

#### Real-Time Demo (Video Length: ~2 mins)

Demo

#### Presentation (Video Length: ~5 mins)

Presentation

***

## Citations

If you find the work useful, please consider citing the following articles.

```
@misc{dognin2020image,
  title={Image Captioning as an Assistive Technology: Lessons Learned from VizWiz 2020 Challenge},
  author={Pierre Dognin and Igor Melnyk and Youssef Mroueh and Inkit Padhi and Mattia Rigotti and Jarret Ross and Yair Schiff and Richard A. Young and Brian Belgodere},
  year={2020},
  eprint={2012.11696},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}
```

```
@misc{dognin2020alleviating,
  title={Alleviating Noisy Data in Image Captioning with Cooperative Distillation},
  author={Pierre Dognin and Igor Melnyk and Youssef Mroueh and Inkit Padhi and Mattia Rigotti and Jarret Ross and Yair Schiff},
  year={2020},
  eprint={2012.11691},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}
```