Month: August 2023

  • [R] Compressing vision-language and unimodal Transformers via structured pruning

    [R] Compressing vision-language and unimodal Transformers via structured pruning

    [ad_1] 🚀 Code: https://github.com/sdc17/UPop 📑 Paper: https://proceedings.mlr.press/v202/shi23e/shi23e.pdf 🧐 A Quick Look What is it: UPop is the first structured pruning framework for vision-language Transformers. It enables effective structured pruning on various multi-modal & uni-modal tasks (including Visual Reasoning, Image Captioning, Visual Question Answer, Image-Text Retrieval, Text-Image Retrieval, Image Classification and Image Segmentation), datasets (including NLVR2,…