【マルチモーダル】Vision-Language #まとめ編

Index Index Vision-Language 一方向型と双方向型 アルゴリズム CLIP / 2021 A Large-scale ImaGe and Noisy-text embedding / ALIGN / 2021 Uni-Perceiver / 2021 Uni-Perceiver-MoE / 2022 Uni-Perceiver v2 / 2022 Unified-IO / 2022 Flamingo / 2022 Textless Vision-Language Transformer / TVL / 2022 Multi-modal G…