Darius Knowledge Hub

Home

❯

Multimodal Models

Multimodal Models

Jan 21, 20261 min read

  • knowledge

Multimodal Models


Created: =dateformat(this.file.ctime,"dd MMM yyyy, hh:mm a") | Modified: =dateformat(this.file.mtime,"dd MMM yyyy, hh:mm a") Tags: knowledge


Overview

Related fields

  • Transformers

Introduction


Theoretical References

Papers

  • [2309.10020] Multimodal Foundation Models: From Specialists to General-Purpose Assistants
  • BEiT

Articles

  • Multimodality and Large Multimodal Models (LMMs) - Chip Huyen

Courses

  • Large Multimodal Models - Chunyuan CVPR 2023
    • [CVPR2023 Tutorial Talk] Large Multimodal Models: Towards Building and Surpassing Multimodal GPT-4 - YouTube
    • [2306.14895] Large Multimodal Models: Notes on CVPR 2023 Tutorial

Code References

Methods

Tools, Frameworks

  • GitHub - salesforce/LAVIS: LAVIS - A One-stop Library for Language-Vision Intelligence


Graph View

  • Multimodal Models
  • Overview
  • Related fields
  • Introduction
  • Theoretical References
  • Papers
  • Articles
  • Courses
  • Code References
  • Methods
  • Tools, Frameworks

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Discord Community