Blog

The Annotated CLIP (Part-1)

Learning Transferable Visual Models From Natural Language Supervision

Multimodal

Transformers

This post is part-1 of the two series blog posts on CLIP. In this blog, we present an Introduction to CLIP in an easy to digest manner. We also compare CLIP to other research papers and look at the background and inspiration behind CLIP.

Mar 3, 2023

Aman Arora

Categories

The Annotated CLIP (Part-1)