As AI applications expand beyond text, the ability to work with image, video, and other modalities is becoming a must-have skill. Building effective multimodal systems requires not only the right models, but also the right infrastructure to store, retrieve, and serve diverse data types at scale.
In this code-along, Apoorva Joshi, a Senior AI Developer Advocate at MongoDB, will teach you how to build a simple multimodal AI application using MongoDB and Voyage AI. You’ll learn how to structure and query image and video data, apply retrieval techniques with Voyage AI, and connect everything in a functional pipeline. This session is ideal for data scientists and AI engineers looking to expand their application-building toolkit.
Presenter Bio
Apoorva JoshiSenior AI Developer Advocate at MongoDB
Apoorva is a Data Scientist turned Developer Advocate, with over 7 years of experience applying machine learning to problems in domains such as cybersecurity and mental health. As an AI Developer Advocate at MongoDB, she now helps developers be successful at building AI applications through written content and hands-on workshops.