MJRoBot (Marcelo Rovai)
Published © MIT

Vision-Language Models (VLM) at the Edge

We will learn Vison-Language Models across tasks such as captioning, object detection, grounding, and segmentation on a Raspberry Pi.

IntermediateFull instructions provided8 hours5,672
Vision-Language Models (VLM) at the Edge

Things used in this project

Hardware components

Raspberry Pi 5
Raspberry Pi 5
×1

Story

Read more

Code

EdgeML with Raspberry-Pi: FLORENCE-2

Credits

MJRoBot (Marcelo Rovai)
67 projects • 959 followers
Professor, Engineer, MBA, Master in Data Science. Writes about Electronics with a focus on Physical Computing, IoT, ML, TinyML and Robotics.

Comments