thatAverageGuy/EarlyFusion-on-EasyVQA

EarlyFusion-on-EasyVQA

This repository contains the Streamlit demo for Episode 1 of the Vision Language Modelling Series by "Donkey Stereotype by PrithiviDa".

YouTube: Video Link

Original Reference: Training Notebook

Dataset: Training and Testing Dataset

Demo: Host Link

The test_samples directory contains some images for interacting with the demo. Their corresponding questions are in questions.txt. If you have no idea what this is all about, just pick images and questions from the directory and play around.

Note: The model demonstrated here is the EarlyFusion model from the video.
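For readers new to the term, here is a minimal, framework-free sketch of what early fusion usually means: the image and question features are concatenated into one vector before a single shared layer scores the candidate answers. This is not the repository's PyTorch model. The feature sizes, the random weights, and the names `linear` and `early_fusion_logits` are all hypothetical and chosen only for illustration.

```python
import random


def linear(x, weights, bias):
    """Apply a dense layer: one output value per row of `weights`."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def early_fusion_logits(image_features, question_features, weights, bias):
    """Score answers from a single fused vector (hypothetical helper)."""
    # Early fusion: join both modalities into one vector first ...
    fused = image_features + question_features  # list concatenation
    # ... then let one shared layer see image and question together.
    return linear(fused, weights, bias)


rng = random.Random(0)

# Hypothetical stand-ins for the outputs of an image encoder and a text encoder.
image_features = [rng.random() for _ in range(4)]
question_features = [rng.random() for _ in range(3)]

num_answers = 5  # hypothetical size of the answer vocabulary
fused_size = len(image_features) + len(question_features)
weights = [[rng.uniform(-1, 1) for _ in range(fused_size)] for _ in range(num_answers)]
bias = [0.0] * num_answers

logits = early_fusion_logits(image_features, question_features, weights, bias)
predicted_answer = max(range(num_answers), key=lambda i: logits[i])
print(f"{len(logits)} answer scores, predicted answer index: {predicted_answer}")
```

In a trained model the weights would be learned rather than random. The point of the sketch is only the order of operations: fuse first, then classify.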

About

Streamlit app for demonstrating multi-modal (vision + language) modelling in PyTorch.
