Update README.md
README.md CHANGED
@@ -25,7 +25,7 @@ To further enhance the model's multimodal capabilities, we employ trainable spec
 2. Once the adapter has learned to map ViT's visual embeddings to the language model's textual space, we proceed to unfreeze Mistral for improved understanding of dialog formats and complex queries.
 
 <p align="left">
-<img src="https://raw.githubusercontent.com/AIRI-Institute/OmniFusion/main/content/datasets.png" width="
+<img src="https://raw.githubusercontent.com/AIRI-Institute/OmniFusion/main/content/datasets.png" width="50%">
 </p>
 
 ### Results
@@ -45,7 +45,7 @@ Model Performance on Visual Dialog Benchmark
 ### Examples
 
 <p align="left">
-<img src="https://raw.githubusercontent.com/AIRI-Institute/OmniFusion/main/content/examples.png" width="
+<img src="https://raw.githubusercontent.com/AIRI-Institute/OmniFusion/main/content/examples.png" width="70%">
 </p>
 
 ### Future Plans
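For context on the staged training mentioned in the README excerpt above (train the adapter with the language model frozen, then unfreeze Mistral), below is a minimal PyTorch-style sketch of that freeze/unfreeze pattern. It is not the OmniFusion implementation: the `VisualAdapter` module, the stand-in language model, and the learning rates are illustrative assumptions only.

```python
# Minimal sketch (not the OmniFusion code) of two-stage training:
# stage 1 trains only the adapter with the LLM frozen; stage 2 unfreezes the LLM.
import torch
import torch.nn as nn


class VisualAdapter(nn.Module):
    """Hypothetical MLP adapter mapping ViT embeddings into the LLM's hidden space."""

    def __init__(self, vit_dim: int = 1024, llm_dim: int = 4096):
        super().__init__()
        self.proj = nn.Sequential(
            nn.Linear(vit_dim, llm_dim),
            nn.GELU(),
            nn.Linear(llm_dim, llm_dim),
        )

    def forward(self, visual_embeds: torch.Tensor) -> torch.Tensor:
        return self.proj(visual_embeds)


def set_trainable(module: nn.Module, trainable: bool) -> None:
    """Freeze or unfreeze all parameters of a module."""
    for p in module.parameters():
        p.requires_grad_(trainable)


# Stage 1: adapter-only training, language model frozen.
adapter = VisualAdapter()
language_model = nn.TransformerEncoder(  # stand-in for Mistral, for illustration
    nn.TransformerEncoderLayer(d_model=4096, nhead=32, batch_first=True),
    num_layers=2,
)
set_trainable(language_model, False)
optimizer = torch.optim.AdamW(adapter.parameters(), lr=1e-4)

# Stage 2: unfreeze the language model and continue training both components.
set_trainable(language_model, True)
optimizer = torch.optim.AdamW(
    list(adapter.parameters()) + list(language_model.parameters()), lr=2e-5
)
```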