tonywu71 committed · Commit 7d3c8ab · verified · 1 Parent(s): 0519273

Update README.md

Files changed (1)
  1. README.md +7 -0
README.md CHANGED
@@ -88,6 +88,13 @@ with torch.no_grad():
  scores = processor.score_retrieval(query_embeddings.embeddings, image_embeddings.embeddings)
  ```
 
+ ## Resources
+
+ - The *ColPali* arXiv paper can be found [here](https://doi.org/10.48550/arXiv.2407.01449). 📄
+ - The official blog post detailing ColPali can be found [here](https://huggingface.co/blog/manu/colpali). 📝
+ - The original model implementation code for the ColPali model and for the `colpali-engine` package can be found [here](https://github.com/illuin-tech/colpali). 🌎
+ - Cookbooks for learning to use the transformers-native version of *ColPali*, fine-tuning, and similarity maps generation can be found [here](https://github.com/tonywu71/colpali-cookbooks). 📚
+
  ## Limitations
 
  - **Focus**: The model primarily focuses on PDF-type documents and high-ressources languages, potentially limiting its generalization to other document types or less represented languages.