Safetensors · English · qwen2_vl · qwen_vl · video · real-time · multimodal · LLM
chenjoya committed
Commit 6e0d506 · verified · 1 Parent(s): 5a0c883

Update README.md

Files changed (1): README.md (+1, -1)
README.md CHANGED
@@ -206,7 +206,7 @@ for t in range(31):
 
 - This model is finetuned from LiveCC-7B-Base, which itself starts from Qwen2-VL-7B-Base, so it may share the limitations mentioned in https://huggingface.co/Qwen/Qwen2-VL-7B.
 - When performing real-time video commentary, the output may collapse, e.g., into repeated patterns. If you encounter this, try adjusting repetition_penalty, streaming_eos_base_threshold, and streaming_eos_threshold_step.
-- This model only has a context window of 32768. Using more visual tokens per frame (e.g. 768 * 28 * 28) will give the best performance, but will shorten the working duration.
+- This model only has a context window of 32768. Using more visual tokens per frame (e.g. 768 * 28 * 28) will give better performance, but will shorten the working duration.
 
 These limitations serve as ongoing directions for model optimization and improvement, and we are committed to continually enhancing the model's performance and scope of application.
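
The limitation notes in the diff above name a few tunable knobs. Below is a minimal, hypothetical sketch of how one might apply them, assuming the standard Qwen2-VL loading path in transformers; the repo id and all numeric values are illustrative placeholders, and the streaming_eos_* settings are assumed to be consumed by LiveCC's own streaming-inference code rather than by transformers' generate().

```python
# Hypothetical sketch: parameter names follow the limitation notes above;
# the repo id and numeric values are illustrative placeholders.
from transformers import Qwen2VLForConditionalGeneration, AutoProcessor

model_id = "chenjoya/LiveCC-7B-Instruct"  # assumed repo id

# Lower max_pixels -> fewer visual tokens per frame -> longer working duration
# inside the 32768-token context window (at some cost in per-frame detail).
processor = AutoProcessor.from_pretrained(model_id, max_pixels=768 * 28 * 28)
model = Qwen2VLForConditionalGeneration.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)

# If real-time commentary collapses into repeated patterns, nudge these first.
generate_kwargs = dict(
    repetition_penalty=1.05,  # standard transformers generate() argument
)
streaming_kwargs = dict(
    # LiveCC-specific streaming EOS schedule; values here are placeholders.
    streaming_eos_base_threshold=0.6,
    streaming_eos_threshold_step=0.05,
)
```

The third bullet's trade-off is the main lever: a larger max_pixels per frame improves per-frame detail, while a smaller value stretches how much video fits into the 32768-token window.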