Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
47
1
Big Deeper
BigDeeper
Follow
21world's profile picture
1 follower
·
0 following
AI & ML interests
Differentiable hashing, orthonormal polynomial language modeling, image compression into language representations.
Recent Activity
liked
a model
27 days ago
deepseek-ai/DeepSeek-V3
new
activity
about 2 months ago
ByteDance/LatentSync:
Very large RAM foot print.
new
activity
3 months ago
brittlewis12/s1-32B-GGUF:
THE q8_0 version appears to go on and on indefinitely.
View all activity
Organizations
None yet
BigDeeper
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
27 days ago
deepseek-ai/DeepSeek-V3
Text Generation
•
Updated
29 days ago
•
789k
•
•
3.82k
New activity in
ByteDance/LatentSync
about 2 months ago
Very large RAM foot print.
4
#1 opened 3 months ago by
BigDeeper
New activity in
brittlewis12/s1-32B-GGUF
3 months ago
THE q8_0 version appears to go on and on indefinitely.
6
#1 opened 3 months ago by
BigDeeper
New activity in
ndkhanh95/LatentSync
3 months ago
Having a problem. Unable to find a suitable output format for 'video_out.mp4
#1 opened 3 months ago by
BigDeeper
New activity in
chunyu-li/LatentSync
3 months ago
Any ideas how to mitigate this problem?
#3 opened 3 months ago by
BigDeeper
New activity in
Lightricks/LTX-Video
5 months ago
Longer video?
6
#25 opened 5 months ago by
BigDeeper
What minimal VRAM does it require?
12
#18 opened 5 months ago by
DrNicefellow
New activity in
Qwen/Qwen2.5-Coder-32B-Instruct
5 months ago
VSCODE + Cline + Ollama + Qwen2.5-Coder-32B-Instruct.Q8_0
3
#20 opened 5 months ago by
BigDeeper
New activity in
black-forest-labs/FLUX.1-dev
9 months ago
comfyui does not recognize model files in sft format
4
5
#18 opened 9 months ago by
peidong
New activity in
bigscience/bloomz-3b
9 months ago
Are there advantages or disadvantages in changing the format for translation?
3
#10 opened 9 months ago by
BigDeeper
New activity in
QuantFactory/Meta-Llama-3-120B-Instruct-GGUF
12 months ago
What does 120B really mean?
3
#1 opened 12 months ago by
BigDeeper
New activity in
meta-llama/Meta-Llama-3-70B
12 months ago
Does anyone know which specific Python library contains the tokenizer that was used to train Llama-3-70b?
1
2
#11 opened 12 months ago by
BigDeeper
15 TeraTokens = 190 Million books
2
#4 opened about 1 year ago by
Languido
New activity in
meta-llama/Meta-Llama-3-8B
12 months ago
I was trying to fine-tune llama3 8b but getting following error - TypeError: LlamaForCausalLM.forward() got an unexpected keyword argument 'decoder_input_ids'
4
#117 opened 12 months ago by
aniiikket11
New activity in
cognitivecomputations/dolphin-2.9-llama3-8b-gguf
12 months ago
Has anyone tried this gguf with agentic framework?
3
#6 opened 12 months ago by
BigDeeper
New activity in
microsoft/Phi-3-mini-128k-instruct
12 months ago
gguf
30
#24 opened about 1 year ago by
LaferriereJC
New activity in
pjh64/Phi-3-mini-128K-Instruct.gguf
12 months ago
How did you manage to produce gguf files, when llama.cpp/convert.py gives an error about the ROPE encoding?
4
#1 opened 12 months ago by
BigDeeper
New activity in
PrunaAI/Phi-3-mini-128k-instruct-GGUF-Imatrix-smashed
12 months ago
Do they work with ollama? How was the conversion done for 128K, llama.cpp/convert.py complains about ROPE.
8
#2 opened about 1 year ago by
BigDeeper
New activity in
PrunaAI/Phi-3-mini-128k-instruct-GGUF-Imatrix-smashed
about 1 year ago
Do they work with ollama? How was the conversion done for 128K, llama.cpp/convert.py complains about ROPE.
8
#2 opened about 1 year ago by
BigDeeper
New activity in
microsoft/Phi-3-mini-128k-instruct
about 1 year ago
gguf
30
#24 opened about 1 year ago by
LaferriereJC
Load more