47 1

Big Deeper

BigDeeper

AI & ML interests

Differentiable hashing, orthonormal polynomial language modeling, image compression into language representations.

Recent Activity

liked a model 27 days ago

deepseek-ai/DeepSeek-V3

new activity about 2 months ago

ByteDance/LatentSync:Very large RAM foot print.

new activity 3 months ago

brittlewis12/s1-32B-GGUF:THE q8_0 version appears to go on and on indefinitely.

View all activity

Organizations

None yet

BigDeeper's activity

liked a model 27 days ago

deepseek-ai/DeepSeek-V3

Text Generation • Updated 29 days ago • 789k • • 3.82k

New activity in ByteDance/LatentSync about 2 months ago

Very large RAM foot print.

#1 opened 3 months ago by

BigDeeper

New activity in brittlewis12/s1-32B-GGUF 3 months ago

THE q8_0 version appears to go on and on indefinitely.

#1 opened 3 months ago by

BigDeeper

New activity in ndkhanh95/LatentSync 3 months ago

Having a problem. Unable to find a suitable output format for 'video_out.mp4

#1 opened 3 months ago by

BigDeeper

New activity in chunyu-li/LatentSync 3 months ago

Any ideas how to mitigate this problem?

#3 opened 3 months ago by

BigDeeper

New activity in Lightricks/LTX-Video 5 months ago

Longer video?

#25 opened 5 months ago by

BigDeeper

What minimal VRAM does it require?

#18 opened 5 months ago by

DrNicefellow

New activity in Qwen/Qwen2.5-Coder-32B-Instruct 5 months ago

VSCODE + Cline + Ollama + Qwen2.5-Coder-32B-Instruct.Q8_0

#20 opened 5 months ago by

BigDeeper

New activity in black-forest-labs/FLUX.1-dev 9 months ago

comfyui does not recognize model files in sft format

#18 opened 9 months ago by

peidong

New activity in bigscience/bloomz-3b 9 months ago

Are there advantages or disadvantages in changing the format for translation?

#10 opened 9 months ago by

BigDeeper

New activity in QuantFactory/Meta-Llama-3-120B-Instruct-GGUF 12 months ago

What does 120B really mean?

#1 opened 12 months ago by

BigDeeper

New activity in meta-llama/Meta-Llama-3-70B 12 months ago

Does anyone know which specific Python library contains the tokenizer that was used to train Llama-3-70b?

#11 opened 12 months ago by

BigDeeper

15 TeraTokens = 190 Million books

#4 opened about 1 year ago by

Languido

New activity in meta-llama/Meta-Llama-3-8B 12 months ago

I was trying to fine-tune llama3 8b but getting following error - TypeError: LlamaForCausalLM.forward() got an unexpected keyword argument 'decoder_input_ids'

#117 opened 12 months ago by

aniiikket11

New activity in cognitivecomputations/dolphin-2.9-llama3-8b-gguf 12 months ago

Has anyone tried this gguf with agentic framework?

#6 opened 12 months ago by

BigDeeper

New activity in microsoft/Phi-3-mini-128k-instruct 12 months ago

gguf

#24 opened about 1 year ago by

LaferriereJC

New activity in pjh64/Phi-3-mini-128K-Instruct.gguf 12 months ago

How did you manage to produce gguf files, when llama.cpp/convert.py gives an error about the ROPE encoding?

#1 opened 12 months ago by

BigDeeper

New activity in PrunaAI/Phi-3-mini-128k-instruct-GGUF-Imatrix-smashed 12 months ago

Do they work with ollama? How was the conversion done for 128K, llama.cpp/convert.py complains about ROPE.

#2 opened about 1 year ago by

BigDeeper

New activity in PrunaAI/Phi-3-mini-128k-instruct-GGUF-Imatrix-smashed about 1 year ago

Do they work with ollama? How was the conversion done for 128K, llama.cpp/convert.py complains about ROPE.

#2 opened about 1 year ago by

BigDeeper

New activity in microsoft/Phi-3-mini-128k-instruct about 1 year ago

gguf

#24 opened about 1 year ago by

LaferriereJC