ijohn free life's picture

ijohn free life

ijohn07

·

john_whickins

AI & ML interests

None yet

Recent Activity

liked a Space 1 day ago

RWKV-Red-Team/RWKV-LatestSpace

liked a model 1 day ago

THUDM/GLM-Z1-9B-0414

liked a model 3 days ago

google/gemma-3-4b-it-qat-q4_0-gguf

View all activity

Organizations

ijohn07's activity

upvoted an article 5 days ago

Article

Cohere on Hugging Face Inference Providers 🔥

6 days ago

• 87

upvoted a collection 6 days ago

InternVL3

34 items • Updated 1 day ago • 54

upvoted 3 collections 7 days ago

Kimina Prover Preview

State-of-the-Art Models for Formal Mathematical Reasoning • 4 items • Updated 7 days ago • 26

Skywork-OR1

Skywork Open Reasoner 1 • 8 items • Updated 8 days ago • 21

GLM-4-0414

GLM-4-0414 series model • 8 items • Updated 7 days ago • 104

upvoted a collection 10 days ago

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 9 days ago • 61

upvoted 2 collections 13 days ago

Cogito v1 Preview

5 items • Updated 14 days ago • 101

HiDream-I1

A collections of HiDream-I1 models. • 4 items • Updated 14 days ago • 26

upvoted a collection 16 days ago

Llama 4

Llama 4 release • 10 items • Updated 16 days ago • 439

upvoted a collection 18 days ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 3 days ago • 152

upvoted a paper 26 days ago

HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs

Paper • 2503.02003 • Published Mar 3 • 47

upvoted a paper 28 days ago

LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

Paper • 2407.03168 • Published Jul 3, 2024 • 3

upvoted 8 collections about 1 month ago

Qwen 2.5 Coder Llamafiles (<50B)

Llamafiles for the smaller Qwen 2.5 Coder models • 6 items • Updated Feb 25 • 1

Qwen 2.5 Llamafiles (<50B)

Llamafiles for the smaller Qwen 2.5 text only models • 6 items • Updated Feb 25 • 1

Deepseek Distilled Llamafiles (<50B)

Llamafiles for the smaller Deepseek Distilled Models • 5 items • Updated Feb 25 • 2

DeepHermes

Preview models of hybrid reasoner Hermes series • 6 items • Updated Mar 13 • 27

Gemma 3

4 items • Updated Mar 12 • 15

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated about 11 hours ago • 219

Gemma 3

All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 29 items • Updated about 11 hours ago • 53

Gemma 3 Release

24 items • Updated 3 days ago • 340