CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Paper • 2504.13161 • Published 4 days ago • 85
FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents Paper • 2504.13128 • Published 4 days ago • 5
Unmasking Deepfakes: Masked Autoencoding Spatiotemporal Transformers for Enhanced Video Forgery Detection Paper • 2306.06881 • Published Jun 12, 2023 • 2
Cell2Sentence Models Collection Cell2Sentence models trained for single-cell tasks • 5 items • Updated 5 days ago • 6
Multimodal DSE Retrievers Collection A collection of DSE models for multimodal retrieval • 5 items • Updated 6 days ago • 13
Kimina Prover Preview Collection State-of-the-Art Models for Formal Mathematical Reasoning • 4 items • Updated 7 days ago • 26
Advancing Medical Representation Learning Through High-Quality Data Paper • 2503.14377 • Published Mar 18 • 2
EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records Paper • 2405.14567 • Published May 23, 2024 • 2
Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated 11 days ago • 76
CoRNStack Collection State-of-the-art code retrieval and re-ranking models and datasets • 9 items • Updated 26 days ago • 17
DeTikZify Collection Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ • 12 items • Updated Mar 19 • 24
💫StarVector Models Collection StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated Mar 20 • 93
Llama Nemotron Collection Open, Production-ready Enterprise Models • 4 items • Updated 7 days ago • 36