Latest open artifacts (#20): New orgs! New types of models! With Nemotron Super, Sarvam, Cohere Transcribe, & others
NVIDIA's Nemotron-3-Super-120B arrives with 120B params (12B active), 1M context window, and LatentMoE architecture—alongside fresh releases from Sarvam, Cohere, and others spanning OCR, transcription, code-editing, and math theorem proving. A month of niche, application-focused models rather than headline-grabbing frontier models.
Interconnects · 5 min read
Repos