glm-4.7-flash is a new member of the GLM 4.7 family and targets developers who want robust coding and reasoning performance in a practical model that can run locally. Zhipu AI (Z.AI) describes …
Tag: MOE
NVIDIA AI Releases Nemotron 3: A Hybrid Mamba Transformer MOE Stack for Long Context Agent AI
NVIDIA has released the Nemotron 3 family of open models as part of a full stack for agentic AI, including model weights, datasets, and reinforcement learning tools. The family has …
Question: MoE models have far more parameters than dense Transformers, yet they can run faster at inference time. How is that possible? Difference Between Transformers and …
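The truncated teaser above points at the standard answer: in a Mixture-of-Experts layer, a router sends each token through only a few of the experts, so per-token inference cost tracks the active parameters rather than the total parameter count. The NumPy sketch below is a minimal, hypothetical illustration of top-k expert routing; the layer sizes, expert count, and top_k value are made up for the example and do not describe any particular model.

```python
import numpy as np

# Toy MoE layer: many expert FFNs are stored, but each token is routed to
# only the top_k of them, so compute scales with active (not total) params.
# All sizes below are illustrative assumptions, not any real model's config.
d_model, d_ff = 64, 256
num_experts, top_k = 8, 2

rng = np.random.default_rng(0)
# One small FFN (two weight matrices) per expert.
experts = [
    (rng.standard_normal((d_model, d_ff)) * 0.02,
     rng.standard_normal((d_ff, d_model)) * 0.02)
    for _ in range(num_experts)
]
router = rng.standard_normal((d_model, num_experts)) * 0.02  # gating weights


def moe_layer(x):
    """x: (tokens, d_model). Each token runs through only top_k experts."""
    logits = x @ router                                    # (tokens, num_experts)
    probs = np.exp(logits) / np.exp(logits).sum(-1, keepdims=True)
    out = np.zeros_like(x)
    for t in range(x.shape[0]):
        chosen = np.argsort(probs[t])[-top_k:]             # indices of top-k experts
        gate = probs[t, chosen] / probs[t, chosen].sum()   # renormalised gate values
        for g, e in zip(gate, chosen):
            w_in, w_out = experts[e]
            out[t] += g * (np.maximum(x[t] @ w_in, 0.0) @ w_out)  # ReLU FFN
    return out


tokens = rng.standard_normal((4, d_model))
print(moe_layer(tokens).shape)  # (4, 64)

total_params = num_experts * 2 * d_model * d_ff
active_params = top_k * 2 * d_model * d_ff
print(f"total expert params: {total_params}, active per token: {active_params}")
```

With eight experts and top-2 routing, the layer stores four times more expert parameters than any single token ever touches, which is the usual reason sparse MoE models can serve faster than dense models of the same total size.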
