DeepSeek AI released DeepSeek-OCR 2, an open source document OCR and understanding system that reorganizes its vision encoder to read pages in a causal order that is closer to the …
Releases
-
-
AI News
Ant Group releases Lingbot-VLA, a Vision Language Action Foundation model for real-world robot manipulation
How do you create a single vision language action model that can control many different dual-handed robots in the real world? Lingbot-VLA is Ant Group Robiont’s new Vision Language Action …
-
AI Tools
MBZUAI Releases K2 Think V2: A Fully Sovereign 70B Reasoning Model for Math, Code, and Science
Can a fully sovereign open logic model match state-of-the-art systems when every part of its training pipeline is transparent? Researchers at Mohammed Bin Zayed University of Artificial Intelligence (MBZUAI) have …
-
Generative AI
Moonshot AI Releases KM K2.5: An Open Source Visual Agent Intelligence Model with Native Swarm Execution
Moonshot AI has released Kimi K2.5 as an open source visual agentic intelligence model. It combines a great mix of an expert language backbone, a native vision encoder, and a …
-
Open source AI model provider Allen Institute for AI on Tuesday launched a new family of open coding agents that enable enterprise developer teams to train small, open models on …
-
GitHub has open sourced the internal agent runtime that powers the GitHub Copilot CLI and exposes it as a programmable SDK. GitHub copilot-sdkNow in Technical Preview, lets you embed the …
-
Generative AI
Microsoft releases VibeVoice-ASR: a unified speech-to-text model designed to handle up to 60 minutes of long audio in a single pass
Microsoft has released VibeVoice-ASR as part of the VibeVoice family of open source Frontier voice AI models. VibeVoice-ASR is described as an integrated speech-to-text model that can handle up to …
-
Inworld AI has introduced Inworld TTS-1.5, an upgrade to its TTS-1 family, which targets realtime voice agents with tighter constraints on latency, quality, and cost. TTS-1.5 is described as the …
-
Generative AI
Liquid AI releases LFM2.5-1.2b-thinking: a 1.2b parameter reasoning model that fits under 1GB on-device
Liquid AI has released LFM2.5-1.2b-Thinking, a 1.2 billion parameter reasoning model that runs entirely on device and fits in about 900 MB on a modern phone. What was required in …
-
glm-4.7-flash is a new member of the GLM 4.7 family and targets developers who want robust coding and reasoning performance in practical models to run locally. Zhipu AI (Z.AI) describes …
