Kimi K2.7 Code is live on Baseten Model APIs.
Kimi K2.7 Code uses 30% fewer reasoning tokens than K2.6, with improved instruction following and higher end-to-end coding task success rates.
Try it out: baseten.co/library/kimi-k…
My full review of @philipkiely's Inference Engineering.
TL;DR: I desperately wish I could ship this book back to myself in 2023. But you should read it today to avoid making the same mistakes I did!
Big news from Boltz - our biggest update yet! 🚀
Today we’re releasing two new state-of-the-art models for protein and small-molecule design, backed by extensive wet-lab validation, plus a new API to run all of our models on scalable GPUs wherever you (or your agents) work! 🔥
GLM 5.2 is live on Baseten.
5.2 is built for agentic engineering: stronger coding, sharper agentic reasoning, and a long context window built to run hours-long tasks.
Test it today: baseten.co/library/glm-52/
AI advantage is shifting from raw model scale to deep integration with unique organizational data, workflows, and human expertise through continual learning loops. Ideally, this will result in distributed, defensible value rather than winner-take-all dynamics around general
The new AgentPerf benchmark by @ArtificialAnlys shows that @NVIDIAAI Blackwell delivers the best performance for demanding agentic workloads. With NVIDIA, we're continuously investing in making your coding agents run fast, scale seamlessly, and cost less.
We're thrilled to be working with the Harvey team to push open models to frontier-level performance for legal AI.
Shout out to @gabepereyra for the great article. LAB was key to our joint work post-training open-weight models for legal agents.
Congrats to the MiniMax team on the open-source launch of M3!
Very few <500bn parameter models can tackle coding, agentic workloads, and multimodal tasks, all with a 1M-token context window, but M3 does it all.
Dig in here: baseten.co/library/minima…