News

Chinese AI startup DeepSeek on January 20 launched two large language models (LLMs): DeepSeek-R1-Zero and DeepSeek-R1-Distill ...
Compared to DeepSeek R1, Llama-3.1-Nemotron-Ultra-253B shows competitive results despite having fewer than half the parameters.
The United States risks losing the so-called "AI Cold War" against China unless it abandons traditional containment ...
The initial model lineup includes five base sizes: 3 billion, 8 billion, 14 billion, 32 billion, and 70 billion parameters.
Google released API pricing for Gemini 2.5 Pro, an AI reasoning model with industry-leading performance on several benchmarks ...
When I wrote about DeepSeek’s remarkable AI breakthrough in January, I didn’t expect to see my predictions validated so ...
Meta has debuted the first two models in its Llama 4 family, its first to use mixture-of-experts tech. A Saturday post from ...
A new company, Deep Cogito, has emerged from stealth with a family of openly available AI models that can be switched between ...
Bhavish Aggarwal-led AI unicorn Krutrim has said that it has started hosting Meta’s Llama 4 models on its cloud platform.
Meta for its part wasn't hiding the fact this was an experimental build. In its launch blog post, the Instagram parent wrote ...