Latest AI News
We continue our series about alternatives to transformers. In the AI of the week, we dive into Anthropic’s groundbreaking paper about natural language autoencoders.
There is this thing that happens in ML research where a line of work gets quietly good for years, and then one day you wake up and it’s suddenly competing with the dominant paradigm. State space models are having that moment right now.
By complete coincidence, the day we released Neil Zeghidour (CEO of Gradium, the for profit spinoff of the vaunted Kyutai Moshi )’s talk on what remains to be built for realtime voice, Thinking Machines emerged for only the third time in a ~year (despite much drama) to drop Interaction Models: A Scalable Approach to Human-AI Collaboration ,…
The proximal cause of today’s op-ed is OpenAI’s deprecation of their finetuning APIs. For years, OpenAI stood out among the big labs for their finetuning support, and many many many talks and content pieces and AI engineers promoted how you can get some variant of “get o1 performance at 4o prices” and insisting that it was an important part of the toolkit.
The artificial intelligence coding revolution comes with a catch: it's expensive. Claude Code , Anthropic's terminal-based AI agent that can write, debug, and deploy code autonomously, has captured the imagination of software developers worldwide.
Railway , a San Francisco-based cloud platform that has quietly amassed two million developers without spending a dollar on marketing, announced Thursday that it raised $100 million in a Series B funding round, as surging demand for artificial intelligence applications exposes the limitations of legacy cloud infrastructure. TQ Ventures led the round, with…
As noted earlier, Mozilla’s characterization of AI-assisted vulnerability discovery as a game changer has been met with massive, vocal skepticism in many quarters. Critics initially scoffed when Mozilla didn’t obtain CVE designations for any of the 271 vulnerabilities.
Apple’s next iOS update could include something phone photographers have been waiting for: a lot more control over the Camera app. According to Bloomberg ’s Mark Gurman, the Camera app will be “fully customizable” in iOS 27 and users will be able to “pick their own set of controls — called widgets — that run along the top of the interface.” Gurman notes…
After two weeks of hearing from assorted witnesses that he was a lying snake, the jury finally heard from the lying snake himself: Sam Altman. At the end of the testimony, his lawyer William Savitt asked him how it felt to be accused of stealing a charity.
Sony’s Xperia 1 flagships have looked more or less the same since 2020 , but that’s finally changing with the Xperia 1 VIII, which moves to a chunky square camera island. The phone also boasts what should be a substantially improved telephoto camera, along with an AI camera assistant that looks like an improved version of Google’s Camera Coach.
OpenAI CEO Sam Altman finally took the stand this morning to defend himself against his former co-founder Elon Musk’s lawsuit challenging OpenAI’s corporate structure. Altman was immediately asked what he thought of Musk’s allegation that OpenAI’s other founders “stole a charity” when they launched a for-profit subsidiary to market products based on the…
Neil Batlivala has spent seven years building a healthcare company that most of the tech industry has never heard of and that serves a patient population most of Silicon Valley ignores. But last month, that work put him at the center of something much bigger.
EMO: Pretraining mixture of experts for emergent modularity 🧠 Models: https://huggingface.co/collections/allenai/emo | 📄 Tech report: https://allenai.org/papers/emo | 💻 Code: https://github.com/allenai/EMO | 📊 Visualization: https://emovisualization.netlify.app/ Today we're releasing EMO , a new mixture-of-experts (MoE) model pretrained end-to-end so that…
Building Blocks for Foundation Model Training and Inference on AWS For a long time, "scaling" in foundation models mostly meant one thing: spend more compute on pre-training and capabilities rise. That intuition was supported by empirical work such as Kaplan et al.