AI — Page 15
Models, agents, infra, applied AI.
- ElevenLabs launches Speech Engine to turn chat agents into voice with one prompt
The new pipeline layers onto existing stacks, adds 70+ languages, enterprise compliance, and pricing from 8 cents per minute via ElevenAPI.
- OpenAI says internal model cracks Erdős unit distance problem
The general-purpose reasoning model produced a polynomial-improvement construction; external mathematicians checked the proof and published a companion paper.
- Sam Altman offers every YC startup $2M in OpenAI tokens for equity
At a YC event, Sam Altman said OpenAI would invest across the class with $2 million in API tokens per startup, trading usage for equity.
- Nick Frosst initially said Cohere's Command A+ was out and Apache 2.0; later corrected us on X
Frosst's first X post called Command A+ Cohere's best model and described it as "out now" and "open source Apache 2.0." After publication, he corrected us on X; availability and licensing remain unconfirmed.
- This founder took revenue to zero after $9M raise to unwind services-led growth
In an X thread, Gushwork founder Nayrhit Bhattacharya said headline metrics masked a services-heavy business; he paused go-to-market for 4-5 months, automated the content pipeline, and relaunched at 90%+ gross margins.
- Parag Agrawal jumps into AI search as Exa Labs hauls in $250M
Parallel Web Systems, led by former Twitter CEO Parag Agrawal, joins a surge of AI search upstarts while Exa Labs raises big at a multibillion valuation, per TechCrunch.
- YC shock offer: $2M in OpenAI tokens for each batch company in exchange for equity, Bosmeny says
In a post on X, Bosmeny likened the blanket offer to Yuri Milner’s YC deal; no terms beyond equity-for-tokens were disclosed.
- Mistral AI acquires Emmi AI to build an industrial physics-AI stack
Johannes Brandstetter's Linz team joins Arthur Mensch's Mistral to push real-time sims and digital twins into aerospace, auto, and semiconductors.
- Hugging Face-led team open-sources Carbon, a fast DNA foundation model with 393k bp context
Three checkpoints (500M, 3B, 8B) ship under Apache 2.0 with open weights, code, and data. The team credits a 6-mer tokenizer and a new loss for its cited ~275x throughput vs Evo2, and says the 3B matches Evo2-7B’s win rate. The models were trained on ~1T tokens, and the team says a single GPU can process a human genome in under two days.
- ElevenLabs teams with Einstein's estate on education-focused AI persona, per Staniszewski
In a post on X, Staniszewski said the move is in collaboration with Einstein's estate and frames AI agents as one-to-one teachers for students.
- Tim Rocktäschel co-founds Recursive to build self-improving AI
The UCL AI professor joins a heavyweight founding crew to turn compute into accumulated knowledge with open-ended, automated scientific discovery.
- Google’s Antigravity 2.0 lands as a desktop app with multi-agent teams and voice
Rebuilt release adds multi-agent teams, scheduled tasks, native voice and one-click integration, with a blog post and downloads live today.
- Google launches Gemini 3.5 Flash for agents and coding, now in AI Studio
Announced during Google I/O, the new model is positioned for long-horizon agent workflows and coding tasks and is live via the Gemini API in Google AI Studio.
- Cities push back on AI plate readers as Flock Safety deployments trigger political blowback
An X thread flagged a Washington Post report from Troy, NY, where an AI camera rollout spurred uproar and a state of emergency; the Austin rampage has reopened questions about whether ALPRs like Flock Safety would have changed events there.
- MIT Media Lab debuts Human Operator, an AI that briefly moves your muscles to teach tasks
The prototype uses Claude-driven vision-language planning and electrical muscle stimulation to guide wrist and finger movement for training and assistance.
- HeyGen demos Avatar V workflow that stitches scenes into continuous AI video
The team showed cinematic avatar shots linked into longer sequences, pointing at narrative workflows instead of isolated clips.
- Marin debuts Delphi to forecast 25B pretraining runs with 0.2 percent error
In an X thread, Will Held says small-run fits predicted a 25B-param, 600B-token run at ~1e23 FLOPs, matching Paloma macro loss within 0.2% error.
- PolicyLayer: prompts are not permissions for production agents
In a post on X, the company argues agent behavior should be controlled by enforceable policies rather than instructions buried in a prompt.
- Meta's AIRA paper points to agents co-designing model architectures
Meta's AIRA architecture discovery work signals that agents are starting to help design neural models.
- RLWRLD debuts RLDX-1, a dexterity-first robot hand foundation model
A short X video frames the bet: focus on robot hands and the failure points in everyday tasks like pouring coffee and grasping objects.
- UniPat AI team debuts SaaS-Bench as agents finish under 4% of real SaaS workflows
Built across 23 live SaaS apps and 106 long-horizon tasks, the open benchmark finds frontier agents stumble on planning, memory, cross-app context, and error recovery.
- Anthropic reportedly edges OpenAI in U.S. paid business accounts. Marketing, product, or OpenAI fatigue?
An X post cites 34.4% vs 32.3% U.S. enterprise share for Anthropic with no disclosed methodology. The gap is slim and could reflect marketing, product fit, or buyers seeking an OpenAI alternative, even as some developers say OpenAI's code tools beat Claude Code.
- Colossus pegs Cognition AI at $445M run rate on Devin; US Army, Goldman, Mercedes named as customers
A Colossus magazine Q&A and profile cite Devin at a $445M annualized run rate with usage doubling every eight weeks, and name the US Army, Goldman Sachs, and Mercedes-Benz as early customers.
- OpenAI launches Deployment Company, agrees to acquire Tomoro
The majority-owned unit will embed Forward Deployed Engineers; it is backed by 19 partners and starts with 150 specialists and $4 billion, per OpenAI and Greg Brockman.