AI — Page 13

Models, agents, infra, applied AI.

X user @sang_yun_lee tees up 'sleep' for language models in new work
In a short X post, the user @sang_yun_lee hints at a periodic, recurrent cycle for LMs, framing the idea with: "Almost all animals sleep. Why don't LMs?"
XCENA raises $135M to make memory, not compute, the center of AI performance
CEO Jin Kim and a Samsung/SK hynix veteran team rebrand MetisX as XCENA and double down on CXL computational memory with MX1, at a $570M valuation.
AISlop ships CLI and GitHub Action to catch AI-generated code smells
AISlop is live on GitHub and the Marketplace as a CI quality gate with ai-slop/* rules, signaling a push to flag AI-written code in CI workflows.
AI drug discovery enters physical validation: Insilico, Recursion, FutureHouse and Google DeepMind credited with new milestones
Peer-reviewed data for AI-generated peptides, enzymes and CRISPR variants, plus AI-designed compounds entering human trials, push in silico models into preclinical reality.
jqwik maintainer hid a data-wiping prompt for AI agents in v1.10.0
Johannes Link added a concealed string telling vulnerable coding agents to delete jqwik tests and code, then updated docs to disclose it after users objected.
Liquid AI releases LFM2.5-8B-A1B, a device-optimized 8B MoE model for on-device agents
Boston startup says the 128K-context, 38T-token LFM2.5 upgrade delivers reliable agentic behavior, fast tool calling, and open weights for phones, laptops, PCs, robots, and lightweight servers.
CHIMERA paper accepted to ACL 2026 main conference, shared on X by Noy Sternlicht
A retweet from Peter Jansen amplified Noy Sternlicht's note that CHIMERA made ACL 2026's main conference; the arXiv preprint is live.
Sakana AI's DiffusionBlocks trains one block at a time, claiming 1/B memory with end-to-end parity
ICLR 2026 work recasts block-wise updates as reverse diffusion, reporting comparable results in vision, image generation, and language while storing activations for a single block.
Runway plugs its creative models into Claude, ChatGPT, Cursor, and Replit with MCP
The new connector makes image and video generation callable inside agent workflows and exposes models like Gen-4.5, Kling, and GPT image 2 right from chat.
Genesis AI launches Genesis World 1.0, an open-source robotics sim
Open-source and billed as the second piece of its full-stack suite, Genesis World 1.0 targets the 1x real-world speed bottleneck with a technical blog laying out the thesis.
PrismML open-sources compact Bonsai Image 4B and launches Bonsai Studio for iPhone
Open-source 4B image models fit in 0.93GB and 1.21GB, and Bonsai Studio brings on-device generation to iPhone under Apache 2.0.
Prava launches Prava Pay to let AI agents pay with one-time cards
In a thread on X, Sushant Pandey announced Prava Pay, which the company says gives AI agents scoped, single-use Visa cards with passkey approvals so users can let bots buy things safely.
Finn Mallery launches SEND, a one-prompt, multi-channel outbound tool
Mallery introduced SEND, offered a free month to commenters, and took aim at clunky dashboards and AI SDRs.
Human Archive taps India’s gig economy to feed physical AI
Founded by Berkeley and Stanford researchers, the data lab pays service workers to wear camera caps and sensors and says it already spans 100k+ contributors and 500 partners.
OpenClaw momentum builds around a local, open agent as a Google 'Spark' rumor circulates
Aligned News cited a 300,000-star moment and a Google 'Spark' entrant; while unverified, the buzz spotlights OpenClaw's local, open, self-hosted agent thesis.
Zeb Evans cuts 22% at ClickUp and bets on 3,000 AI agents to build a 100x org
The ClickUp CEO says savings will fund million-dollar salary bands for AI-leveraged top performers, even as Gartner warns automation cuts do not guarantee returns.
AlphaProof Nexus teaser hints at agentic math push, but the builders stay unnamed
A brief X post teased an agentic framework for research-level math, but shared no docs or team names identifying what AlphaProof Nexus is.
Pushmeet Kohli shares Google DeepMind's AlphaProof Nexus results: agentic proof search in Lean
VP of Research Pushmeet Kohli points to a GitHub trove of Lean-formalized proofs and prose by AlphaProof Nexus, signaling progress while holding back the framework code.
Anthropic's Claude Code Auto Mode rolls out to Pro and adds Sonnet 4.6, per community post
A widely reshared ClaudeDevs note, surfaced by Aligned News in a post on X, says Auto Mode now runs on Pro with Sonnet 4.6 and Opus 4.7.
Aligned News flags new paper on evaluation awareness in frontier LLMs
Haritz Puerto says a paper on decomposing and measuring evaluation awareness just dropped, plus a resource called EvalAwa..., but links and authorship were not shared.
Changling Li leads EvalAwareBench to measure when LLMs know they are being tested
In a new paper and open releases, Li and collaborators decompose evaluation awareness, test nine models across four benchmarks, and publish a factor-controlled dataset and code.
Hugging Face leader says Gemma 4 tops 120M downloads in weeks, counting Hugging Face and Ollama only
The tally counts only Hugging Face and Ollama pulls, hinting at on-device demand but leaving methodology and release timing unclear.
Asimov plans Palo Alto, SF, and Austin meetups to talk humanoids and AI
Coffee meetups land June 2, 4, and 6, with RSVPs running through Luma as Asimov convenes folks interested in humanoids and AI.
Freu AI launches Mac agent that compiles your cross-app workflow once, then runs it locally with zero recurring token cost
Demos show a Mac agent that records your cross-app workflow once, compiles it into a deterministic DSL, and replays it locally with zero recurring tokens. With freu-cli open-sourced and a local vision execution model coming, can ahead-of-time semantic compilation beat screenshot agents on cost and latency?