page 9 of 13
new research
New Framework Tells You When to Trust AI Reasoning—With Math to Back It Up
A new method gives reasoning model outputs statistical uncertainty bounds—and explains exactly which training data caused them.
latest
research
ReSS Makes LLMs Explain Tabular Predictions — Without Hallucinating
research
CONCORD Lets Always-On AI Assistants Talk to Each Other Without Spilling Your Secrets
research
WebXSkill Gives Web Agents Reusable Skills That Actually Execute
research
Satellites Don't Know Their Own Rules. This AI Learns Them on the Fly.
research
Your LLM Is Chaotic by Design — And Now We Know Why
research
SciFi Framework Lets LLMs Run Scientific Workflows Solo — Safely
research
Even SOTA LLMs Fumble Basic Explore-Exploit Tradeoffs, Study Finds
research
The ML Community Is Building an AI for Materials Science Syllabus
anthropic
A Single Typo in CLAUDE.md Was Silently Destroying Code Quality
local-llm
llama.cpp b8808 Fixes a Media Marker Bug That Was Quietly Breaking Servers
news
Can a Text Layer Actually Keep AI Behavior Consistent? Probably Not Alone.
anthropic
AI Didn't Lower the Bar for Engineers. It Raised It.
research
Thesis Wants to Be the IDE for AI Agents Running Your ML Experiments
local-llm
llama.cpp b8807 Squeezes More Speed Out of Vulkan GPU Compute
local-llm
Reddit Asks: Why Not Use Mythos to Debug Claude Code?
openai
OpenAI Python SDK Gets Smarter WebSocket Handling in v2.32.0
openai
OpenAI Academy Launches Beginner ChatGPT Guide — Is This Overdue?
local-llm
llama.cpp Adds Q1_0 CUDA Backend — Extreme Quantization Gets GPU Acceleration
openai
OpenAI Skills Let You Package Your ChatGPT Workflows Once, Reuse Forever