
Mac Studio Clusters Now Run Trillion-Parameter Models for $40K
macOS RDMA over Thunderbolt 5 has turned four Mac Studios into a 1.5TB unified memory cluster that runs Kimi K2 at 25 tokens per second - a setup that would cost $780K with NVIDIA H100s.

Anthropic's claude.com/import-memory page walks users through a two-step process to transfer ChatGPT, Gemini, or any chatbot's stored memories into Claude - no data loss, no starting over.

Turkish AI company Codeway left Firebase and Google Cloud Storage wide open, exposing 300 million chat messages from 25 million users and 8.27 million photos and videos across two apps. Over 12 TB of user data leaked.

Chat & Ask AI, a popular chatbot wrapper app with 50 million users, left its Firebase database wide open - exposing 300 million messages including suicide discussions, drug recipes, and medical conversations to anyone who knew where to look.

Researchers claim to have extracted 53MB of TypeScript source maps from Persona's FedRAMP-authorized government endpoint, revealing the inner workings of the identity verification platform used by OpenAI and federal agencies.

Compare the best tools for running large language models locally: Ollama, LM Studio, llama.cpp, GPT4All, and LocalAI. Includes hardware requirements and model recommendations.