← Back to News

Tags: Ollama

Ollama Adds Apple MLX Support, Boosts Mac Model Performance

Ollama Adds Apple MLX Support, Boosts Mac Model Performance
Ollama, a runtime for running large language models locally, announced preview support for Apple’s open‑source MLX framework and added Nvidia’s NVFP4 compression format. The update targets Apple Silicon Macs, requiring at least 32 GB of RAM, and currently supports Alibaba’s 35‑billion‑parameter Qwen 3.5 model. These changes aim to improve caching, memory efficiency, and overall speed, aligning with growing interest in running AI models on personal machines amid frustrations with cloud‑based rate limits and subscription costs. Read more

Hundreds of Ollama LLM Servers Exposed Online, Raising Cybersecurity Concerns

Hundreds of Ollama LLM Servers Exposed Online, Raising Cybersecurity Concerns
Cisco Talos identified more than 1,100 Ollama servers publicly reachable on the internet, many of which lack proper security controls. While roughly 80% of the servers are dormant, the remaining 20% host active language models that could be exploited for model extraction, jailbreaking, backdoor injection, and other attacks. The majority of exposed instances are located in the United States, followed by China and Germany, underscoring a widespread neglect of basic security practices such as access control and network isolation in AI deployments. Read more