Show HN: Persistent memory for Claude Code with self-hosted Qdrant and Ollama

I built an MCP server that gives Claude Code long-term memory across sessions, backed by infrastructure you control.

Every Claude Code session starts from zero, with no memory of previous sessions. This server uses mem0ai as a library and exposes 11 MCP tools for storing, searching, and managing memories. Qdrant handles vector storage, Ollama runs embeddings locally (bge-m3), and Neo4j optionally builds a knowledge graph.
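To make the stack concrete, here is a rough sketch of what a mem0-style configuration for this setup could look like. It is not taken from the project: the hosts, ports, collection name, and Neo4j credentials are placeholder assumptions, and the key names follow mem0's config layout as I understand it, so check them against the mem0 docs before relying on them.

```python
def build_mem0_config(enable_graph: bool = False) -> dict:
    """Return a mem0-style config dict: Qdrant + Ollama, optionally Neo4j.

    All hosts, ports, names, and credentials below are illustrative
    placeholders, not values taken from the project.
    """
    config = {
        "vector_store": {
            "provider": "qdrant",
            "config": {
                "host": "localhost",
                "port": 6333,  # Qdrant's default HTTP port
                "collection_name": "claude_code_memories",  # hypothetical name
                "embedding_model_dims": 1024,  # bge-m3 dense vectors are 1024-d
            },
        },
        "embedder": {
            "provider": "ollama",
            "config": {
                "model": "bge-m3",
                "ollama_base_url": "http://localhost:11434",  # Ollama default
            },
        },
    }
    if enable_graph:
        # The knowledge graph is optional, so it is only added on request.
        config["graph_store"] = {
            "provider": "neo4j",
            "config": {
                "url": "bolt://localhost:7687",
                "username": "neo4j",
                "password": "change-me",  # placeholder
            },
        }
    return config
```

With mem0ai installed, a dict like this would typically be handed to `Memory.from_config(...)`. The `llm` section is left out here on purpose, since the post describes its own routing for LLM calls.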

Some engineering details HN might find interesting:

- Zero-config auth: auto-reads Claude Code's OAT token from ~/.claude/.credentials.json, detects token type (OAT vs API key), and configures the SDK accordingly. No separate API key needed.
- Graph LLM ops (3 calls per add_memory) can be routed to Ollama (free/local), Gemini 2.5 Flash Lite (near-free), or a split-model setup in which Gemini handles entity extraction (85.4% accuracy) and Claude handles contradiction detection (100% accuracy).
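For the auth bullet, a minimal sketch of how token detection could work. This is not the project's code. It assumes the credentials file stores the token under `claudeAiOauth.accessToken`, that OAT tokens start with `sk-ant-oat` and API keys with `sk-ant-api`, and that the SDK client takes `auth_token` for the former and `api_key` for the latter. The function names are hypothetical.

```python
import json
from pathlib import Path

# Path named in the post; the JSON layout read below is an assumption.
CREDENTIALS_PATH = Path.home() / ".claude" / ".credentials.json"


def read_token(path=CREDENTIALS_PATH):
    """Return the stored access token, or None if it cannot be read."""
    try:
        data = json.loads(Path(path).read_text(encoding="utf-8"))
    except (OSError, ValueError):  # missing file, bad encoding, or invalid JSON
        return None
    oauth = data.get("claudeAiOauth") if isinstance(data, dict) else None
    token = oauth.get("accessToken") if isinstance(oauth, dict) else None
    return token if isinstance(token, str) and token else None


def classify_token(token):
    """Classify a token by prefix (the prefixes are assumptions)."""
    if token.startswith("sk-ant-oat"):
        return "oat"
    if token.startswith("sk-ant-api"):
        return "api_key"
    return "unknown"


def client_kwargs(token):
    """Pick the SDK client keyword argument for this token type."""
    if classify_token(token) == "oat":
        return {"auth_token": token}  # sent as a bearer token
    # API keys, and anything unrecognized, fall back to api_key.
    return {"api_key": token}
```

The resulting dict would be splatted into the client constructor, e.g. `Anthropic(**client_kwargs(token))`, so the rest of the code never needs to know which kind of credential it got.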

Python, MIT licensed, one-command install via uvx.

https://github.com/elvismdev/mem0-mcp-selfhosted

Summary

The post introduces mem0-mcp-selfhosted, an open-source (MIT-licensed) MCP server that gives Claude Code persistent memory across sessions. It uses mem0ai as a library and runs on infrastructure the user controls: Qdrant for vector storage, Ollama for local embeddings, and optionally Neo4j for a knowledge graph.
