We present Micro-AGI, an architecture where multiple specialized LoRA adapters share a single frozen language model and communicate through tensor-level signals rather than natural language. A router network activates different cognitive specialists based on input characteristics. Across five iterations, routing-augmented responses scored up to +1.6 points higher than baseline. We identify the fundamental bottleneck: the LM head compresses rich internal representations into surface-level token probabilities, pointing toward vector-based communication as the necessary next step. Also available in French: Micro-AGI : Intelligence émergente à partir de réseaux de petits modèles de langage
Cros et al. (Sun,) studied this question.