Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit | AI Center Türkiye