Compression

Zlab Princeton researchers have released llm-pruning collectionA JAX based repository that consolidates the major pruning algorithms for large language models into a single, reproducible framework. This targets a concrete target, …

llm-pruning repository: a JAX based repo for structured and unstructured LLM compression

Apple Researchers Release CLaRa: A Continuous Latent Logic Framework for Compression-Native RAG with 16x–128x Semantic Document Compression