14-stage Fusion Pipeline for LLM token compression — reversible compression, AST-aware code analysis, intelligent content routing. Zero LLM inference cost. MIT licensed. (Python; updated Mar 20, 2026)
[TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198
📚 Collection of token-level model compression resources.
The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs
Token-Oriented Object Notation - A compact data format for reducing token consumption when sending structured data to LLMs (PHP implementation)
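The core idea behind compact formats like Token-Oriented Object Notation is to avoid repeating structural overhead (keys, braces, quotes) for every object when sending uniform records to an LLM. A minimal sketch of that idea in Python — this is an illustration of the key-deduplication principle, not the official TOON grammar, and `compact_encode` is a hypothetical helper name:

```python
import json

def compact_encode(rows):
    """Encode a list of uniform dicts as one header line plus value rows.

    Illustrative sketch only (NOT the official TOON spec): the keys are
    stated once in a header instead of being repeated per object, which
    is what cuts the token count versus plain JSON.
    """
    if not rows:
        return ""
    keys = list(rows[0])
    lines = [",".join(keys)]  # header written once
    for row in rows:
        lines.append(",".join(str(row[k]) for k in keys))
    return "\n".join(lines)

data = [{"id": 1, "name": "ada"}, {"id": 2, "name": "lin"}]
compact = compact_encode(data)
verbose = json.dumps(data)
# The compact form repeats no keys, so it is shorter than the JSON form.
assert len(compact) < len(verbose)
```

For arrays of uniform objects the savings grow with the number of rows, since the per-object key overhead is paid exactly once.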
Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"
You say it. AutoCode builds it. 38 professional skills, persistent memory, 60%+ dev cost savings. Zero dependencies. Free forever.
[CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding
[ICLR 2026 Oral] FlashVID: Efficient Video Large Language Models via Training-free Tree-based Spatiotemporal Token Merging
[ICLR 2026] Official code repository for "⚡️VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration"
[ICLR 2026] MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding
😎 Awesome papers on token redundancy reduction
AI gateway with token compression for Claude Code, Codex, and more
This repo integrates DyCoke's token compression method with VLMs such as Gemma3 and InternVL3
[ICLR 2026] Official code of PPE: Positional Preservation Embedding for Token Compression in Multimodal Large Language Models.
Rust Local Token Compression Proxy for coding agents, built solo for GenAI Genesis 2026. 🏆 1st Google Sustainability Hack
Official implementation of TCSVT 2025 paper: DiViCo: Disentangled Visual Token Compression For Efficient Large Vision-Language Model
[Arxiv 2025 Preprint] HiPrune, a training-free visual token pruning method for VLM acceleration.
Hardened Docker container & Compose setup for openclaw
⚡ Compress Claude Code context by 60-90%. Six noise filters RTK doesn't have.