MIRA Lab
MIRA Lab
People
Publications
News
Courses
Courses Fall 2025
Courses Fall 2024
Courses Fall 2023
Courses Fall 2022
Courses Fall 2021
Courses Spring 2021
Courses Spring 2020
Admission
Admission 2025 保研
Photos
Social Media
Contact Us
AttentionPredictor: Temporal Patterns Matter for KV Cache Compression
December 2025
Qingyue Yang
,
Jie Wang
,
Xing Li
,
Zhihai Wang
,
Chen Chen
,
Lei Chen
,
Xianzhi Yu
,
Wulong Liu
,
Jianye Hao
,
Mingxuan Yuan
,
Bin Li
Type
Conference paper
Publication
The Thirty-Ninth Annual Conference on Neural Information Processing Systems
Related
Towards Next-Generation Logic Synthesis: A Scalable Neural Circuit Generation Framework
Why Attention Patterns Exist: A Unifying Temporal Perspective Analysis
A Circuit Domain Generalization Framework for Efficient Logic Synthesis in Chip Design
Computing Circuits Optimization via Model-Based Circuit Genetic Evolution
Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms
Cite
×