Title Institute Date Video Slides
Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference ICML 2024 25 Jul 2024 Link Link
  McGill NLP 22 Apr 2024    
  ServiceNow 4 Apr 2024    
  University of Edinburgh 4 Apr 2024    
Efficient Transformers with Dynamic Token Pooling ACL 2023 11 Jul 2023 Link Link
  University of Cambridge (LTL) 2 Mar 2023    
  University of Edinburgh 7 Nov 2022    
Hierarchical Transformers Are More Efficient Language Models Harvard CMSA 8 Dec 2021 Link Link
  DeepMind 12 Nov 2021    
  ML in PL 2021 Conference 5 Nov 2021