Publications
2024
2023
- Hyena Hierarchy: Towards Larger Convolutional Language Models. In International Conference on Machine Learning (ICML), 2023
- Simple Hardware-Efficient Long Convolutions for Sequence Modeling. In International Conference on Machine Learning (ICML), 2023
- Hungry Hungry Hippos: Towards Language Modeling with State Space Models. In The International Conference on Learning Representations (ICLR), 2023
2022
- Decentralized Training of Foundation Models in Heterogeneous Environments. In Advances in Neural Information Processing Systems, 2022
- Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees. In Advances in Neural Information Processing Systems, 2022
- Transform Once: Efficient Operator Learning in Frequency Domain. In Advances in Neural Information Processing Systems, 2022
- S4ND: Modeling Images and Videos as Multidimensional Signals with State Spaces. In Advances in Neural Information Processing Systems, 2022
- ButterflyFlow: Building Invertible Layers with Butterfly Matrices. In International Conference on Machine Learning (ICML), 2022
- Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models. In International Conference on Learning Representations (ICLR), 2022
2021
- Scatterbrain: Unifying Sparse and Low-rank Attention. In Advances in Neural Information Processing Systems (NeurIPS), 2021
- Combining Recurrent, Convolutional, and Continuous-time Models with Linear State Space Layers. In Advances in Neural Information Processing Systems, 2021
- Rethinking Neural Operations for Diverse Tasks. In Advances in Neural Information Processing Systems, 2021
- Catformer: Designing Stable Transformers via Sensitivity Analysis. In International Conference on Machine Learning (ICML), 2021
- Knowledge Distillation as Semiparametric Inference. In International Conference on Learning Representations (ICLR), 2021
- MONGOOSE: A Learnable LSH Framework for Efficient Neural Network Training. In International Conference on Learning Representations (ICLR), 2021
2020
- Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps. In International Conference on Learning Representations (ICLR), 2020
- HiPPO: Recurrent Memory with Optimal Polynomial Projections. In Advances in Neural Information Processing Systems (NeurIPS), 2020
2019
- On the Downstream Performance of Compressed Word Embeddings. In Advances in Neural Information Processing Systems (NeurIPS) 32, 2019
- Approximating the Permanent by Sampling from Adaptive Partitions. In Advances in Neural Information Processing Systems (NeurIPS) 32, 2019
- Adaptive Hashing for Model Counting. In Proceedings of the 35th Conference on Uncertainty in Artificial Intelligence (UAI), 2019
- Learning Fast Algorithms for Linear Transforms Using Butterfly Factorizations. In The International Conference on Machine Learning (ICML) 36, 2019
- A Kernel Theory of Modern Data Augmentation. In The International Conference on Machine Learning (ICML) 36, 2019
- Low-Precision Random Fourier Features for Memory-Constrained Kernel Approximation. In The International Conference on Artificial Intelligence and Statistics (AISTATS) 22, 2019
2018
- Learning Compressed Transforms with Low Displacement Rank. In Advances in Neural Information Processing Systems (NeurIPS) 31, 2018
2017
- Gaussian Quadrature for Kernel Features. In Advances in Neural Information Processing Systems (NeurIPS) 30, 2017