GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection (CAT & Meta & UTA & CMU 2024)
17 Mar 2024