Blog – PyTorch
August 22, 2025ai_discoveryinfo
DRAMA Model Inference Efficiency Boosted by 1.7x-2.3x Blog DRAMA Model Inference Efficiency Boosted by 1.7x-2.3x TL;DR NJTs (Nested Jagged Tensors) boost DRAMA model inference efficiency by 1.7x-2.3x, making it more…Shreya GoyalAugust 22, 2025 ZenFlow: Stall-Free Offloading Engine for LLM Training Blog ZenFlow: Stall-Free Offloading Engine for LLM Training Introduction ZenFlow is a new extension to DeepSpeed introduced in summer 2025, designed as a…Tingfeng Lan, Yusen Wu, Bin Ma, Zhaoyuan Su, Ru
Read more →