HuggingFace Safetensors Support in PyTorch Distributed Checkpointing Blog HuggingFace Safetensors Support in PyTorch Distributed Checkpointing Summary PyTorch Distributed Checkpointing (DCP) is making investments into addressing the interoperability blockers to ensure…Ankita George, Saurabh Mishra, Joe Cummings, Philip Bontrager, Daulet Askarov, Teja Rao, Chien-Chin Huang, Ela Krepska, Jafar TaghiyarJune 6, 2025
Introducing the PyTorch Ecosystem Working Group and Project Spotlights Blog Introducing the PyTorch Ecosystem Working Group and Project Spotlights The PyTorch Ecosystem goes back several years, with some of its earliest projects like Hugging…PyTorch Ecosystem Working GroupJune 5, 2025
Open Source AI is Transforming the Economy—Here’s What the Data Shows Blog Open Source AI is Transforming the Economy—Here’s What the Data Shows Blog cross-posted on the Linux Foundation blog. As we approach the midpoint of 2025, the…Frank Nagle, Assistant Professor in the Strategy Unit at Harvard Business School and Advising Chief Economist at the Linux FoundationJune 4, 2025
Build Responsible AI Products with your own Yellow Teaming LLM Blog Build Responsible AI Products with your own Yellow Teaming LLM The tools we use to build AI are evolving fast, with PyTorch at the heart…Zach Lasiuk, Principal Solutions Designer, ArmJune 4, 2025
PyTorch Hangzhou Meetup Recap: Exploring the AI Open Source Ecosystem and Cutting-Edge Technology Practices Blog PyTorch Hangzhou Meetup Recap: Exploring the AI Open Source Ecosystem and Cutting-Edge Technology Practices On May 17, the PyTorch Meetup was successfully held in Hangzhou, drawing nearly 60 developers…PyTorch FoundationMay 27, 2025
Accelerating GPU Performance with Triton: April 30th PyTorch ATX Event Community Accelerating GPU Performance with Triton: April 30th PyTorch ATX Event The PyTorch ATX Triton event, sponsored by Red Hat, was held on April 30, 2025,…Jason Meaux, ATX PyTorch Leader Stephen Watt, VP and Distinguished Engineer, Red HatMay 20, 2025
PyTorch/XLA 2.7 Release Usability, vLLM boosts, JAX bridge, GPU Build Blog PyTorch/XLA 2.7 Release Usability, vLLM boosts, JAX bridge, GPU Build PyTorch/XLA is a Python package that uses the XLA deep learning compiler to enable PyTorch…Pei Zhang, Chris JonesMay 13, 2025
MetaShuffling: Accelerating Llama 4 MoE Inference Blog MetaShuffling: Accelerating Llama 4 MoE Inference Mixture-of-Experts (MoE) is a popular model architecture for large language models (LLMs). Although it reduces…Shikai Li, Gefei Zuo, Jianyu Huang, Jason Park, Zoey Sun, Xiaozhu Meng, Xiaodong Wang, Hongtao Yu, Changkyu Kim, CQ Tang, Stephen ChenMay 12, 2025
PyTorch: The Open Language of AI Blog PyTorch: The Open Language of AI Key takeaways: PyTorch today powers the generative AI world with major AI players like Meta,…Joe Spisak (Meta), Luca Antiga (Lightning.AI)May 7, 2025
Recap of the PyTorch Korea User Group Meetup: A Technical Conference with a PyTorch Core Maintainer Blog Recap of the PyTorch Korea User Group Meetup: A Technical Conference with a PyTorch Core Maintainer At the end of March, the PyTorch Korea User Group hosted a special meetup that…Jiho Kim, PyTorch Korea User GroupMay 5, 2025
FlexAttention Part II: FlexAttention for Inference Blog FlexAttention Part II: FlexAttention for Inference Overview In PyTorch 2.5.0 release, we introduced FlexAttention torch.nn.attention.flex_attention for ML researchers who’d like to…Joy Dong, Boyuan Feng, Driss Guessous, Joel Schlosser, Yanbo Liang, Horace HeApril 30, 2025
6x faster Async Checkpointing in PyTorch, using Cached Plans, no GIL contention Blog 6x faster Async Checkpointing in PyTorch, using Cached Plans, no GIL contention Meta: Less Wright, Meet Vadakkanchery, Saurabh Mishra, Ela Krepska, Hamid Shojanazeri, Pradeep Fernando Crusoe: Ethan…Meta and CrusoeApril 30, 2025
Accelerating Large Scale Training and Convergence with PyTorch Float8 Rowwise on Crusoe 2K H200s Blog Accelerating Large Scale Training and Convergence with PyTorch Float8 Rowwise on Crusoe 2K H200s Meta: Less Wright, Hamid Shojanazeri, Vasiliy Kuznetsov, Daniel Vega-Myhre, Gokul Nadathur, Will Constable, Tianyu Liu,…Meta and CrusoeApril 28, 2025
Accelerate PyTorch 2.7 on Intel® GPUs Blog Accelerate PyTorch 2.7 on Intel® GPUs PyTorch 2.7 continues to deliver significant functionality and performance enhancements on Intel® GPU architectures to streamline…Intel PyTorch TeamApril 25, 2025
PyTorch 2.7 Release Blog PyTorch 2.7 Release We are excited to announce the release of PyTorch® 2.7 (release notes)! This release features:…PyTorch TeamApril 23, 2025
Accelerating Whisper on Arm with PyTorch and Hugging Face Transformers Blog Accelerating Whisper on Arm with PyTorch and Hugging Face Transformers Automatic speech recognition (ASR) has revolutionized how we interact with technology, clearing the way for…Pareena Verma, ArmApril 8, 2025
SGLang Joins PyTorch Ecosystem: Efficient LLM Serving Engine Community SGLang Joins PyTorch Ecosystem: Efficient LLM Serving Engine We’re thrilled to announce that the SGLang project has been integrated into the PyTorch ecosystem!…SGLang TeamMarch 19, 2025
PyTorch Day China 2025 Call for Proposals Open Blog PyTorch Day China 2025 Call for Proposals Open We’re excited to announce the first-ever PyTorch Day China! This new event, hosted by the PyTorch Foundation,…PyTorch FoundationMarch 19, 2025
PyTorch at GTC 2025 Community PyTorch at GTC 2025 GTC is coming back to San Jose on March 17–21, 2025. Join PyTorch Foundation members Arm,…PyTorch FoundationMarch 16, 2025
Scaling Recommendation Systems Training to Thousands of GPUs with 2D Sparse Parallelism Blog Scaling Recommendation Systems Training to Thousands of GPUs with 2D Sparse Parallelism At Meta, recommendation systems are the cornerstone of delivering relevant and personalized ads to billions…PyTorch Team at Meta: Chunzhi Yang, Rich Zhu, Zain Huda, Liangbei Xu, Xin Zhang, Jiyan Yang, Dennis van der Staay, Wang Zhou, Jin Fang, Jade Nie, Yuxi HuMarch 11, 2025