`docs/source/publication_list.md` (+4 −3)
```diff
@@ -1,8 +1,9 @@
-Full Publications/Events (88)
+Full Publications/Events (89)
 ==========
-## 2025 (2)
+## 2025 (3)
+* arXiv: [Faster Inference of LLMs using FP8 on the Intel Gaudi](https://arxiv.org/abs/2503.09975) (Mar 2025)
+* PyTorch landscape: [PyTorch general optimizations](https://landscape.pytorch.org/) (Mar 2025)
 * Blog on SqueezeBits: [[Intel Gaudi]#4. FP8 Quantization](https://blog.squeezebits.com/intel-gaudi-4-fp8-quantization--40269) (Jan 2025)
-* PyTorch landscape: [PyTorch general optimizations](https://landscape.pytorch.org/) (March 2025)
 
 ## 2024 (7)
 * Blog by Microsoft: [Phi-4 quantization and inference speedup](https://techcommunity.microsoft.com/blog/machinelearningblog/phi-4-quantization-and-inference-speedup/4360047) (Dec 2024)
```