We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent c772246 commit 9502843Copy full SHA for 9502843
daily-arxiv-llm.md
@@ -1,5 +1,11 @@
1
The paper list will be updated automatically, please do not edit.
2
3
+### 2025-03-10
4
+
5
+* [Optimizing LLM Inference Throughput via Memory-aware and SLA-constrained Dynamic Batching](https://arxiv.org/abs/2503.05248)
6
+* [Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts](https://arxiv.org/abs/2503.05447)
7
8
9
### 2025-03-07
10
11
* [Dynamic Pricing for On-Demand DNN Inference in the Edge-AI Market](https://arxiv.org/abs/2503.04521)
0 commit comments