Browse by Person
Article
Ghafouri, Saeid ; Razavi, Kamran ; Salmani, Mehran ; Sanaee, Alireza ; Lorido-Botran, Tania ; Wang, Lin ; Doyle, Joseph ; Jamshidi, Pooyan (2024)
IPA: Inference Pipeline Adaptation to achieve high accuracy and cost-efficiency.
In: Journal of Systems Research, 4 (1)
doi: 10.5070/SR34163500
Article, Bibliography
Conference Publication
Salmani, Mehran ; Ghafouri, Saeid ; Sanaee, Alireza ; Razavi, Kamran ; Mühlhäuser, Max ; Doyle, Joseph ; Jamshidi, Pooyan ; Sharifi, Mohsen (2023)
Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems.
3rd Workshop on Machine Learning and Systems. Rome, Italy (8 May 2023)
doi: 10.1145/3578356.3592578
Conference Publication, Bibliography
Report
Razavi, Kamran ; Salmani, Mehran ; Mühlhäuser, Max ; Koldehofe, Boris ; Wang, Lin (2024)
A Tale of Two Scales: Reconciling Horizontal and Vertical Scaling for Inference Serving Systems.
doi: 10.48550/arXiv.2407.14843
Report, Bibliography
Ghafouri, Saeid ; Razavi, Kamran ; Salmani, Mehran ; Sanaee, Alireza ; Lorido-Botran, Tania ; Wang, Lin ; Doyle, Joseph ; Jamshidi, Pooyan (2023)
IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency.
doi: 10.48550/arXiv.2308.12871
Report, Bibliography