Browse by Person
Razavi, Kamran (2024)
Resource Efficient Inference Serving With SLO Guarantee.
Technische Universität Darmstadt
doi: 10.26083/tuprints-00028615
Dissertation, first publication, publisher's version
Mühlhäuser, Max ; Alexopoulos, Nikolaos ; Gropengießer, Uwe ; Razavi, Kamran ; Wang, Lin
Eds.: Schulte, Stefan ; Koldehofe, Boris (2024)
Towards Democratic Computing.
In: From Multimedia Communications to the Future Internet: Essays Dedicated to Ralf Steinmetz on the Occasion of His Retirement, Edition: 1st
doi: 10.1007/978-3-031-71874-8_17
Book chapter, bibliography
Razavi, Kamran ; Salmani, Mehran ; Mühlhäuser, Max ; Koldehofe, Boris ; Wang, Lin (2024)
A Tale of Two Scales: Reconciling Horizontal and Vertical Scaling for Inference Serving Systems.
doi: 10.48550/arXiv.2407.14843
Report, bibliography
Razavi, Kamran ; Davari Fard, Shayan ; Karlos, George ; Nigade, Vinod ; Mühlhäuser, Max ; Wang, Lin (2024)
NetNN: Neural Intrusion Detection System in Programmable Networks.
doi: 10.48550/arXiv.2406.19990
Report, bibliography
Razavi, Kamran ; Ghafouri, Saeid ; Mühlhäuser, Max ; Jamshidi, Pooyan ; Wang, Lin (2024)
Sponge: Inference Serving with Dynamic SLOs Using In-Place Vertical Scaling.
19th European Conference on Computer Systems (EuroMLSys 2024). Athens, Greece (21.04.-25.04.2024)
doi: 10.1145/3642970.3655833
Conference publication, bibliography
Ghafouri, Saeid ; Razavi, Kamran ; Salmani, Mehran ; Sanaee, Alireza ; Lorido-Botran, Tania ; Wang, Lin ; Doyle, Joseph ; Jamshidi, Pooyan (2024)
IPA: Inference Pipeline Adaptation to achieve high accuracy and cost-efficiency.
In: Journal of Systems Research, 4 (1)
doi: 10.5070/SR34163500
Article, bibliography
Ghafouri, Saeid ; Razavi, Kamran ; Salmani, Mehran ; Sanaee, Alireza ; Lorido-Botran, Tania ; Wang, Lin ; Doyle, Joseph ; Jamshidi, Pooyan (2023)
IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency.
doi: 10.48550/arXiv.2308.12871
Report, bibliography
Salmani, Mehran ; Ghafouri, Saeid ; Sanaee, Alireza ; Razavi, Kamran ; Mühlhäuser, Max ; Doyle, Joseph ; Jamshidi, Pooyan ; Sharifi, Mohsen (2023)
Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems.
3rd Workshop on Machine Learning and Systems. Rome, Italy (08.05.2023-08.05.2023)
doi: 10.1145/3578356.3592578
Conference publication, bibliography
Razavi, Kamran ; Karlos, George ; Nigade, Vinod ; Mühlhäuser, Max ; Wang, Lin (2022)
Distributed DNN Serving in the Network Data Plane.
5th International Workshop on P4 in Europe. Rome, Italy (09.12.2022-09.12.2022)
doi: 10.1145/3565475.3569079
Conference publication, bibliography
Razavi, Kamran ; Luthra, Manisha ; Koldehofe, Boris ; Mühlhäuser, Max ; Wang, Lin (2022)
FA2: Fast, Accurate Autoscaling for Serving Deep Learning Inference with SLA Guarantees.
28th Real-Time and Embedded Technology and Applications Symposium (RTAS 2022). Milano, Italy (04.05.2022-06.05.2022)
doi: 10.1109/RTAS54340.2022.00020
Conference publication, bibliography
Luthra, Manisha ; Hennig, Sebastian ; Razavi, Kamran ; Wang, Lin ; Koldehofe, Boris (2020)
Operator as a Service: Stateful Serverless Complex Event Processing.
9th Workshop on Scalable Cloud Data Management. Virtual conference (10.12.2020-13.12.2020)
Conference publication, bibliography