BARISTA: Efficient and Scalable Serverless Serving System for Deep Learning Prediction Services | |
---|---|
Author | |
Abstract |
Pre-trained deep learning models are increasingly being used to offer a variety of compute-intensive predictive analytics services such as fitness tracking, speech and image recognition. The stateless and highly parallelizable nature of deep learning models makes them well-suited for serverless computing paradigm. However, making effective resource management decisions for these services is a hard problem due to the dynamic workloads and diverse set of available resource configurations that have their deployment and management costs. |
Year of Publication |
2019
|
Conference Name |
IEEE International Con- ference on Cloud Engineering (IC2E),
|
Date Published |
06/2019
|
Publisher |
IEEE
|
Conference Location |
Prague, Czech Republic
|
URL |
https://doi.org/10.1109%2Fic2e.2019.00-10
|
DOI |
10.1109/ic2e.2019.00-10
|
Google Scholar | BibTeX | XML | DOI |