Throughput Maximization of Delay Aware DNN Inference in Edge Computing by Exploring DNN Model Partitioning and Inference Parallelism
Throughput Maximization of Delay Aware DNN Inference in Edge Computing by Exploring DNN Model Partitioning and Inference Parallelism
Throughput Maximization of Delay Aware DNN Inference in Edge Computing by Exploring DNN Model Partitioning and Inference Parallelism