Load-Balancing for Improving User Responsiveness on Multicore Embedded Systems

by Geunsik Lim, et al.

Most commercial embedded devices have been deployed with a single-processor architecture. The code size and complexity of applications running on embedded devices are rapidly increasing due to the emergence of application business models such as Google Play Store and Apple App Store. As a result, high-performance multicore CPUs have become a major trend in the embedded market as well as in the personal computer market. Due to this trend, many device manufacturers have been able to adopt more attractive user interfaces and high-performance applications for better user experiences on multicore systems. In this paper, we describe how to improve real-time performance by reducing user waiting time on multicore systems that use a partitioned per-CPU run queue scheduling technique. Rather than focusing on a naive load-balancing scheme that aims for equally balanced CPU usage, our approach tries to minimize the cost of task migration by considering the importance level of running tasks, and to optimize per-CPU utilization on multicore embedded systems. Consequently, our approach improves real-time characteristics such as cache efficiency, user responsiveness, and latency. Experimental results under heavy background stress show that our approach reduces the average scheduling latency of an urgent task by a factor of 2.3.
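
The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea it describes: per-CPU run queues where migration is avoided unless the imbalance is large, and tasks marked as important are never moved. All names (`Task`, `RunQueue`, `balance`, the `urgent` flag, the `threshold` value) are hypothetical illustrations, not the authors' implementation or any kernel API.

```python
from dataclasses import dataclass, field


@dataclass
class Task:
    name: str
    load: int             # hypothetical load weight of the task
    urgent: bool = False  # hypothetical importance flag


@dataclass
class RunQueue:
    cpu: int
    tasks: list = field(default_factory=list)

    def load(self) -> int:
        # Total load currently queued on this CPU.
        return sum(t.load for t in self.tasks)


def balance(queues, threshold=20):
    """Move at most one non-urgent task from the busiest to the idlest queue.

    Assumption: migration is skipped entirely when the load gap is at or
    below `threshold`, so small imbalances do not pay the migration cost.
    Returns (task name, source CPU, destination CPU) or None.
    """
    busiest = max(queues, key=RunQueue.load)
    idlest = min(queues, key=RunQueue.load)
    gap = busiest.load() - idlest.load()
    if gap <= threshold:
        return None

    # Urgent tasks stay where they are; only tasks whose move narrows
    # the gap (load smaller than the gap) are candidates.
    candidates = [t for t in busiest.tasks if not t.urgent and t.load < gap]
    if not candidates:
        return None

    # Pick the candidate that leaves the two queues closest in load.
    task = min(candidates, key=lambda t: abs(gap - 2 * t.load))
    busiest.tasks.remove(task)
    idlest.tasks.append(task)
    return (task.name, busiest.cpu, idlest.cpu)


queues = [
    RunQueue(0, [Task("ui", 50, urgent=True), Task("bg1", 30), Task("bg2", 10)]),
    RunQueue(1, [Task("bg3", 20)]),
]
first = balance(queues)   # gap is 70, so one background task moves
second = balance(queues)  # gap is now 10, so nothing moves
```

In this toy run the urgent `ui` task keeps its CPU (and, on real hardware, its warm cache), while a single background task is migrated and the second pass leaves the queues alone because the remaining gap is under the threshold.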







Supporting Parallelism in Server-based Multiprocessor Systems

Developing an efficient server-based real-time scheduling solution that ...

User-Level Memory Scheduler for Optimizing Application Performance in NUMA-Based Multicore Systems

Multicore CPU architectures have been established as a structure for gen...

Migrate when necessary: toward partitioned reclaiming for soft real-time tasks

This paper presents a new strategy for scheduling soft real-time tasks o...

Practical Scheduling for Real-World Serverless Computing

Serverless computing has seen rapid growth due to the ease-of-use and co...

OpenCL Performance Prediction using Architecture-Independent Features

OpenCL is an attractive model for heterogeneous high-performance computi...

Controlled Owicki-Gries Concurrency: Reasoning about the Preemptible eChronos Embedded Operating System

We introduce a controlled concurrency framework, derived from the Owicki...

A Hardware Time Manager Implementation for the Xenomai Real-Time Kernel of Embedded Linux

Nowadays, the use of embedded operating systems in different embedded pr...