Threaded Callback with priority, affinity and overrun handler

y-okumura-isp · June 22, 2020, 10:18am

Hello. We are investigating and studying the ROS2 real-time in a team.

In the previous post bellow, we reported about thread priority setting of main-thread and DDS child thread.

In these post we reported that thread proity could affect the jitter of sleep or timing of callback.
But if threads are in dirrefent CPU core, they do not affect each other.

So it seems natural to run callback as thread with specific CPU core and scheduling policy.
In fact, tasks are executed as “thread” in RTOS. By using such a thread, we’ll get

ROS executor thread can be interrupted (i.e. can read topic) even if callbacks are running
easily implement preemption
prevent duplicattion of scheduling procedure in executor and OS, this would make easy to verification
combination with LET sheduler may be also acceptable

And more, we want to detect deadline miss. By splitting callback thread, existing timer mechanism can be applied to overrun handler (timer can be triggered even if callback is running).

So we implemented PoC, toy code to run callbacks as different thread from main thread with specific CPU core and scheduling policy, and detect overrun.
It’s a PoC (proof of concept) implementation so I implement in user-land, not in ROS 2 layer.

Sample scenario

Consider following situation:

There is only one CPU core for simplicity.
There are 3 tasks(callbacks) with different priority, namely TaskA, TaskB, and TaskC.
Tasks are fired by topic, namely there are 3 paris, PubA-SubA, PubB-SubB, PubC-SubC.
Tasks are fired by SubX as thread.
TaskA has the highest priority and shortest task.
TaskB has middle priority and middle task.
TaskC has the lowest priority and longest task.
Namely TaskA shoud run even if TaskB or TaskC is running.

To illustrate this, see figure below.

TaskC fires by topic C. - means TaskC is running.
When topic B comes, TaskB fires and TaskC is stopped. O means TackC is stop
The same is true when topic A comes.
When TaskA is finished(X means this), TaskB runs because TaskB has higer priority than TaskC.
The same When TaskB finished.

(In a nutshell preemption.)

priority
   ^
   | TaskA                        ---X
   | 
   | TaskB                --------O    ------X
   | 
   | TaskC   ------------O                    ------X
   |         ^           ^        ^
   |         |           |        |
   | Topic   C           B        A
   |
   +----------------------------------------------------------> time

In the PoC code, thread priorities are main thread > DDS thread > TaskA thread > TaskB thread > TaskC thread.
Tasks run in the same core.

Implementation

POSIX thread API enables to set scheduling policy, CPU affinity.
I guess ROS 2 is developed with no (or less) OS restriction, so it may not be desirable to use the POSIX API.
But as far as I know, it’s hard to implement preemption without OS level support.

See ThreadedSubscription in https://github.com/y-okumura-isp/ROS2_ThreadedCallback/blob/master/include/threaded_subscriber.hpp.

You can see following in the constructor ThreadedSubscription.

creates callback thread
and sets up thread by pthread_setschedparam and pthread_setaffinity_np

To implement subscription callback and overrun handler, overload on_subscription and on_overrun.

ThreadedSubscription is a helper class, and to use this in node class do following:

Use ThreadedSubscription::create_subscription(rclcpp::Node *node, const std::string & topic, const rclcpp::QoS & qos) when you don’t nedd overrun handler.
Use ThreadedSubscription::create_subscription(rclcpp::Node *node, const std::string & topic, const rclcpp::QoS & qos, std::chrono::duration<DurationRepT, DurationT> overrun_period) when you need overrun handler.

It’s PoC, so I think there is more sophisticated API. I want to discuss the necessity of threaded callback rather than API now.
See README.md for detail.

Discussion

What do you think about a executor which uses threads associated to each callbacks i.e. creates a thread when create_subscription is called.

If you agree and try to implement in ROS layer, we should consider several things:

do we implement such a mechanism in rcl layer or rclcpp layer? As rcl does not have executor, it may be easy to start with rclcpp.
Decide what to do for new topic when the callback is already running. Drop? Delay?
Error handling may be the best in some case. So may developer want to select?
- If we select delay, we may need to consider Executor get_next_ready_executable and wait_for_work relation. We need to clear event flag, but execute subscription lazily.
As Data writer(executor in this situation) and data reader(subscription callback) run in parallel, we need to prevent data from changing in the middle of the callback.
So it may be good for callback thread to read data i.e. call rcl_take.

Questions, suggestions and advice are welcome.

Thank you.

ZhenshengLee · September 12, 2023, 6:30am

Great work!

Good experience for the realtime ros2 developers!

Autostone-c · September 12, 2023, 7:48am

hi, What is the difference between this idea and PiCAS_executor? GitHub - rtenlab/ros2-picas: ROS2-PiCAS source

tomoyafujita · September 12, 2023, 7:27pm

@y-okumura-isp thank you for sharing information!

with quick code scan, this design is similar with rclcpp::experipental::executors::EventsExecutor which already constructed on RMW listener APIs?

from PoC code, i see that main thread subscription just takes (Reactor, Not using RMW listener API yet) the data to the _msg and another thread which is created with user specified thread priority and policy processes the message once it is delivered. (there is no queue in the PoC sample, but i guess that we eventually need it.)

in that case, what is the difference if we use rclcpp::experipental::executors::EventsExecutor assigned with user created threads with specified priority and policy? (i think one of the difference is node object management to add the executor.)

according to thread policy and priority, probably REP-2017 Thread attributes configuration support would be interesting for you.

JRTG · September 12, 2023, 8:31pm

@Autostone-c @tomoyafujita
This post and repository date back to 2020, both PiCAS_executor as well as EventsExecutor and RMW listener API’s are more recent…

@Autostone-c The PiCAS repository seems no longer updated and holds no license information.

Autostone-c · September 14, 2023, 1:45am

If I want to create an executor that satisfies deterministic scheduling, which one can I refer to？anybody has ideas? Do the community have any ideas about this？

ZhenshengLee · September 14, 2023, 6:39am

Try the executor-level-executor provided by bosch, which is in the mainstream since Galactic.

the example is in examples/rclcpp/executors/cbg_executor at rolling · ros2/examples (github.com)

the pr is in executors should be able to operate on callback groups rather than nodes · Issue #519 · ros2/rclcpp (github.com)

tutorial is in ROS2 from the Ground Up: Part 5- Concurrency, Executors and Callback Groups | by Jegathesan Shanmugam | Medium

the paper is in Exploring Real-Time Executor on ROS 2 | IEEE Conference Publication | IEEE Xplore

Topic		Replies	Views
ROS2 generated child thread scheduling policy affects timers Quality Assurance ros2 , raspberrypi , dds , eloquent	2	7645	May 25, 2020
Experiment to inhibit DDS and ROS2 child threads Quality Assurance ros2 , raspberrypi , dds , eloquent	0	4471	May 27, 2020
ROS 2 Real-time Working Group Online Meeting 18 - May 26, 2020 - Meeting Minutes Next Generation ROS wg-real-time	20	3445	September 1, 2020
Avoiding priority inversions (includes draft pull requests) Next Generation ROS real-time	7	1588	January 29, 2023
ROS 2 timer behavior when callbacks take longer than the timer period General	4	2385	August 25, 2023

Threaded Callback with priority, affinity and overrun handler

Sample scenario

Implementation

Discussion

Related topics