Rewrite the old proxying API in terms of the new API #15880
Conversation
Implement a new proxying API that is meant to be suitable both for proxying in syscall implementations and for proxying arbitrary user work. Since the system proxying queue is processed from so many locations, it is dangerous to use it for arbitrary work that might take a lock or otherwise block, and work sent to it has to be structured more like native signal handlers. To avoid this limitation, the new API allows users to create their own proxying queues that are processed only when the target thread returns to the JS event loop or when the user explicitly requests processing.

In contrast to the existing proxying API, this new API:

- Never drops tasks (except in the case of allocation failure). It grows the task queues as necessary instead.
- Does not attempt to dynamically type or dispatch queued functions, but rather uses statically typed function pointers that take `void*` arguments. This simplifies both the API and the implementation. Packing of varargs into dynamically typed function wrappers could easily be layered on top of this API.
- Is less redundant. There is only one way to proxy work synchronously or asynchronously to any thread.
- Is more general. It allows waiting for a task to be explicitly signaled as done, in addition to waiting for the proxied function to return.
- Uses arrays instead of linked lists for better data locality.
- Has a more uniform naming convention.

A follow-up PR will reimplement the existing proxying API in terms of this new API.
Rewrite the old threading.h proxying API used for internal system call implementations in terms of the new proxying API introduced in #15737.
system/lib/pthread/library_pthread.c
Outdated
    args->target_thread = target_thread;
    args->q = q;
    emscripten_set_timeout(dispatch_to_thread_helper, 0, args);
    emscripten_set_timeout(_do_call, 0, args);
Won't `args` (the va_list) be invalid once the timeout fires? i.e. isn't this going to be a use-after-free?
Ah yes, and `_do_call` expects an `em_queued_call*`, not a `va_list`. `args` should be `q` here.
postMessage({'cmd' : 'processQueuedMainThreadWork'});
_emscripten_notify_proxying_queue: function(targetThreadId, currThreadId, mainThreadId, queue) {
  if (targetThreadId == currThreadId) {
    setTimeout(function() { _emscripten_proxy_execute_queue(queue); });
Why turn this into a setTimeout?
Actually, this new `_emscripten_notify_proxying_queue` is a duplicate of the one that already exists just below. I'm not sure how I ended up with the duplicate function, but I'll remove it. The diff here should simply be a removal of the old `_emscripten_notify_thread_queue`.
The reason this is a `setTimeout` is so that we process the queue after returning to the event loop, similar to how queues on other threads will only be processed once they return to their event loops.
Interesting.. I didn't see that the condition here changed from `(targetThreadId == mainThreadId)` to `(targetThreadId == currThreadId)`.

How was the `(targetThreadId == currThreadId)` case handled in the old code? It looks like it was not handled specially and
The old code eagerly executed the proxied work if the target thread was the same as the current thread, so `_emscripten_notify_thread_queue` was never called in that case. The new API is more flexible and does not force eager evaluation in that case, so the new JS code has to handle it.
I'm curious to see how the pthread code size tests are affected. Can you run
The only difference shown by running that command before and after is on tests/other/metadce/minimal_main_Oz_USE_PTHREADS_PROXY_TO_PTHREAD. The JS size goes from 32374 to 32392 (+18 bytes, +0.06%). The wasm size goes from 17053 to 18539 (+1486 bytes, +8.7%).
@sbc100 any other comments before we land this?
The 8.7% rise in the size of the basic pthread runtime is a little worrying... Admittedly we have not been very focused on reducing this, but I would like that to change. Do you have a sense of where that might be coming from?
system/lib/pthread/library_pthread.c
Outdated
@@ -644,7 +505,18 @@ void emscripten_main_thread_process_queued_calls() {
  if (!_emscripten_allow_main_runtime_queued_calls)
    return;

  // Recursion guard to avoid infinite recursion when we arrive here from the
  // pthread_lock calls inside `emscripten_proxy_execute_queue`. This isn't
  // caught by the queue's own recursion guard because the lock has to be
"This isn't caught by the queue's own recursion guard because the lock has to be
acquired before that recursion guard can be checked"
This seems a little worrying, can we so something special for this internal lock to avoid the recursion.
Protecting against it here seems like treating the symptom rather than the cause.. for example what happens if a user calls emscripten_current_thread_process_queued_calls
directly on the main thread instead of going via this wrapper?
I moved the recursion guard into `emscripten_proxy_execute_queue`, so at least it's more robust now and avoids the problem you identified.
if (Module['_pthread_self']()) { // If this thread is actually running?
  Module['_emscripten_current_thread_process_queued_calls']();
  Module['_emscripten_proxy_execute_queue'](e.data.queue);
Do we need to pass the queue pointer around or can we just assume it's the system queue here?
Yes, we need to pass the queue pointer around because this is the same code path used for both the system queue and user queues. Actually, I see that this ended up duplicated somehow as well. Will fix.
// TODO: Must post message to main Emscripten thread in PROXY_TO_WORKER mode.
_emscripten_main_thread_process_queued_calls();
_emscripten_proxy_execute_queue(d['queue']);
Could we save a little JS code size here by passing zero args and having `_emscripten_proxy_execute_queue` assume that NULL == system queue? That would save the extra postMessage packing/unpacking too.
Yes, potentially, but I would prefer to make it a clear-cut error for NULL to ever be passed to `emscripten_proxy_execute_queue`. Better for users to get an assertion failure (in debug builds) when they do that by accident than for the system queue to be executed.
I assume this is basically just because the new queue implementation is larger than the old queue implementation, and I assume that is because the new queues are more general than the old queues, for example being able to dynamically grow. Using vectors rather than linked lists for the top-level set of task queues might also increase code size.

Edit: Using the pthread_mutex and pthread_cond APIs probably also has some overhead over just using emscripten_futex_wait.