Tes compliance: support multiple executors #427

uniqueg · 2023-09-29T07:53:47Z

Problem:

As per the current TES specification, multiple executors can be specified in a single request:

components:
  ...
  schemas:
    ...
    tesTask:
      ...
      properties:
        ...
        executors:
          type: array
          description: |-
            An array of executors to be run. Each of the executors will run one
            at a time sequentially. Each executor is a different command that
            will be run, and each can utilize a different docker image. But each of
            the executors will see the same mapped inputs and volumes that are declared
            in the parent CreateTask message.

            Execution stops on the first error.
          items:
            $ref: '#/components/schemas/tesExecutor'

Since TES v1.1, the behavior for the case in which one of multiple executors fails can further be modified with the ignore_error property of the tesExecutor schema:

components:
  ...
  schemas:
    ...
    tesExecutor:
      ...
      properties:
        ...
        ignore_error:
          type: boolean
          description: |-
            Default behavior of running an array of executors is that execution
            stops on the first error. If `ignore_error` is `True`, then the
            runner will record error exit codes, but will continue on to the next
            tesExecutor.

As discussed with @MattMcL4475, multiple executors are currently not supported in TES on Azure.

However, some client implementations such as workflow engines may rely on this feature, e.g., for setup and teardown steps or for scheduling two or more closely related tasks with similar requirements to minimize overheads.

Solution:

Implement support for multiple executors as described.
Implement the ignore_error property of the tesExecutor schema as described.

Describe alternatives you've considered
N/A

Code dependencies
Not sure.

Additional context
Appropriate tests for this behavior are currently unavailable (see ga4gh/compliance-tests-ga4gh-tes#2).

The text was updated successfully, but these errors were encountered:

MattMcL4475 · 2024-09-27T16:48:08Z

@adamnovak would implementing this issue be a significant help to implementing TES with Toil? We are doing a round of issue prioritization and your input on this would be helpful, thanks!

adamnovak · 2024-09-27T17:54:11Z

I think that having access to multiple executors would be a big help for allowing Toil to efficiently implement some parts of WDL, such as the glob() function. I think if we could run some Toil code against the same volumes as the user container, and use TES's ability to set up a wildcard match for outputs like output_files/*/*, it would be possible to fully implement WDL without uploading any unnecessary data to shared storage. We might still be able to achieve this without multiple executors, but it would involve either implementing a lot of WDL in Bash or else somehow injecting enough Python to run Toil into the user container.

Without being able to run Toil code against the same volumes, for WDL tasks where the output files need to be dynamically determined, we'd need to upload everything that potentially could be an output to shared storage and then go through it to identify the actual outputs.

uniqueg · 2024-09-28T02:17:35Z

@MattMcL4475: Please also note that, as part of the core TES specs, TES compliant clients will rely on multiple executors to be supported. While so far we are making use of multiple executors to support individual server-side solutions (and so are not affected by TES-on-Azure's non-compliance), there is at least one project that we are working on (supporting Crypt4GH files in TES) that uses a middleware approach relying on multiple executors. But since we believe that the middleware-executor pattern is powerful for supercharging any TES implementation with individual functionalities, I think there will be more of those use cases that TES-on-Azure, without support for multiple executors, would miss out on.

MattMcL4475 · 2024-09-28T17:34:17Z

Thank you @adamnovak and @uniqueg for the feedback, we've made this a top priority and we'll have it implemented and released in 5.4.4 this week. We also noticed that with this change, the repo will pass 100% of the TES 1.1 compliance tests (currently it passes 100% of TES 1.0)! So it will be great to get this done.

Please keep the feedback coming if you see any high-priority features/bugs, thank you!

ngambani added bug Something isn't working enhancement New feature or request Usability Enable TES is easy to use for end users tobegroomed Add this label while creating new issues to get issues prioritized on the backlog Size: small labels Oct 11, 2023

MattMcL4475 added the TES Priority: P3 Groomed to a Priority 3 issue label Dec 11, 2023

BMurri removed the tobegroomed Add this label while creating new issues to get issues prioritized on the backlog label Apr 20, 2024

adamnovak mentioned this issue Aug 7, 2024

Is glob() a real normal function or a special magic thing in WDL 1.0? openwdl/wdl#680

Closed

MattMcL4475 added TES Priority: P1 Groomed to a Priority 1 issue and removed TES Priority: P3 Groomed to a Priority 3 issue labels Sep 27, 2024

BMurri linked a pull request Sep 28, 2024 that will close this issue

Implement multiple executors #790

Draft

MattMcL4475 mentioned this issue Oct 4, 2024

Executor stdout and stderr should be in their own files #793

Closed

vsmalladi mentioned this issue Dec 27, 2024

Full TES 1.1 Spec compliance #7

Open

12 tasks

vsmalladi added this to the next milestone Feb 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tes compliance: support multiple executors #427

Tes compliance: support multiple executors #427

uniqueg commented Sep 29, 2023

MattMcL4475 commented Sep 27, 2024

adamnovak commented Sep 27, 2024

uniqueg commented Sep 28, 2024

MattMcL4475 commented Sep 28, 2024

Tes compliance: support multiple executors #427

Tes compliance: support multiple executors #427

Comments

uniqueg commented Sep 29, 2023

MattMcL4475 commented Sep 27, 2024

adamnovak commented Sep 27, 2024

uniqueg commented Sep 28, 2024

MattMcL4475 commented Sep 28, 2024