Restructure Admin TaskList commands to operate on multiple types #6712

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Merged

natemort merged 1 commit into cadence-workflow:master from natemort:tlt

Mar 6, 2025

Member

natemort commented Mar 5, 2025 •

edited

Loading

While Activity/Decision TaskLists with a given name are independent, they typically represent a common purpose and we have never operated on just a single type. Handling them entirely separately within the admin commands doubles the amount of commands needed to run to operate on the TaskList and makes it harder to understand whether a given TaskList name is used for decision and/or activity tasks. Particularly as we move TaskList partition data from dynamic config to the DB it's important that we provide good operator ergonomics to avoid errors.

What changed?
Update the Admin TaskList commands with the following:

The absence of the tasklisttype flag now indicates both Activity and Decision. It previously was inconsistent between defaulting to Decision or being required.
The describe command will output one row per type specified rather than a single row.
The describe command will output that a TaskList has 1 read/write partition rather than no output if there is no config.
The describe command will no longer output the list of pollers and instead print out the number of pollers. This functionality is still available via the non-admin version of the command, which does nothing other than printing the pollers.
The describe command now supports json as an output format.
The list command will consolidate Activity and Decision TaskLists with a given name to a single row, including separate columns for the number of decision and activity pollers.
The list command now sorts its output by TaskList name.
The list command now supports json as an output format.
The update-partition command now operates on multiple TaskListTypes at once rather than Decision or Activity. It performs a safety check on both before performing any updates.

Why?

Simplify operating on TaskLists

How did you test it?

Unit tests
Manual testing against a local Cadence instance

Potential risks

This is a breaking change to CLI users of the admin tasklist commands. Their usage is likely rather low, with the commands to update partition data being newly added.

Release notes

Documentation Changes

natemort requested review from Shaddoll, neil-xie, davidporter-id-au, Groxx, shijiesheng, jakobht, 3vilhamster, sankari165, dkrotx, taylanisikdemir and demirkayaender as code owners

March 5, 2025 22:36

davidporter-id-au reviewed

View reviewed changes

tools/cli/admin_task_list_commands.go Outdated

+              		prettyPrintJSONObject(getDeps(c).Output(), response)
+              		return nil
+              	}
               	fmt.Println("Task Lists for domain " + domain + ":")

Member

davidporter-id-au Mar 5, 2025

Not for this diff, but mixing print statements with the stdout writer is :sadpanda:

Member Author

natemort Mar 6, 2025

Done, good call.

davidporter-id-au approved these changes

View reviewed changes

davidporter-id-au approved these changes

View reviewed changes

Groxx reviewed

View reviewed changes

tools/cli/admin_task_list_commands.go Outdated

+              	} else if c.String(FlagTaskListType) == "" {
+              		taskListTypes = []types.TaskListType{types.TaskListTypeActivity, types.TaskListTypeDecision}
+              	} else {
+              		return nil, commoncli.Problem("Invalid task list type: valid types are [activity, decision]", nil)

Member

Groxx Mar 6, 2025

Suggested change

      
            		return nil, commoncli.Problem("Invalid task list type: valid types are [activity, decision]", nil)
          
            		return nil, commoncli.Problem("Invalid task list type: valid types are 'activity', 'decision', or empty (both)", nil)

Member Author

natemort Mar 6, 2025

Done.

Groxx reviewed

View reviewed changes

tools/cli/admin_task_list_commands.go Outdated

Comment on lines 283 to 284

		msg := fmt.Sprintf("Successfully updated %s:%s", tl.Name, tlType)
		_, err = getDeps(c).Output().Write([]byte(msg))

Member

Groxx Mar 6, 2025 •

edited

Loading

should be able to

fmt.Fprintf(getDeps(c).Output(), "Successfully updated %s:%s", tl.Name, tlType)

and I think errcheck won't complain about the fprintf err. but this is just an FYI / if-convenient-simplification, no need to change imo

Member Author

natemort Mar 6, 2025

That reads better, switched to that. Goland at least complains about it, so I'll add the explicit ignore.

Groxx reviewed

View reviewed changes

tools/cli/admin_task_list_commands.go

+              		if len(errors) > 0 {
+              			return commoncli.Problem("Potentially unsafe operation. Specify '--force' to proceed anyway", multierr.Combine(errors...))
+              		}
+              		if !hasPollers {

Member

Groxx Mar 6, 2025

do we want to silently allow changing config when specifying more than one type, but only one has pollers? kinda seems like a "warn first" scenario to me.

unless I'm misreading this?

Member Author

natemort Mar 6, 2025 •

edited

Loading

What you're describing is correct.

We pretty frequently partition a tasklist for both types despite only a single type being in use. Looking at our DynamicConfig we've never once partitioned a TaskList and used the TaskListType as one of the filters. So I think it provides better continuity and doesn't really create any problems for the system.

The intention behind this check is to catch cases where we specify the incorrect name. We recently had this happen with a large TaskList that we intended to change to 10 partitions but we partitioned task_list_name instead of task-list-name which caused some really bad hotsharding during the next monthly burst.

Member

Groxx Mar 6, 2025

yea, definitely useful either way. and makes sense if it's kinda common 👍

Groxx reviewed

View reviewed changes

tools/cli/admin_task_list_commands.go

Comment on lines +165 to +167

+              	slices.SortFunc(table, func(a, b TaskListRow) int {
+              		return strings.Compare(a.Name, b.Name)
+              	})

Member

Groxx Mar 6, 2025 •

edited

Loading

I think you'll want slices.SortStableFunc here, or sort by both name and type. Sort is unstable, so you might see activity/decision reorders under the same name.

e.g. I think this is possible currently:

decision  tasklist-A   3
activity  tasklist-A   3
activity  tasklist-B   3
decision  tasklist-B   3

but SortStable will ensure it's always decision/activity/decision/activity (for same name pairs).

I don't think the tests exercise this, so it's rather minor at the moment, but probably still worth doing.

Member Author

natemort Mar 6, 2025

I actually changed the table format entirely, it's now this:

Name	Activity Pollers	Decision Pollers
tasklist-A	3	3
tasklist-B	3	3

So the name is unique and we've aggregated the counts across both types. My goal was to align it more with the Cadence UI, which lists the pollers of a TaskList and has a checkbox for whether each has a Decision/Activity poller.

Member

Groxx Mar 6, 2025

ah, I see, it's just iteratively filling the same row in both loops.

yea that's fine then 👍

Groxx reviewed

View reviewed changes

tools/cli/admin_task_list_commands_test.go

Comment on lines +123 to +127

+              				output := td.consoleOutput()
+              				var result map[types.TaskListType]*types.DescribeTaskListResponse
+              				err := json.Unmarshal([]byte(output), &result)
+              				require.NoError(t, err)
+              				assert.Equal(t, expected, result)

Member

Groxx Mar 6, 2025

heh. practical - I kinda like it.

Groxx reviewed

View reviewed changes

tools/cli/admin_task_list_commands_test.go Outdated

+              					},
+              				}, nil).Times(1)
+              			},
+              			expectedError:      "Decision",

Member

Groxx Mar 6, 2025

could probably use a more descriptive expected-error, or some kind of comment. this is rather vague about what it is checking, since "Decision" could be in lots of error messages.

Member Author

natemort Mar 6, 2025

Done.

Groxx approved these changes

View reviewed changes

Member

Groxx left a comment

only minor comments / tackle whatever you agree with, otherwise looks good to me


          Restructure Admin TaskList commands to operate on multiple types

While Activity/Decision TaskLists with a given name are independent, they typically represent a common purpose and we have never operated on just a single type. Handling them entirely separately within the admin commands doubles the amount of commands needed to run to operate on the TaskList and makes it harder to understand whether a given TaskList name is used for decision and/or activity tasks. Particularly as we move TaskList partition data from dynamic config to the DB it's important that we provide good operator ergonomics to avoid errors.

Update the Admin TaskList commands with the following:
- The absence of the `tasklisttype` flag now indicates both Activity and Decision. It previously was inconsistent between defaulting to Decision or being required.
- The describe command will output one row per type specified rather than a single row.
- The describe command will output that a TaskList has 1 read/write partition rather than no output if there is no config.
- The describe command will no longer output the list of pollers and instead print out the number of pollers. This functionality is still available via the non-admin version of the command, which does nothing other than printing the pollers.
- The describe command now supports json as an output format.
- The list command will consolidate Activity and Decision TaskLists with a given name to a single row, including separate columns for the number of decision and activity pollers.
- The list command now sorts its output by TaskList name.
- The list command now supports json as an output format.
- The update-partition command now operates on multiple TaskListTypes at once rather than Decision or Activity. It performs a safety check on both before performing any updates.

natemort force-pushed the tlt branch from 668c964 to 7114037 Compare

March 6, 2025 17:57

natemort merged commit 58bf63b into cadence-workflow:master

22 checks passed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

Groxx Groxx approved these changes

davidporter-id-au davidporter-id-au approved these changes

Shaddoll Awaiting requested review from Shaddoll Shaddoll is a code owner

neil-xie Awaiting requested review from neil-xie neil-xie is a code owner

shijiesheng Awaiting requested review from shijiesheng shijiesheng is a code owner

jakobht Awaiting requested review from jakobht jakobht is a code owner

3vilhamster Awaiting requested review from 3vilhamster 3vilhamster is a code owner

sankari165 Awaiting requested review from sankari165 sankari165 is a code owner

dkrotx Awaiting requested review from dkrotx dkrotx is a code owner

taylanisikdemir Awaiting requested review from taylanisikdemir taylanisikdemir is a code owner

demirkayaender Awaiting requested review from demirkayaender demirkayaender is a code owner

Labels

None yet