Don't expand groups for trailing end-of-line comments #6464

charliermarsh · 2023-08-09T23:35:11Z

Summary

This is a PR where we need to align on desired behavior. Right now, we expand a group when writing a trailing end-of-line comment, so e.g., given:

{
    a: a  # a
    for c in e  # for # c 
}

We format this expression as:

{
    a: a  # a
    for c in e  # for # c
}

Black, meanwhile, doesn't let trailing end-of-line comments expand, so formats as:

{a: a for c in e}  # a  # for # c

Looking through the snapshot diff, this seems to strictly improve Black compatibility (and our current behavior seems like a significant deviation). Even the changes in our own fixtures now better resemble Black as verified in the playground.

I don't know that I have a super strong opinion on whether to use Black's behavior or our own, but I lead towards following Black's behavior by default.

Test Plan

Weird mix on the similarity scores -- some went down a little, some went up a little.

Before:

zulip: 0.99702
django: 0.99784
warehouse: 0.99585
build: 0.75623
transformers: 0.99469
cpython: 0.75988
typeshed: 0.74853

After:

zulip: 0.99696
django: 0.99785
warehouse: 0.99569
build: 0.75623
transformers: 0.99481
cpython: 0.75987
typeshed: 0.74842

charliermarsh · 2023-08-09T23:35:22Z

crates/ruff_python_formatter/tests/snapshots/format@expression__compare.py.snap

-    a  # comment
-    == b
-)
+(a == b)  # comment


(This now matches Black.)

charliermarsh · 2023-08-09T23:35:45Z

crates/ruff_python_formatter/tests/snapshots/format@expression__dict_comp.py.snap

-    a: a  # a
-    for c in e  # for  # c  # in  # e
-}
+{a: a for c in e}  # a  # for  # c  # in  # e


This now matches Black.

charliermarsh · 2023-08-09T23:36:12Z

...es/ruff_python_formatter/tests/snapshots/black_compatibility@simple_cases__comments2.py.snap

-        for element in collection  # yup
-        if element is not None  # right
-    ]
+    lcomp = [element for element in collection if element is not None]  # yup  # yup  # right


This is much closer to Black but still wrong due to the line suffixes not being included when determining the line length:

lcomp = [ element for element in collection if element is not None # yup # yup # right ]

github-actions · 2023-08-10T00:09:44Z

PR Check Results

Benchmark

Linux

group                                      main                                   pr
-----                                      ----                                   --
formatter/large/dataset.py                 1.01      9.3±0.42ms     4.4 MB/sec    1.00      9.3±0.50ms     4.4 MB/sec
formatter/numpy/ctypeslib.py               1.00  1885.6±83.89µs     8.8 MB/sec    1.00  1879.4±141.52µs     8.9 MB/sec
formatter/numpy/globals.py                 1.05   229.2±16.86µs    12.9 MB/sec    1.00   219.2±12.62µs    13.5 MB/sec
formatter/pydantic/types.py                1.10      4.2±0.18ms     6.0 MB/sec    1.00      3.9±0.23ms     6.6 MB/sec
linter/all-rules/large/dataset.py          1.00     11.4±0.65ms     3.6 MB/sec    1.09     12.4±0.59ms     3.3 MB/sec
linter/all-rules/numpy/ctypeslib.py        1.00      3.0±0.10ms     5.5 MB/sec    1.12      3.4±0.17ms     4.9 MB/sec
linter/all-rules/numpy/globals.py          1.00   461.9±19.82µs     6.4 MB/sec    1.05   483.4±18.76µs     6.1 MB/sec
linter/all-rules/pydantic/types.py         1.00      6.2±0.28ms     4.1 MB/sec    1.06      6.6±0.39ms     3.9 MB/sec
linter/default-rules/large/dataset.py      1.04      6.4±0.23ms     6.4 MB/sec    1.00      6.1±0.23ms     6.7 MB/sec
linter/default-rules/numpy/ctypeslib.py    1.02  1370.0±68.25µs    12.2 MB/sec    1.00  1345.0±86.95µs    12.4 MB/sec
linter/default-rules/numpy/globals.py      1.03    168.5±9.33µs    17.5 MB/sec    1.00   162.8±11.67µs    18.1 MB/sec
linter/default-rules/pydantic/types.py     1.00      3.0±0.12ms     8.6 MB/sec    1.01      3.0±0.15ms     8.5 MB/sec

Windows

group                                      main                                   pr
-----                                      ----                                   --
formatter/large/dataset.py                 1.00      9.9±0.03ms     4.1 MB/sec    1.00      9.9±0.04ms     4.1 MB/sec
formatter/numpy/ctypeslib.py               1.00   1879.9±7.87µs     8.9 MB/sec    1.01   1897.7±9.42µs     8.8 MB/sec
formatter/numpy/globals.py                 1.00    197.4±1.80µs    14.9 MB/sec    1.01    199.1±5.09µs    14.8 MB/sec
formatter/pydantic/types.py                1.00      4.2±0.01ms     6.0 MB/sec    1.01      4.2±0.02ms     6.0 MB/sec
linter/all-rules/large/dataset.py          1.00     12.5±0.04ms     3.3 MB/sec    1.00     12.5±0.04ms     3.3 MB/sec
linter/all-rules/numpy/ctypeslib.py        1.00      3.5±0.01ms     4.8 MB/sec    1.01      3.5±0.02ms     4.7 MB/sec
linter/all-rules/numpy/globals.py          1.00    361.5±3.60µs     8.2 MB/sec    1.01    363.6±3.56µs     8.1 MB/sec
linter/all-rules/pydantic/types.py         1.00      6.5±0.02ms     3.9 MB/sec    1.00      6.5±0.02ms     3.9 MB/sec
linter/default-rules/large/dataset.py      1.00      6.8±0.02ms     6.0 MB/sec    1.00      6.8±0.04ms     6.0 MB/sec
linter/default-rules/numpy/ctypeslib.py    1.00  1396.5±12.89µs    11.9 MB/sec    1.00   1399.4±7.36µs    11.9 MB/sec
linter/default-rules/numpy/globals.py      1.00    143.0±1.17µs    20.6 MB/sec    1.00    143.7±3.45µs    20.5 MB/sec
linter/default-rules/pydantic/types.py     1.00      3.0±0.01ms     8.4 MB/sec    1.00      3.0±0.02ms     8.4 MB/sec

MichaReiser · 2023-08-10T06:01:28Z

Interesting to see this change in action.

Our formatter goal is to have (close) Black parity for already formatted code, but we intentionally allow deviation for unformatted code (using Black and Ruff in the same project isn't a use case we want to support). I don't expect this change to help increase parity for already Black formatted code because Black already have moved all comments to the end of the line. That's why I'm surprised that it affects the similarity index at all (the overall change seems neutral). We'll have to manually investigate the changes before moving forward. I suspect these are accidental changes where we have a different comment placement, and the lines now happen to be short enough to place the comment in the same position as Black.

The main question to me is whether the fewest vertical lines by collapsing comments improve readability and express the author's intentions. In my view, this isn't the case because comments are moved away from where I intentionally placed them:

Pragma comments

def test(
	a, #  type-ignore
	b
):
	pass

This now gets formatted as

def test(a, b): #  type-ignore
	pass

which not only suppresses typing errors for a, but also b. I now have to go back, expand the parameters and add a trailing comma after b if I want to keep the suppression specific to a (what I initially intended by adding the comment at the end of line a). This is a lot of work.

The workaround with adding trailing commas only works in positions where they are supported. For example, the following doesn't work.

(
    call(has_type_error) # type-ignore
    and another_call()
)

gets formatted as

(call(has_type_error) and another_call())  # type-ignore

which suppresses typing errors in another_call too. I doubt anyone would bother enough (until they have a bug because of the missed type checker error) to undo Black's change and add suppression comments:

# This doesn't work, probably because we have collapsed comments
(
    call(has_type_error) # fmt: skip # type-ignore 
    and another_call()
)

(
    # fmt: off
    call(has_type_error)  # type-ignore
    # fmt: on
    and another_call()
)

Ruff won't be able to provide this escape hatch because it doesn't support suppression comments on the expression level (you would need to suppress the whole statement). And suppression comments are a net negative in my view.

Comment context

This is related to pragma comments. Moving comments too far means that the comments loos their context.

call(
	[], #  TODO fill in values
	b,
	[]
)

gets formatted to

call([], b, [])  #  TODO fill in values

It is now unclear if I have to fill in the values in the first or last list. Again, I can work around this by adding a trailing comma, but that's a lot of work to make the formatter respect my comment placement.

Similar

call(
	[], # Empty because of X
	c,
	[], # Empty because of Y
)

gets formatted as

call([], c, [])  # Empty because of X  # Empty because of Y

Again, we have the context problem, but I also dislike collapsed comments # comment A # comment B because they look strange. It also comes with problems where pragma comments stop working if tools don't support nested comments, as we've seen with Black's fmt: skip pragma comment.

konstin

i'm not really opinionated either way, both styles have their advantages and disadvantages to me

charliermarsh · 2023-08-11T15:58:42Z

We're not making this change, for now at least.

charliermarsh commented Aug 9, 2023

View reviewed changes

charliermarsh force-pushed the charlie/break branch from d536fd5 to 0ee0a6e Compare August 9, 2023 23:36

charliermarsh added the formatter Related to the formatter label Aug 9, 2023

charliermarsh requested review from MichaReiser and konstin August 9, 2023 23:36

charliermarsh mentioned this pull request Aug 9, 2023

Allow return type annotations to use their own parentheses #6436

Merged

Don't expand groups for trailing end-of-line comments

6783d8a

charliermarsh force-pushed the charlie/break branch from 0ee0a6e to 6783d8a Compare August 10, 2023 00:17

konstin approved these changes Aug 10, 2023

View reviewed changes

charliermarsh closed this Aug 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Don't expand groups for trailing end-of-line comments #6464

Don't expand groups for trailing end-of-line comments #6464

Uh oh!

charliermarsh commented Aug 9, 2023 •

edited

Loading

Uh oh!

charliermarsh Aug 9, 2023

Uh oh!

charliermarsh Aug 9, 2023

Uh oh!

charliermarsh Aug 9, 2023

Uh oh!

github-actions bot commented Aug 10, 2023 •

edited

Loading

Uh oh!

MichaReiser commented Aug 10, 2023 •

edited

Loading

Uh oh!

konstin left a comment

Uh oh!

charliermarsh commented Aug 11, 2023

Uh oh!

Uh oh!

Don't expand groups for trailing end-of-line comments #6464

Don't expand groups for trailing end-of-line comments #6464

Uh oh!

Conversation

charliermarsh commented Aug 9, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test Plan

Uh oh!

charliermarsh Aug 9, 2023

Choose a reason for hiding this comment

Uh oh!

charliermarsh Aug 9, 2023

Choose a reason for hiding this comment

Uh oh!

charliermarsh Aug 9, 2023

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Aug 10, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Check Results

Benchmark

Linux

Windows

Uh oh!

MichaReiser commented Aug 10, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pragma comments

Comment context

Uh oh!

konstin left a comment

Choose a reason for hiding this comment

Uh oh!

charliermarsh commented Aug 11, 2023

Uh oh!

Uh oh!

charliermarsh commented Aug 9, 2023 •

edited

Loading

github-actions bot commented Aug 10, 2023 •

edited

Loading

MichaReiser commented Aug 10, 2023 •

edited

Loading