[`perflint`] fix invalid hoist in `perf401` #14369

w0nder1ng · 2024-11-15T21:03:52Z

This should fix #14362. This new fix currently deletes lines like this:

- tmp = 1; result = []
- for i in range(10):
-   result.append(i+1)
+ result = [i+1 for i in range(10)]

Is there a convenient way to get every statement within a given TextRange to detect when this is happening?

github-actions · 2024-11-15T21:17:55Z

`ruff-ecosystem` results

Linter (stable)

ℹ️ ecosystem check detected linter changes. (+18 -20 violations, +0 -0 fixes in 5 projects; 50 projects unchanged)

apache/airflow (+6 -6 violations, +0 -0 fixes)

ruff check --no-cache --exit-zero --ignore RUF9 --output-format concise --no-preview --select ALL

- dev/breeze/src/airflow_breeze/commands/release_management_commands.py:3074:9: PERF401 Use `list.extend` to create a transformed list
+ dev/breeze/src/airflow_breeze/commands/release_management_commands.py:3074:9: PERF401 Use a list comprehension to create a transformed list
- dev/breeze/src/airflow_breeze/utils/exclude_from_matrix.py:32:9: PERF401 Use `list.extend` to create a transformed list
+ dev/breeze/src/airflow_breeze/utils/exclude_from_matrix.py:32:9: PERF401 Use a list comprehension to create a transformed list
- dev/breeze/src/airflow_breeze/utils/packages.py:327:9: PERF401 Use `list.extend` to create a transformed list
+ dev/breeze/src/airflow_breeze/utils/packages.py:327:9: PERF401 Use a list comprehension to create a transformed list
- docs/exts/docs_build/fetch_inventories.py:103:9: PERF401 Use `list.extend` to create a transformed list
+ docs/exts/docs_build/fetch_inventories.py:103:9: PERF401 Use a list comprehension to create a transformed list
- providers/src/airflow/providers/microsoft/azure/hooks/wasb.py:721:13: PERF401 Use `list.extend` with an async comprehension to create a transformed list
+ providers/src/airflow/providers/microsoft/azure/hooks/wasb.py:721:13: PERF401 Use an async list comprehension to create a transformed list
- scripts/in_container/update_quarantined_test_status.py:81:13: PERF401 Use `list.extend` to create a transformed list
+ scripts/in_container/update_quarantined_test_status.py:81:13: PERF401 Use a list comprehension to create a transformed list

apache/superset (+8 -8 violations, +0 -0 fixes)

ruff check --no-cache --exit-zero --ignore RUF9 --output-format concise --no-preview --select ALL

- superset/db_engine_specs/base.py:107:9: PERF401 Use `list.extend` to create a transformed list
+ superset/db_engine_specs/base.py:107:9: PERF401 Use a list comprehension to create a transformed list
+ superset/db_engine_specs/lib.py:245:9: PERF401 Use `list.extend` to create a transformed list
- superset/db_engine_specs/lib.py:245:9: PERF401 Use a list comprehension to create a transformed list
+ superset/db_engine_specs/lib.py:251:9: PERF401 Use `list.extend` to create a transformed list
- superset/db_engine_specs/lib.py:251:9: PERF401 Use a list comprehension to create a transformed list
+ superset/db_engine_specs/lib.py:261:9: PERF401 Use `list.extend` to create a transformed list
- superset/db_engine_specs/lib.py:261:9: PERF401 Use a list comprehension to create a transformed list
+ superset/db_engine_specs/lib.py:281:9: PERF401 Use `list.extend` to create a transformed list
- superset/db_engine_specs/lib.py:281:9: PERF401 Use a list comprehension to create a transformed list
+ superset/db_engine_specs/lib.py:293:9: PERF401 Use `list.extend` to create a transformed list
- superset/db_engine_specs/lib.py:293:9: PERF401 Use a list comprehension to create a transformed list
+ superset/tasks/cache.py:208:13: PERF401 Use `list.extend` to create a transformed list
- superset/tasks/cache.py:208:13: PERF401 Use a list comprehension to create a transformed list
+ tests/integration_tests/security/migrate_roles_tests.py:50:13: PERF401 Use `list.extend` to create a transformed list
- tests/integration_tests/security/migrate_roles_tests.py:50:13: PERF401 Use a list comprehension to create a transformed list

bokeh/bokeh (+1 -3 violations, +0 -0 fixes)

ruff check --no-cache --exit-zero --ignore RUF9 --output-format concise --no-preview --select ALL

- src/bokeh/plotting/_figure.py:479:17: PERF401 Use `list.extend` to create a transformed list
- src/bokeh/plotting/_figure.py:485:17: PERF401 Use `list.extend` to create a transformed list
- src/bokeh/server/contexts.py:310:17: PERF401 Use `list.extend` to create a transformed list
+ src/bokeh/server/contexts.py:310:17: PERF401 Use a list comprehension to create a transformed list

latchbio/latch (+3 -3 violations, +0 -0 fixes)

- src/latch/ldata/_transfer/upload.py:163:25: PERF401 Use `list.extend` to create a transformed list
+ src/latch/ldata/_transfer/upload.py:163:25: PERF401 Use a list comprehension to create a transformed list
- src/latch/registry/utils.py:70:13: PERF401 Use `list.extend` to create a transformed list
+ src/latch/registry/utils.py:70:13: PERF401 Use a list comprehension to create a transformed list
- src/latch_cli/services/cp/utils.py:54:9: PERF401 Use `list.extend` to create a transformed list
+ src/latch_cli/services/cp/utils.py:54:9: PERF401 Use a list comprehension to create a transformed list

pandas-dev/pandas (+0 -0 violations, +0 -0 fixes)

Changes by rule (1 rules affected)

code	total	+ violation	- violation	+ fix	- fix
PERF401	38	18	20	0	0

Linter (preview)

ℹ️ ecosystem check detected linter changes. (+18 -20 violations, +0 -0 fixes in 4 projects; 51 projects unchanged)

apache/airflow (+6 -6 violations, +0 -0 fixes)

ruff check --no-cache --exit-zero --ignore RUF9 --output-format concise --preview --select ALL

- dev/breeze/src/airflow_breeze/commands/release_management_commands.py:3074:9: PERF401 Use `list.extend` to create a transformed list
+ dev/breeze/src/airflow_breeze/commands/release_management_commands.py:3074:9: PERF401 Use a list comprehension to create a transformed list
- dev/breeze/src/airflow_breeze/utils/exclude_from_matrix.py:32:9: PERF401 Use `list.extend` to create a transformed list
+ dev/breeze/src/airflow_breeze/utils/exclude_from_matrix.py:32:9: PERF401 Use a list comprehension to create a transformed list
- dev/breeze/src/airflow_breeze/utils/packages.py:327:9: PERF401 Use `list.extend` to create a transformed list
+ dev/breeze/src/airflow_breeze/utils/packages.py:327:9: PERF401 Use a list comprehension to create a transformed list
- docs/exts/docs_build/fetch_inventories.py:103:9: PERF401 Use `list.extend` to create a transformed list
+ docs/exts/docs_build/fetch_inventories.py:103:9: PERF401 Use a list comprehension to create a transformed list
- providers/src/airflow/providers/microsoft/azure/hooks/wasb.py:721:13: PERF401 Use `list.extend` with an async comprehension to create a transformed list
+ providers/src/airflow/providers/microsoft/azure/hooks/wasb.py:721:13: PERF401 Use an async list comprehension to create a transformed list
- scripts/in_container/update_quarantined_test_status.py:81:13: PERF401 Use `list.extend` to create a transformed list
+ scripts/in_container/update_quarantined_test_status.py:81:13: PERF401 Use a list comprehension to create a transformed list

apache/superset (+8 -8 violations, +0 -0 fixes)

ruff check --no-cache --exit-zero --ignore RUF9 --output-format concise --preview --select ALL

- superset/db_engine_specs/base.py:107:9: PERF401 Use `list.extend` to create a transformed list
+ superset/db_engine_specs/base.py:107:9: PERF401 Use a list comprehension to create a transformed list
+ superset/db_engine_specs/lib.py:245:9: PERF401 Use `list.extend` to create a transformed list
- superset/db_engine_specs/lib.py:245:9: PERF401 Use a list comprehension to create a transformed list
+ superset/db_engine_specs/lib.py:251:9: PERF401 Use `list.extend` to create a transformed list
- superset/db_engine_specs/lib.py:251:9: PERF401 Use a list comprehension to create a transformed list
+ superset/db_engine_specs/lib.py:261:9: PERF401 Use `list.extend` to create a transformed list
- superset/db_engine_specs/lib.py:261:9: PERF401 Use a list comprehension to create a transformed list
+ superset/db_engine_specs/lib.py:281:9: PERF401 Use `list.extend` to create a transformed list
- superset/db_engine_specs/lib.py:281:9: PERF401 Use a list comprehension to create a transformed list
+ superset/db_engine_specs/lib.py:293:9: PERF401 Use `list.extend` to create a transformed list
- superset/db_engine_specs/lib.py:293:9: PERF401 Use a list comprehension to create a transformed list
+ superset/tasks/cache.py:208:13: PERF401 Use `list.extend` to create a transformed list
- superset/tasks/cache.py:208:13: PERF401 Use a list comprehension to create a transformed list
+ tests/integration_tests/security/migrate_roles_tests.py:50:13: PERF401 Use `list.extend` to create a transformed list
- tests/integration_tests/security/migrate_roles_tests.py:50:13: PERF401 Use a list comprehension to create a transformed list

bokeh/bokeh (+1 -3 violations, +0 -0 fixes)

ruff check --no-cache --exit-zero --ignore RUF9 --output-format concise --preview --select ALL

- src/bokeh/plotting/_figure.py:479:17: PERF401 Use `list.extend` to create a transformed list
- src/bokeh/plotting/_figure.py:485:17: PERF401 Use `list.extend` to create a transformed list
- src/bokeh/server/contexts.py:310:17: PERF401 Use `list.extend` to create a transformed list
+ src/bokeh/server/contexts.py:310:17: PERF401 Use a list comprehension to create a transformed list

latchbio/latch (+3 -3 violations, +0 -0 fixes)

ruff check --no-cache --exit-zero --ignore RUF9 --output-format concise --preview

- src/latch/ldata/_transfer/upload.py:163:25: PERF401 Use `list.extend` to create a transformed list
+ src/latch/ldata/_transfer/upload.py:163:25: PERF401 Use a list comprehension to create a transformed list
- src/latch/registry/utils.py:70:13: PERF401 Use `list.extend` to create a transformed list
+ src/latch/registry/utils.py:70:13: PERF401 Use a list comprehension to create a transformed list
- src/latch_cli/services/cp/utils.py:54:9: PERF401 Use `list.extend` to create a transformed list
+ src/latch_cli/services/cp/utils.py:54:9: PERF401 Use a list comprehension to create a transformed list

Changes by rule (1 rules affected)

code	total	+ violation	- violation	+ fix	- fix
PERF401	38	18	20	0	0

Skylion007 · 2024-11-17T17:25:12Z

@w0nder1ng Another fun edge case that PR #14369 doesn't currently fix. Scoping rules are different between list comprehensions and for loops:

another example from pytorch

     if kwargs is None:
         kwargs = {}
-    impl_args = []
-    for a in args:
-        impl_args.append(_helper(a, map_fn))
+    impl_args = [_helper(a, map_fn) for a in args]
     impl_kwargs = {}
     for k in kwargs.keys():
         impl_kwargs[k] = _helper(a, map_fn)

the a on the line impl_kwargs[k] = _helper(a, map_fn) is now undefined (and will error with F821) since the list comprehension was converted from a loop. RUFF immeaditely flags this with an F821 error, and I test and confirm the temporary variable a does not leave the listcomp scope, while it does leave the for loop one.

@w0nder1ng If you want a good test bed, just run the autofix on PyTorch (or another large codebase) and see after applying the fixes if any other ruff rule violations are immediately detected.

Skylion007 · 2024-11-17T17:29:42Z

Also another really minor nit is that it can duplicate comments.

-        for dtype in ["f16", "bf16"]:
-            kernels.append(
-                cls(
-                    aligned=True,
-                    dtype=dtype,
-                    sm_range=(80, SM[SM.index(80) + 1]),
-                    apply_dropout=False,
-                    preload_mmas=True,
-                    block_i=128,
-                    block_j=64,
-                    max_k=96,
-                    # Sm80 has a faster kernel for this case
-                    dispatch_cond="cc == 86 || cc == 89",
-                )
+        # Sm80 has a faster kernel for this case
+        kernels.extend(
+            cls(
+                aligned=True,
+                dtype=dtype,
+                sm_range=(80, SM[SM.index(80) + 1]),
+                apply_dropout=False,
+                preload_mmas=True,
+                block_i=128,
+                block_j=64,
+                max_k=96,
+                # Sm80 has a faster kernel for this case
+                dispatch_cond="cc == 86 || cc == 89",
             )
+            for dtype in ["f16", "bf16"]
+        )

See here.

w0nder1ng · 2024-11-17T22:02:15Z

Another fun edge case that PR #14369 doesn't currently fix

I think in this case, the lint shouldn't apply at all. If applying the fix breaks the code, then someone manually doing the same thing also wouldn't work. It should probably check that all references to the loop variable are inside the for loop before reporting the lint.

I'll also try to fix the duplicate comments. I'm starting to see why this didn't have a fix before :)

w0nder1ng · 2024-11-18T02:22:05Z

The comment duplication happened when the comment was inside the append body, it should be fixed now.

w0nder1ng · 2024-11-18T02:27:56Z

crates/ruff_linter/src/rules/perflint/rules/manual_list_comprehension.rs

+    // ```
+    let for_loop_target = checker
+        .semantic()
+        .lookup_symbol(id.as_str())


For some reason, resolve_name returns None for the for-loop target. Also, references to it outside of the for-loop are not included in its list of references; e.g.

def f(): result = [] for val in range(5): result.append(val * 2) print(val) # this reference is not included in the list of references

I think the problem here is that we build the semantic model lazily as we traverse the AST for checking and it hasn't reached the point after the for loop yet

@charliermarsh any suggestions on how to best find all usages of target?

I think a working version would look something like this:

let for_loop_read = checker.semantic().resolve_load(for_loop_target_name_expr); let for_loop_binding = match for_loop_read { ReadResult::Resolved(id) => checker.semantic().binding(id), _ => unreachable!("for loop target must exist"), };

Unfortunately, resolve_load requires an &mut SemanticModel and I don't see a way to get one from the checker. Am I missing something here?

Sorry, I didn't look deeply, but often in those cases you need to make it a deferred rule, so it runs after the semantic model has been built (e.g., run it in deferred_for_loops).

MichaReiser · 2024-11-18T07:14:23Z

Could you take a look why https://github.com/apache/superset/blob/e528cb48c44543c14c1ac9a93528b147bcaecfde/scripts/benchmark_migration.py#L128 is no longer reported. I might have missed something obvious but it isn't clear to me why the diagnostic isn't reported anymore.

w0nder1ng · 2024-11-18T15:27:41Z

I suspect the regressions are because of the resolve_name currently being a lookup_symbol. In the example you gave, I think it's finding a different foreign_key binding on line 113 and concluding that the symbol is used outside of the for loop. Once the binding has the right scope, the only regressions should be ones where the lint shouldn't have applied.

Skylion007 · 2024-11-18T15:34:17Z

Could you take a look why https://github.com/apache/superset/blob/e528cb48c44543c14c1ac9a93528b147bcaecfde/scripts/benchmark_migration.py#L128 is no longer reported. I might have missed something obvious but it isn't clear to me why the diagnostic isn't reported anymore.

Because the lambda arg here https://github.com/apache/superset/blob/e528cb48c44543c14c1ac9a93528b147bcaecfde/scripts/benchmark_migration.py#L131C28-L131C33 is shadowing the variable "model" in the for loop. In this specific case, it's probably okay since it's in a lambda arg, but in general it shouldn't apply the fix there. I suppose only references rvalues are problematic outside of the forloop, not lvalues (if all they do is immediately get overwritten after all).

Skylion007 · 2024-11-18T15:49:09Z

Okay hopefully last nit:

     reduction_axes: List[int] = []
-    for i in range(input_rank):
-        if i != axis:
-            reduction_axes.append(i)
+    reduction_axes.extend(i for i in range(input_rank) if i != axis)

Doesn't seem to like to hoist if there are any type_hints on the original [] instantiation. Is this intentional?

w0nder1ng · 2024-11-18T15:52:47Z

I'll take a look

w0nder1ng · 2024-11-18T16:18:29Z

Annotated assigns have a different statement type than normal assigns, and I wasn't handling it. The fix should work on type-annotated lists now.

Skylion007 · 2024-11-18T21:20:53Z

@w0nder1ng Another fun edge case that PR #14369 doesn't currently fix. Scoping rules are different between list comprehensions and for loops:

another example from pytorch
     if kwargs is None:
         kwargs = {}
-    impl_args = []
-    for a in args:
-        impl_args.append(_helper(a, map_fn))
+    impl_args = [_helper(a, map_fn) for a in args]
     impl_kwargs = {}
     for k in kwargs.keys():
         impl_kwargs[k] = _helper(a, map_fn)
the a on the line impl_kwargs[k] = _helper(a, map_fn) is now undefined (and will error with F821) since the list comprehension was converted from a loop. RUFF immeaditely flags this with an F821 error, and I test and confirm the temporary variable a does not leave the listcomp scope, while it does leave the for loop one.

@w0nder1ng If you want a good test bed, just run the autofix on PyTorch (or another large codebase) and see after applying the fixes if any other ruff rule violations are immediately detected.

This one is still occurring sadly. Maybe because it references another loop lol?

Skylion007 · 2024-11-19T15:15:12Z

Despite this minor false positive, looks like all the other fixes worked well on the PyTorch codebase (for the torch/torchgen folders) pytorch/pytorch#140980. 👍

w0nder1ng · 2024-11-22T15:50:48Z

I'm still stuck on two things.

How can I determine whether the binding statement has another statement on the line?

result = []; test = [] # deleting this whole line is wrong, but only deleting the binding statement also is wrong
# since it would leave the comment if there were no other statement
for ...

How can I get the semantic model to find references after the for loop?

result = []
for i in range(10):
  result.append(i+1)
print(i) # "fixing" this for loop is invalid because something relies on the loop variable, but the semantic model doesn't see this reference yet

Skylion007 · 2024-11-25T15:26:26Z

@diceroll123 @MichaReiser Any idea how to tackle these edge cases?

MichaReiser · 2024-11-25T15:59:31Z

@Skylion007 no, not from the top of my head and I haven't had time yet to look into it more closely

diceroll123 · 2024-11-25T16:04:23Z

I did not look into it any further after my last comment on #14551.

This PR and my PR contain a bunch of overlapping changes (while addressing different bugs), and I don't want to step on any toes or spend more time on it if it's not going to be considered, so if one or the other is merged, or some kind of plan is made, we can go from there I suppose! No strong feelings on my part. 😄

Skylion007 · 2024-11-28T17:07:17Z

One other place this autofix breaks (but ruff does appropriately catch it!). Apply it to sympy/tensor/index_methods.py in sympy. Fixing this loop breaks: https://github.com/sympy/sympy/blob/ba68537e73dd40fb34fa9c5aaf46c5a6e1ef12c7/sympy/tensor/index_methods.py#L427 . I suspect because it's using unusual python syntax to iterate over an implicit tuple.

for d in dbase, dexp:

transforming that to a list comprehension likely breaks

I will say that even it's current form, this autofix is extremely useful, and the encountered edge cases are very rare in real code, and usually caught by other checks.

Skylion007 · 2024-12-03T14:13:50Z

This unaccepted rule would likely have to tackle many of the same problems. Did they solve this edge case? #11769

w0nder1ng · 2024-12-06T00:24:17Z

I think that's all of the edge cases, except for the one involving the loop variable. I'd be happy to try to convert the rule into a binding rule, but I think that might be outside of the scope of this PR.

w0nder1ng · 2024-12-06T17:14:56Z

Even after moving the lint into deferred_for_loops, it still doesn't properly find the loop variable:

def f():
    result = []
    for i in range(5):
    #   ^ running checker.semantic().resolve_name() on this ExprName returns None.
        result.append(i * 2)
        #             ^ when I run binding.statement() on this binding, 
        #               it returns the "i = 1" statement, even though that is clearly wrong
    i = 1
  # ^ lookup_symbol returns this i
    print(i)

Am I missing something here, or is this behavior incorrect? See the last commit for the code I used to get this result.

dhruvmanila · 2024-12-09T04:16:29Z

    for i in range(5):
    #   ^ running checker.semantic().resolve_name() on this ExprName returns None.

That might be because i is itself the definition. What I see in the semantic model is that an entry in the resolved_names is created only by the resolve_load function which would work on the ExprName with the ExprContext::Load context. But, here the i variable that is the for target would have the ExprContext::Store context (https://play.ruff.rs/44c3909e-4f46-4a46-bd6e-36ea78223322) which is why I think you're getting None there.

MichaReiser · 2024-12-09T09:17:15Z

crates/ruff_linter/src/rules/perflint/rules/manual_list_comprehension.rs

+            let binding_string = checker
+                .locator()
+                .slice(for_loop_target.statement(checker.semantic()).unwrap());
+            dbg!(binding_string);


Suggested change

dbg!(binding_string);

MichaReiser · 2024-12-09T09:54:21Z

^ lookup_symbol returns this i

This is because the semantic model now captures the state after running the entire program. It's complete, and running lookup_symbol gives you the most recent binding for i after evaluating the entire function scope. That means you now have to do some extra work to get the for-loop target binding:

    let last_target_binding = checker
        .semantic()
        .lookup_symbol(id)
        .expect("For loop to exist");

    let mut bindings = [last_target_binding].into_iter().chain(
        checker
            .semantic()
            .shadowed_bindings(checker.semantic().scope_id, last_target_binding)
            .filter_map(|shadowed| shadowed.same_scope().then_some(shadowed.shadowed_id())),
    );

    let target_binding = bindings
        .find_map(|binding_id| {
            let binding = checker.semantic().binding(binding_id);

            if dbg!(binding.statement(checker.semantic())).and_then(Stmt::as_for_stmt)
                == Some(for_stmt)
            {
                Some(binding)
            } else {
                None
            }
        })
        .expect("for target binding must exist");

    drop(bindings);

There might be a better way to detect if the binding belongs to the target that e.g. avoids picking up named expressions in the iterator part.

w0nder1ng · 2024-12-09T22:00:50Z

crates/ruff_linter/src/rules/perflint/rules/manual_list_comprehension.rs

+        })
+        .expect("for target binding must exist");
+    let target_binding = checker.semantic().binding(target_binding_id);
+    // TODO: should this be a HashMap?


I made this a vector, but if there's enough references, it might make more sense for it to be a HashMap instead. What do you think?

MichaReiser · 2024-12-10T09:03:48Z

It seems we're now skipping over the two result.append usages in https://github.com/bokeh/bokeh/blob/829b2a75c402d0d0abd7e37ff201fbdfd949d857/src/bokeh/plotting/_figure.py#L485

Is this intentional? Should we change the implementation not to provide a fix but still the usage?

Edit: I think this is intentional because kw is shadowed.

MichaReiser · 2024-12-10T09:14:20Z

crates/ruff_linter/src/rules/perflint/rules/manual_list_comprehension.rs

+    let shadowed_references: Vec<_> = checker
+        .semantic()
+        .shadowed_bindings(checker.semantic().scope_id, target_binding_id)
+        .flat_map(|shadowed| {
+            let shadowed_binding = checker.semantic().binding(shadowed.shadowed_id());
+            shadowed_binding.references()
+        })
+        .collect();
+
+    drop(bindings);
+
+    if target_binding
+        .references()
+        .filter(|r_id| !shadowed_references.contains(r_id))
+        .map(|reference| checker.semantic().reference(reference))
+        .any(|r| !for_stmt.range.contains_range(r.range()))
+    {
+        return;
+    }
+


I don't understand what this code is doing. Would you mind adding a comment explaining why looking at the shadowed_references is necessary?

w0nder1ng · 2024-12-10T19:11:04Z

It seems we're now skipping over the two result.append usages in https://github.com/bokeh/bokeh/blob/829b2a75c402d0d0abd7e37ff201fbdfd949d857/src/bokeh/plotting/_figure.py#L485

Is this intentional? Should we change the implementation not to provide a fix but still the usage?

Edit: I think this is intentional because kw is shadowed.

It would be shadowed if there weren't a return. A smaller example:

def f(switch):
    i = 1
#   ^ this is the `i` which actually gets returned
    if switch:
        items = [1, 2, 3, 4]
        result = []
        #   v if it weren't for the return, this `i` would be the binding
        for i in items:
            result.append(i + 1)
        # v unconditional return means that the binding is not actually relevant
        return result
    #       v fix thinks this `i` is the loop variable
    return [i]

MichaReiser

Uff, this rule is complicated. Thanks for pushing through it

I made it through the detection logic and pushed a few smaller simplifications. I'm now about to review the fix and noticed the parse call. I suggest to either not provide a fix in the multi assignment case (it's not that common) or to use the SimpleTokenizer to avoid using the full parser here. It might even simplify some of your code because it allows e.g. searching for the ;

MichaReiser · 2024-12-11T12:49:31Z

crates/ruff_linter/src/rules/perflint/rules/manual_list_comprehension.rs

+                let binding_text = locator
+                    .slice(locator.full_lines_range(binding_stmt_range))
+                    .trim_whitespace_start();
+                let tokens = parse(binding_text, Mode::Module).map_err(|err| {
+                    anyhow!(
+                        "Failed to tokenize binding statement: {err} {}",
+                        binding_text
+                    )
+                })?;
+                tokens
+                    .tokens()
+                    .iter()
+                    .any(|token| matches!(token.kind(), TokenKind::Semi))


I'm leaning towards not providing a fix in this case to avoid the extra complexity.

Either way, we should avoid parsing here. We can use the SimpleTokenizer if we need tokenization and can't just use the AST

w0nder1ng · 2024-12-11T21:28:55Z

I've removed the tokenization without changing the rest of the code. The code now iterates over the characters directly because I was having trouble with the SimpleTokenizer not properly tokenizing some lines.

Is it worth also removing the code to handle multiple statements to simplify the logic?

MichaReiser · 2024-12-12T08:05:28Z

The code now iterates over the characters directly because I was having trouble with the SimpleTokenizer not properly tokenizing some lines.

I pushed a change that uses the SimpleTokenizer and BackwardsTokenizer instead. That also revealed that we should remove any leading ;.

But I think we're good now and can land this change. Thank you for working on this very involved. rule

Skylion007 · 2024-12-15T17:49:37Z

Beautiful job landing this rule @w0nder1ng . Any plans on tackling the dict comprehension version since the logic is very similar?

w0nder1ng · 2024-12-15T18:08:48Z

Sure, when I have the opportunity

w0nder1ng added 3 commits November 15, 2024 11:28

fix invalid hoist

aafb689

tentative fix

fd71328

don't use a comprehension when the list is referenced

f30118e

charliermarsh added bug Something isn't working fixes Related to suggested fixes for violations labels Nov 16, 2024

w0nder1ng added 2 commits November 17, 2024 20:58

check for external references to the for loop variable

238d17a

stop duplicating comments inside the fixed append

46bd851

w0nder1ng commented Nov 18, 2024

View reviewed changes

w0nder1ng added 2 commits November 18, 2024 11:09

fix annotated returns, but lose type info

9a336fd

apply fix to type-annotated lists

63b0e1e

Skylion007 mentioned this pull request Nov 23, 2024

[perflint] - Fix manual-list-comprehension for async generators (PERF401) #14551

Open

w0nder1ng added 2 commits December 6, 2024 10:33

move manual list comprehension to deferred check

f6ccb39

add debugging code to show binding

6a98542

MichaReiser reviewed Dec 9, 2024

View reviewed changes

w0nder1ng added 4 commits December 9, 2024 14:39

find the right binding for the loop variable

e2eec2e

check that the correct for loop is picked

9d41fb7

don't stop fix if a reference to target is actually another binding

104b69e

allow shadowed bindings from any scope

e44fcaa

w0nder1ng commented Dec 9, 2024

View reviewed changes

MichaReiser reviewed Dec 10, 2024

View reviewed changes

w0nder1ng added 2 commits December 10, 2024 14:27

simplify check for loop variable usages

4696878

cargo fmt

74afdb0

MichaReiser reviewed Dec 11, 2024

View reviewed changes

MichaReiser and others added 4 commits December 11, 2024 13:54

Few simplifications

db8d0d6

Merge branch 'main' into perf401_invalid_hoist

77a0290

fix merge

c9b394c

replace tokenization with simple character iterator

9c55f1b

Use simple tokenizer

523808d

MichaReiser merged commit 2eac00c into astral-sh:main Dec 12, 2024
21 checks passed

BrewTestBot mentioned this pull request Dec 12, 2024

ruff 0.8.3 Homebrew/homebrew-core#200946

Merged

qarmin mentioned this pull request Dec 14, 2024

Checking file with rule PERF401 cause panic #14969

Closed

[perflint] fix invalid hoist in perf401 #14369

[perflint] fix invalid hoist in perf401 #14369

Conversation

w0nder1ng commented Nov 15, 2024 • edited Loading

github-actions bot commented Nov 15, 2024 • edited Loading

ruff-ecosystem results

Linter (stable)

Linter (preview)

Skylion007 commented Nov 17, 2024 • edited Loading

Skylion007 commented Nov 17, 2024 • edited Loading

w0nder1ng commented Nov 17, 2024

w0nder1ng commented Nov 18, 2024

Choose a reason for hiding this comment

MichaReiser Nov 18, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

MichaReiser commented Nov 18, 2024

w0nder1ng commented Nov 18, 2024 • edited Loading

Skylion007 commented Nov 18, 2024

Skylion007 commented Nov 18, 2024

w0nder1ng commented Nov 18, 2024

w0nder1ng commented Nov 18, 2024 • edited Loading

Skylion007 commented Nov 18, 2024

Skylion007 commented Nov 19, 2024 • edited Loading

w0nder1ng commented Nov 22, 2024

Skylion007 commented Nov 25, 2024 • edited Loading

MichaReiser commented Nov 25, 2024

diceroll123 commented Nov 25, 2024 • edited Loading

Skylion007 commented Nov 28, 2024

Skylion007 commented Dec 3, 2024

w0nder1ng commented Dec 6, 2024

w0nder1ng commented Dec 6, 2024

dhruvmanila commented Dec 9, 2024

Choose a reason for hiding this comment

MichaReiser commented Dec 9, 2024 • edited Loading

^ lookup_symbol returns this i

Choose a reason for hiding this comment

MichaReiser commented Dec 10, 2024 • edited Loading

Choose a reason for hiding this comment

w0nder1ng commented Dec 10, 2024

MichaReiser left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

w0nder1ng commented Dec 11, 2024 • edited Loading

MichaReiser commented Dec 12, 2024

Skylion007 commented Dec 15, 2024

w0nder1ng commented Dec 15, 2024

[`perflint`] fix invalid hoist in `perf401` #14369

[`perflint`] fix invalid hoist in `perf401` #14369

w0nder1ng commented Nov 15, 2024 •

edited

Loading

github-actions bot commented Nov 15, 2024 •

edited

Loading

`ruff-ecosystem` results

Skylion007 commented Nov 17, 2024 •

edited

Loading

Skylion007 commented Nov 17, 2024 •

edited

Loading

MichaReiser Nov 18, 2024 •

edited

Loading

w0nder1ng commented Nov 18, 2024 •

edited

Loading

w0nder1ng commented Nov 18, 2024 •

edited

Loading

Skylion007 commented Nov 19, 2024 •

edited

Loading

Skylion007 commented Nov 25, 2024 •

edited

Loading

diceroll123 commented Nov 25, 2024 •

edited

Loading

MichaReiser commented Dec 9, 2024 •

edited

Loading

MichaReiser commented Dec 10, 2024 •

edited

Loading

w0nder1ng commented Dec 11, 2024 •

edited

Loading