Format-preserving AST transformations TODO #344

nikic · 2017-01-24T11:39:49Z

ao2 · 2017-04-28T07:54:31Z

Hi,

Preserving the original code formatting when performing AST transformations would be very useful for code refactoring.

However, even without it php-parser has still been useful to me as a validating tool for automatic refactoring done with other mechanisms which do preserve the formatting.

Here is an example using sed and regexes (which I don't trust myself with) for the actual refactoring and php-parser for validation: https://git.ao2.it/experiments/php-drupal-console-code-refactoring.git/tree

Feel free to take the PHP code as an example for php-parser; it implements a visitor which changes the second argument of a particular function to null under certain conditions.

nikic · 2017-04-29T11:09:20Z

As a small update here: While there has been no movement on the above TODOs, all the necessary parser changes are now in 4.0, without the need to enable special options. As such, the boilerplate code to perform format-preserving transformations is now:

use PhpParser\{Lexer, NodeTraverser, NodeVisitor, Parser, PrettyPrinter};

$lexer = new Lexer\Emulative([
    'usedAttributes' => [
        'comments',
        'startLine', 'endLine',
        'startTokenPos', 'endTokenPos',
    ],
]);
$parser = new Parser\Php7($lexer);

$traverser = new NodeTraverser();
$traverser->addVisitor(new NodeVisitor\CloningVisitor());

$printer = new PrettyPrinter\Standard();

$oldStmts = $parser->parse($code);
$oldTokens = $lexer->getTokens();

$newStmts = $traverser->traverse($oldStmts);

// MODIFY $newStmts HERE

$newCode = $printer->printFormatPreserving($newStmts, $oldStmts, $oldTokens);

gplanchat · 2017-05-30T23:53:29Z

Hello

A few months ago, I wrote an AST merger: https://github.com/kiboko-labs/akeneo-product-values-management/blob/master/src/Builder/ClassMerger.php

It is very specialized to the needs I had: appending properties and methods to existing classes. Maybe this solution could be some way to answer some of these needs?

TomasVotruba · 2017-07-16T19:58:17Z

Thanks for creating this issue. I wonder about another approach to this, let me share:

PrettyPrinter would put out code in non-namespace version. It is not nice, but it works well. What does it need to look nice? A couple of sniffs/fixers, with PSR-2, Symfony style or others.

If you already use a coding standard, just run it after PHP-Parser and the code is both refactored and formatted according to your style needs. No need to integrate various coding standard rules into the parser itself. Parser should parse and focus on printing, if there are maintained tools for that. Maybe there are other features of Parser that would be happy for attention and that cannot be handled by coding standards.

I'm working on this approach in EasyCodingStandard combined with Rector - CLI tool to refactor legacy code to modern and clean code. Rector will do refactoring + coding standard fixes in one command.

What do you think?

TomasVotruba · 2017-07-16T20:04:58Z

> As a small update here: While there has been no movement on the above TODOs, all the necessary parser changes are now in 4.0, without the need to enable special options.

Thanks for the update and for sharing specific code. Is there any related PR to check? I'm curious what you have changed.
Do you have any plans for a 4.0 release? I'll use it for the stable version of my package now to have this feature enabled.

nikic · 2017-07-19T15:32:38Z

I don't have concrete plans for a 4.0 release yet. I did not have time to work on this further and would like to at least resolve some of the TODO items (especially the first one is a large limitation).

TomasVotruba · 2017-07-20T17:16:20Z

I see. I'm adding a method to a specific position like this.

vovafeldman · 2017-08-01T14:26:13Z

@nikic any ETA on releasing a stable version of format preservation? We really need it :)

jamesckemp · 2017-09-26T14:19:07Z

Yes please!

raducretu · 2017-09-29T15:11:29Z

+1

Juddling · 2017-11-26T13:13:39Z

Hi all, I'm trying to modify some existing code - just pushing an item onto the end of an array, but the formatting is coming out like so:

--- Expected
+++ Actual
@@ @@
 <?php

 return [
     'one',
     'two',
-    'three',
-    'four'
+    'three', 'four'

I cloned an existing node in the hope that it would copy the starting position.

Here is the relevant code:

    public function addToArray($code)
    {
        $visitor = new class extends NodeVisitorAbstract {
            public function leaveNode(Node $node) {
                if ($node instanceof Array_) {
                    $newItem = clone $node->items[0];
                    $node->items[] = $newItem;
                    return $node;
                }
            }
        };

        return $this->modify($code, $visitor);
    }

    public function modify($code, $visitor)
    {
        $oldStmts = $this->parser->parse($code);
        $oldTokens = $this->lexer->getTokens();
        $newStmts = $this->traverser->traverse($oldStmts);

        $this->traverser->addVisitor($visitor);

        $newStmts = $this->traverser->traverse($newStmts);
        return $this->printer->printFormatPreserving($newStmts, $oldStmts, $oldTokens);
    }

Could someone please advise me if this is a bug or not yet supported?

nikic · 2017-11-26T15:02:48Z

@Juddling It's half-supported. The formatting of the array is preserved, but the new element is added on the same line. There is currently no detection that the array is formatted in multi-line style and the new element should be added in multi-line style as well.
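A minimal sketch of what such a detection could look like, using only PHP's built-in tokenizer rather than PHP-Parser internals. The helper name `isMultiLineArray` is hypothetical, and the heuristic (a line break in the whitespace directly inside the first `[...]`) is an assumption made for illustration, not a description of what the library does:

```php
<?php
// Hypothetical helper, not part of PHP-Parser. It reports whether the first
// "[...]" in $code has a line break between its top-level items.
// Limitation: it cannot tell an array literal from an index access like $a[0].
function isMultiLineArray(string $code): bool
{
    $depth = 0;
    foreach (token_get_all($code) as $token) {
        // Tokens are either [id, text, line] arrays or single-character strings.
        $text = is_array($token) ? $token[1] : $token;

        if ($text === '[') {
            $depth++;
        } elseif ($text === ']') {
            if (--$depth === 0) {
                return false; // outer array closed without seeing a newline
            }
        } elseif ($depth === 1
            && is_array($token)
            && $token[0] === T_WHITESPACE
            && strpos($text, "\n") !== false
        ) {
            return true; // whitespace between top-level items spans lines
        }
    }
    return false;
}
```

With a check like this, a printer could decide to put an appended item on its own line when the existing items are already laid out that way, and keep it inline otherwise.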

nikic · 2017-12-26T20:22:02Z

@Juddling I've just landed 1c7fd31, so this case should now work correctly (that is, use multi-line formatting).

nikic · 2018-01-27T18:21:25Z

I've released a beta version for PHP-Parser 4.0 (https://github.com/nikic/PHP-Parser/releases/tag/v4.0.0beta1) and plan to create a stable release soonishly. I'd say that at this point, this feature is working pretty well, and it just comes down to fixing formatting issues as they come up. If I'm going to wait until this functionality is "finished", I'm afraid we'll never see a release...

lisachenko · 2018-01-27T19:02:53Z

@nikic it is definitely a good point not to delay the release (you will be able to release the next major if needed).

Anyway, this feature is very important for many developers like me :) The current PHP-Parser implementation is cool, but if some code changes are needed, developers have to recombine the AST with tokens in order to keep the formatting, like this. It's also impossible to generate custom code with the required formatting from the AST.

So will it be possible to work on this feature later? It will require a lot of changes to switch from lexical parsing to raw nodes, though, because whitespace nodes are skipped :(

PS. That is why the AST implementation in Java (IDEA engine) provides access to whitespace nodes. But you need to use the special methods getNextSibling(), getPreviousSibling(), getChildren(), getParent() to traverse such an AST manually and wrap it into meaningful nodes. For example, for PHP it would be like this: search for the namespace node, then, if you want the classes in it, scan all its non-whitespace child nodes.
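To illustrate the idea of recombining a change with the original tokens, here is a small sketch that relies only on PHP's built-in `token_get_all()`. Because the token stream keeps whitespace and comments, re-joining it reproduces the source, and a single token can be swapped without touching anything else. The function name `replaceTokenText` is hypothetical, and working by raw token index is a simplification chosen for the example:

```php
<?php
// Sketch only: swap the text of one token and re-join the stream. Every other
// token, including whitespace and comments, is emitted exactly as written.
// $pos is assumed to be an index into the array returned by token_get_all().
function replaceTokenText(string $code, int $pos, string $replacement): string
{
    $out = '';
    foreach (token_get_all($code) as $i => $token) {
        // Tokens are either [id, text, line] arrays or single-character strings.
        $text = is_array($token) ? $token[1] : $token;
        $out .= ($i === $pos) ? $replacement : $text;
    }
    return $out;
}
```

An AST node that carries start and end token positions could be mapped onto such a stream in the same way, so that only the tokens belonging to changed nodes are regenerated.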

rainbow-alex · 2019-01-19T15:34:27Z

What about preserving redundant parentheses? When I parse:

<?php echo (3);

I get:

array(
    0: Stmt_Echo(
        exprs: array(
            0: Scalar_LNumber(
                value: 3
            )
        )
    )
)

Which makes it almost impossible to restore the parentheses without some very complicated walking.
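One possible way to approach this, sketched under assumptions: if a node's start and end token positions are known (as with the startTokenPos and endTokenPos attributes in the boilerplate earlier in this thread) and the tokens are in `token_get_all()` format, the neighbouring tokens can be inspected directly. The helper `isWrappedInParens` is hypothetical and not part of PHP-Parser:

```php
<?php
// Hypothetical helper: checks whether the token range [$start, $end] is
// directly wrapped in "(" ... ")", ignoring whitespace and comments.
// Caveat: the parentheses of a call such as f(3) look identical, so a real
// implementation would also need to consult the parent node.
function isWrappedInParens(array $tokens, int $start, int $end): bool
{
    $skip = [T_WHITESPACE, T_COMMENT, T_DOC_COMMENT];
    $count = count($tokens);

    // Walk left from the token just before the range.
    $left = $start - 1;
    while ($left >= 0 && is_array($tokens[$left])
        && in_array($tokens[$left][0], $skip, true)) {
        $left--;
    }

    // Walk right from the token just after the range.
    $right = $end + 1;
    while ($right < $count && is_array($tokens[$right])
        && in_array($tokens[$right][0], $skip, true)) {
        $right++;
    }

    return $left >= 0 && $right < $count
        && $tokens[$left] === '(' && $tokens[$right] === ')';
}
```

For `token_get_all('<?php echo (3);')`, the literal `3` sits at index 4, so `isWrappedInParens($tokens, 4, 4)` reports that the number is parenthesized.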

eeliu · 2019-04-08T06:22:35Z

Hi, I may have run into a "half-supported" case. Here is my code:

namespace app;
//use PDO;

class Foo
{

}

I want to add a trailing node to it:

namespace app;
//use PDO;

class Foo
{

}
require_once __CACHE__."header.php"

If the file includes "use xxx;", everything is fine and the formatting is as expected.
If not, the file is reformatted:

namespace app;

//use PDO;                               <---------------------here -----------------
class Foo
{
}
require_once __CACHE__."header.php"

My code:

......
public function leaveNode(Node $node) {
......
    if ($node instanceof Node\Stmt\Namespace_) {
        $express = new Node\Stmt\Expression(
            new Node\Expr\Include_(
                new Node\Scalar\String_('xxxx'),
                Node\Expr\Include_::TYPE_REQUIRE
            )
        );
        $node->stmts[] = $express;
        return $node;
    }
......
}
......

$curNode = $traverser->traverse($origStmts);
$newCode = $printer->printFormatPreserving(
    $curNode,
    $origStmts,
    $lexer->getTokens()
);

nikic · 2019-05-12T08:42:52Z

@eeliu I can't reproduce this issue. The following test passes for me:

<?php
namespace app;

class
Foo
{}
-----
$stmts[0]->stmts[] = new Stmt\Echo_([new Scalar\String_('Test')]);
-----
<?php
namespace app;

class
Foo
{}
echo 'Test';

Adding a use also didn't make a difference to the behavior.

eeliu · 2019-05-13T07:28:04Z

@nikic Try adding a comment to it:

namespace app;
//use PDO; 

class Foo

becomes

namespace app;

//use PDO;                               <---------------------here -----------------
class Foo

kirugan · 2019-07-25T10:25:56Z

Unfortunately, for me this simple code:

<?php
function test() {
  global $x, $y;
  var_dump($x + $y);
}

becomes this after deleting the global statement:

<?php
function test()
{
    var_dump($x + $y);
}

The new code doesn't preserve the indentation or the position of the function's opening brace.

nikic · 2023-05-21T15:57:16Z

Closing this tracking issue as done. If there are issues with formatting preservation, they should get reported as separate issues.

nikic added this to the 4.0 milestone Jan 24, 2017

nikic mentioned this issue Jan 24, 2017

Optionally add nodes for whitespace #41

Closed

loren-osborn mentioned this issue Mar 20, 2017

Seeking more flexibility is ways I can dynamically transform PHP code goaop/framework#317

Closed

ao2 mentioned this issue Apr 28, 2017

[console] Standardize the second argument of Command::addOption(). hechoendrupal/drupal-console#3286

Merged

TomasVotruba mentioned this issue Jul 16, 2017

[Symfony] New Rector: named services in Symfony => constructor injection rectorphp/rector#2

Merged

TomasVotruba mentioned this issue Jul 20, 2017

Format-preserving bug for multipes NodeVisitors #400

Closed

TomasVotruba mentioned this issue Aug 6, 2017

Add minimal starndard coding standard, so print is nice rectorphp/rector#1

Closed

TomasVotruba mentioned this issue Sep 21, 2017

MemoizingParser: Fix parent interface misstype of parse() method Roave/BetterReflection#371

Closed

nikic mentioned this issue Sep 26, 2017

Could the Standard pretty printer add a space after "use" keyword #418

Closed

nikic removed this from the 4.0 milestone Feb 28, 2018

bm-pzegardlo mentioned this issue Jun 28, 2018

Preserve whitespace/indentation during transformations Roave/FunctionFQNReplacer#2

Open

rask mentioned this issue Jun 24, 2019

Line-level @codeCoverageIgnore is not respected infection/infection#709

Open

vovafeldman mentioned this issue Dec 24, 2019

Code being reformatted poorly Freemius/wordpress-sdk#380

Open

TomasVotruba mentioned this issue Apr 22, 2020

Additional rectors are ran despite not being referenced in my config rectorphp/rector#3212

Closed

TomasVotruba mentioned this issue May 5, 2020

Multiple lines become single line rectorphp/rector#3316

Closed

TomasVotruba mentioned this issue Sep 30, 2020

[FPDP] Format preserving doc printer rectorphp/rector#4334

Closed

TomasVotruba mentioned this issue Nov 10, 2020

Symfony Route annotations are broken by Rector DEAD_CODE rectorphp/rector#4573

Closed

samsonasik mentioned this issue Jan 19, 2021

PHPStan Fixes codeigniter4/CodeIgniter4#4136

Merged

4 tasks

TomasVotruba mentioned this issue Feb 7, 2021

AnnotationToAttributeRector removes empty lines rectorphp/rector#5447

Closed

TomasVotruba mentioned this issue Jun 27, 2022

Rector makes whitespace changes even when no rules are run rectorphp/rector#7258

Closed

TomasVotruba mentioned this issue Nov 22, 2022

Updating a node comments removes empty lines between the comment block and the node rectorphp/rector#7617

Closed

samsonasik mentioned this issue Feb 20, 2023

Incorrect behavior of RemoveDeadStmtRector (and others?) with short echo tags rectorphp/rector#7789

Closed

nikic closed this as completed May 21, 2023

n-valverde mentioned this issue Jun 10, 2023

fix: api:upgrade-resource output formatting api-platform/core#5624

Merged

kkmuffme mentioned this issue Feb 11, 2024

Documentation: Clearly and visibly document that rector does not work reliably when PHP closing tags are used anywhere in the file rectorphp/rector#8479

Closed

mindplay-dk mentioned this issue Mar 11, 2024

README: "AST doesn't know about spaces" rectorphp/rector#8537

Closed
