Some introductions to the Abstract Syntax Tree (AST) in the new features of PHP7

不言
Release: 2023-04-03 10:22:02
Original
2141 people have browsed it

This article shares with you some introduction to the Abstract Syntax Tree (AST) in the new features of PHP7. The content is very good. Friends in need can refer to it. I hope it can help everyone.

This article analyzes the changes brought about by the new feature of PHP7, Abstract Syntax Tree (AST). Share it with everyone for your reference, as follows:

Most of the content here is based on the RFC document of AST: https://wiki.php.net/rfc/abstractsyntaxtree, which is excerpted from the source document for ease of understanding. section is introduced.

This article will not tell you what an abstract syntax tree is. This needs to be understood by yourself. This article only describes some changes that AST brings to PHP.

New execution process

An important change in the core of PHP7 is the addition of AST. In PHP5, the execution process from php scripts to opcodes is:

  1. Lexing: lexical scanning analysis, converting source files into token streams;

  2. Parsing: Syntax analysis, op arrays are generated at this stage.

In PHP7, op arrays are no longer directly generated during the syntax analysis phase, but AST is generated first, so there is one more step in the process:

  1. Lexing : Lexical scan analysis, convert the source file into a token stream;

  2. Parsing: Syntax analysis, generate an abstract syntax tree from the token stream;

  3. Compilation: Generate op arrays from abstract syntax trees.

Execution time and memory consumption

From the above steps, this is one more step than the previous process, so according to common sense, this will increase the program execution time and memory usage. But in fact, the memory usage has indeed increased, but the execution time has decreased.

The following results are obtained by testing three scripts: small (about 100 lines of code), medium (about 700 lines), and large (about 2800 lines). Test script: https://gist.github .com/nikic/289b0c7538b46c2220bc.

Execution time of compiling each file 100 times (note that the test results of the article are from 14 years, when PHP7 was still called PHP-NG):


php-ng php-ast diff
SMALL 0.180s 0.160s -12.5%
MEDIUM 1.492s 1.268s -17.7%
LARGE 6.703s 5.736s -16.9%

Memory peak in a single compilation:

##MEDIUM 507kB 643kB 26.8% LARGE 1084kB 1857kB 71.3% ##The test results of a single compilation may not represent actual use , the following are the results of a complete project test using PhpParser:

php-ng php-ast diff
SMALL 378kB 414kB 9.5 %

Tests show that after using AST, the overall execution time of the program is improved by about 10% to 15%, but the memory consumption also increases. The increase is obvious in a single compilation of large files, but not during the entire project execution process. Very serious problem.

Also note that the above results are all without Opcache. When Opcache is turned on in a production environment, the increase in memory consumption is not a big problem.

Semantic changes

If it is just a time optimization, it does not seem to be a sufficient reason to use AST. In fact, the implementation of AST is not based on time optimization considerations, but to solve syntax problems. Let’s take a look at some changes in semantics.

yield does not require parentheses

In the PHP5 implementation, ifyieldis used in an expression context (such as on the right side of an assignment expression), you You must use parentheses on both sides of theyielddeclaration:


        
Copy after login

This behavior is only due to the implementation limitations of PHP5. In PHP7, parentheses are no longer necessary. Therefore, the following writing methods are also legal:

 $v;
Copy after login

Of course, you must follow the application scenarios ofyield.

Brackets do not affect behavior

In PHP5,($foo)['bar'] = 'baz'and$foo['bar'] = 'baz'The two statements have different meanings. In fact, the former way of writing is illegal, and you will get the following error:


        
Copy after login

But in PHP7, the two ways of writing mean the same thing.

Similarly, if the parameters of the function are wrapped in parentheses, there is a problem with type checking. This problem has also been solved in PHP7:


        
Copy after login

The above code will not alarm in PHP5 unless # is used ##byRef(func()), but in PHP7, regardless of whether there are parentheses on both sides offunc(), the following error will occur:

$obj->__clone()to call the__clonemethod.__cloneis the only magic method that was previously prohibited from being called directly. Previously you would get an error like this:

php-ast diff TIME 25.5ms 22.8ms -11.8% MEMORY 2360kB 2482kB 5.1% PHP Strict standards: Only variables should be passed by reference ...Changes in list() The behavior of thelist keyword has changed a lot. The order in which list assigns values to variables (the order on the left and right of the equal sign at the same time) used to be from right to left, but now it is from left to right:

           
Copy after login
The reason for the above change is precisely because in the assignment process of PHP5, 3 would be filled into the array first, 1 last, but now the order has changed.The same changes are:

           
Copy after login
This is because in the previous assignment process, $b first got 2, and then the value of $a became 1, but now $a changes first It becomes 1 and is no longer an array, so $b becomes null.list is now accessed only once per offset:

           
Copy after login
Empty list members are now all prohibited, previously only under certain circumstances:

           
Copy after login
The order of reference assignmentThe order of reference assignment is from right to left in PHP5, and now from left to right:
a = &$obj->b; $obj->b = 1; var_dump($obj); // PHP5: object(stdClass)#1 (2) { ["b"] => &int(1) ["a"] => &int(1) } // PHP7: object(stdClass)#1 (2) { ["a"] => &int(1) ["b"] => &int(1) }
Copy after login
__clone method can be called directlyNow You can directly useFatal error: Cannot call __clone() method on objects - use 'clone $ obj' instead in...Variable syntax consistencyAST also solves some syntax consistency issues, which were raised in another RFC: https:// wiki.php.net/rfc/uniform_variable_syntax.In the new implementation, the meaning of some previous grammatical expressions is somewhat different from the current one. For details, please refer to the following table:
##php-ng
Expression PHP5 PHP7 ##$$foo['bar']['baz'] $foo->$bar['baz'] $foo->$bar['baz']() ##Foo::$bar['baz']() Foo::{$bar['baz']}() (Foo::$bar)['baz']() On the whole, the previous order is from right to left, now it is from left to right, and it also follows the principle that brackets do not affect behavior. These complex variable writing methods need to be paid attention to in actual development.
${$foo['bar']['baz']} ($$foo)['bar']['baz']
$foo->{$bar['baz']} ($foo->$bar)[' baz']
$foo->{$bar['baz']}( ) ($foo->$bar)['baz']()
Related recommendations:

Five ways to teach you how to turn off php error echo information


FastCGI protocol in php source code Analysis

The above is the detailed content of Some introductions to the Abstract Syntax Tree (AST) in the new features of PHP7. For more information, please follow other related articles on the PHP Chinese website!

Related labels:
source:php.cn
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Latest Downloads
More>
Web Effects
Website Source Code
Website Materials
Front End Template
About us Disclaimer Sitemap
php.cn:Public welfare online PHP training,Help PHP learners grow quickly!