Spaces:
Running
Running
| =encoding utf8 | |
| =head1 NAME | |
| perl5160delta - what is new for perl v5.16.0 | |
| =head1 DESCRIPTION | |
| This document describes differences between the 5.14.0 release and | |
| the 5.16.0 release. | |
| If you are upgrading from an earlier release such as 5.12.0, first read | |
| L<perl5140delta>, which describes differences between 5.12.0 and | |
| 5.14.0. | |
| Some bug fixes in this release have been backported to later | |
| releases of 5.14.x. Those are indicated with the 5.14.x version in | |
| parentheses. | |
| =head1 Notice | |
| With the release of Perl 5.16.0, the 5.12.x series of releases is now out of | |
| its support period. There may be future 5.12.x releases, but only in the | |
| event of a critical security issue. Users of Perl 5.12 or earlier should | |
| consider upgrading to a more recent release of Perl. | |
| This policy is described in greater detail in | |
| L<perlpolicy|perlpolicy/MAINTENANCE AND SUPPORT>. | |
| =head1 Core Enhancements | |
| =head2 C<use I<VERSION>> | |
| As of this release, version declarations like C<use v5.16> now disable | |
| all features before enabling the new feature bundle. This means that | |
| the following holds true: | |
| use 5.016; | |
| # only 5.16 features enabled here | |
| use 5.014; | |
| # only 5.14 features enabled here (not 5.16) | |
| C<use v5.12> and higher continue to enable strict, but explicit C<use | |
| strict> and C<no strict> now override the version declaration, even | |
| when they come first: | |
| no strict; | |
| use 5.012; | |
| # no strict here | |
| There is a new ":default" feature bundle that represents the set of | |
| features enabled before any version declaration or C<use feature> has | |
| been seen. Version declarations below 5.10 now enable the ":default" | |
| feature set. This does not actually change the behavior of C<use | |
| v5.8>, because features added to the ":default" set are those that were | |
| traditionally enabled by default, before they could be turned off. | |
| C<< no feature >> now resets to the default feature set. To disable all | |
| features (which is likely to be a pretty special-purpose request, since | |
| it presumably won't match any named set of semantics) you can now | |
| write C<< no feature ':all' >>. | |
| C<$[> is now disabled under C<use v5.16>. It is part of the default | |
| feature set and can be turned on or off explicitly with C<use feature | |
| 'array_base'>. | |
| =head2 C<__SUB__> | |
| The new C<__SUB__> token, available under the C<current_sub> feature | |
| (see L<feature>) or C<use v5.16>, returns a reference to the current | |
| subroutine, making it easier to write recursive closures. | |
| =head2 New and Improved Built-ins | |
| =head3 More consistent C<eval> | |
| The C<eval> operator sometimes treats a string argument as a sequence of | |
| characters and sometimes as a sequence of bytes, depending on the | |
| internal encoding. The internal encoding is not supposed to make any | |
| difference, but there is code that relies on this inconsistency. | |
| The new C<unicode_eval> and C<evalbytes> features (enabled under C<use | |
| 5.16.0>) resolve this. The C<unicode_eval> feature causes C<eval | |
| $string> to treat the string always as Unicode. The C<evalbytes> | |
| features provides a function, itself called C<evalbytes>, which | |
| evaluates its argument always as a string of bytes. | |
| These features also fix oddities with source filters leaking to outer | |
| dynamic scopes. | |
| See L<feature> for more detail. | |
| =head3 C<substr> lvalue revamp | |
| =for comment Does this belong here, or under Incompatible Changes? | |
| When C<substr> is called in lvalue or potential lvalue context with two | |
| or three arguments, a special lvalue scalar is returned that modifies | |
| the original string (the first argument) when assigned to. | |
| Previously, the offsets (the second and third arguments) passed to | |
| C<substr> would be converted immediately to match the string, negative | |
| offsets being translated to positive and offsets beyond the end of the | |
| string being truncated. | |
| Now, the offsets are recorded without modification in the special | |
| lvalue scalar that is returned, and the original string is not even | |
| looked at by C<substr> itself, but only when the returned lvalue is | |
| read or modified. | |
| These changes result in an incompatible change: | |
| If the original string changes length after the call to C<substr> but | |
| before assignment to its return value, negative offsets will remember | |
| their position from the end of the string, affecting code like this: | |
| my $string = "string"; | |
| my $lvalue = \substr $string, -4, 2; | |
| print $$lvalue, "\n"; # prints "ri" | |
| $string = "bailing twine"; | |
| print $$lvalue, "\n"; # prints "wi"; used to print "il" | |
| The same thing happens with an omitted third argument. The returned | |
| lvalue will always extend to the end of the string, even if the string | |
| becomes longer. | |
| Since this change also allowed many bugs to be fixed (see | |
| L</The C<substr> operator>), and since the behavior | |
| of negative offsets has never been specified, the | |
| change was deemed acceptable. | |
| =head3 Return value of C<tied> | |
| The value returned by C<tied> on a tied variable is now the actual | |
| scalar that holds the object to which the variable is tied. This | |
| lets ties be weakened with C<Scalar::Util::weaken(tied | |
| $tied_variable)>. | |
| =head2 Unicode Support | |
| =head3 Supports (I<almost>) Unicode 6.1 | |
| Besides the addition of whole new scripts, and new characters in | |
| existing scripts, this new version of Unicode, as always, makes some | |
| changes to existing characters. One change that may trip up some | |
| applications is that the General Category of two characters in the | |
| Latin-1 range, PILCROW SIGN and SECTION SIGN, has been changed from | |
| Other_Symbol to Other_Punctuation. The same change has been made for | |
| a character in each of Tibetan, Ethiopic, and Aegean. | |
| The code points U+3248..U+324F (CIRCLED NUMBER TEN ON BLACK SQUARE | |
| through CIRCLED NUMBER EIGHTY ON BLACK SQUARE) have had their General | |
| Category changed from Other_Symbol to Other_Numeric. The Line Break | |
| property has changes for Hebrew and Japanese; and because of | |
| other changes in 6.1, the Perl regular expression construct C<\X> now | |
| works differently for some characters in Thai and Lao. | |
| New aliases (synonyms) have been defined for many property values; | |
| these, along with the previously existing ones, are all cross-indexed in | |
| L<perluniprops>. | |
| The return value of C<charnames::viacode()> is affected by other | |
| changes: | |
| Code point Old Name New Name | |
| U+000A LINE FEED (LF) LINE FEED | |
| U+000C FORM FEED (FF) FORM FEED | |
| U+000D CARRIAGE RETURN (CR) CARRIAGE RETURN | |
| U+0085 NEXT LINE (NEL) NEXT LINE | |
| U+008E SINGLE-SHIFT 2 SINGLE-SHIFT-2 | |
| U+008F SINGLE-SHIFT 3 SINGLE-SHIFT-3 | |
| U+0091 PRIVATE USE 1 PRIVATE USE-1 | |
| U+0092 PRIVATE USE 2 PRIVATE USE-2 | |
| U+2118 SCRIPT CAPITAL P WEIERSTRASS ELLIPTIC FUNCTION | |
| Perl will accept any of these names as input, but | |
| C<charnames::viacode()> now returns the new name of each pair. The | |
| change for U+2118 is considered by Unicode to be a correction, that is | |
| the original name was a mistake (but again, it will remain forever valid | |
| to use it to refer to U+2118). But most of these changes are the | |
| fallout of the mistake Unicode 6.0 made in naming a character used in | |
| Japanese cell phones to be "BELL", which conflicts with the longstanding | |
| industry use of (and Unicode's recommendation to use) that name | |
| to mean the ASCII control character at U+0007. Therefore, that name | |
| has been deprecated in Perl since v5.14, and any use of it will raise a | |
| warning message (unless turned off). The name "ALERT" is now the | |
| preferred name for this code point, with "BEL" an acceptable short | |
| form. The name for the new cell phone character, at code point U+1F514, | |
| remains undefined in this version of Perl (hence we don't | |
| implement quite all of Unicode 6.1), but starting in v5.18, BELL will mean | |
| this character, and not U+0007. | |
| Unicode has taken steps to make sure that this sort of mistake does not | |
| happen again. The Standard now includes all generally accepted | |
| names and abbreviations for control characters, whereas previously it | |
| didn't (though there were recommended names for most of them, which Perl | |
| used). This means that most of those recommended names are now | |
| officially in the Standard. Unicode did not recommend names for the | |
| four code points listed above between U+008E and U+008F, and in | |
| standardizing them Unicode subtly changed the names that Perl had | |
| previously given them, by replacing the final blank in each name by a | |
| hyphen. Unicode also officially accepts names that Perl had deprecated, | |
| such as FILE SEPARATOR. Now the only deprecated name is BELL. | |
| Finally, Perl now uses the new official names instead of the old | |
| (now considered obsolete) names for the first four code points in the | |
| list above (the ones which have the parentheses in them). | |
| Now that the names have been placed in the Unicode standard, these kinds | |
| of changes should not happen again, though corrections, such as to | |
| U+2118, are still possible. | |
| Unicode also added some name abbreviations, which Perl now accepts: | |
| SP for SPACE; | |
| TAB for CHARACTER TABULATION; | |
| NEW LINE, END OF LINE, NL, and EOL for LINE FEED; | |
| LOCKING-SHIFT ONE for SHIFT OUT; | |
| LOCKING-SHIFT ZERO for SHIFT IN; | |
| and ZWNBSP for ZERO WIDTH NO-BREAK SPACE. | |
| More details on this version of Unicode are provided in | |
| L<http://www.unicode.org/versions/Unicode6.1.0/>. | |
| =head3 C<use charnames> is no longer needed for C<\N{I<name>}> | |
| When C<\N{I<name>}> is encountered, the C<charnames> module is now | |
| automatically loaded when needed as if the C<:full> and C<:short> | |
| options had been specified. See L<charnames> for more information. | |
| =head3 C<\N{...}> can now have Unicode loose name matching | |
| This is described in the C<charnames> item in | |
| L</Updated Modules and Pragmata> below. | |
| =head3 Unicode Symbol Names | |
| Perl now has proper support for Unicode in symbol names. It used to be | |
| that C<*{$foo}> would ignore the internal UTF8 flag and use the bytes of | |
| the underlying representation to look up the symbol. That meant that | |
| C<*{"\x{100}"}> and C<*{"\xc4\x80"}> would return the same thing. All | |
| these parts of Perl have been fixed to account for Unicode: | |
| =over | |
| =item * | |
| Method names (including those passed to C<use overload>) | |
| =item * | |
| Typeglob names (including names of variables, subroutines, and filehandles) | |
| =item * | |
| Package names | |
| =item * | |
| C<goto> | |
| =item * | |
| Symbolic dereferencing | |
| =item * | |
| Second argument to C<bless()> and C<tie()> | |
| =item * | |
| Return value of C<ref()> | |
| =item * | |
| Subroutine prototypes | |
| =item * | |
| Attributes | |
| =item * | |
| Various warnings and error messages that mention variable names or values, | |
| methods, etc. | |
| =back | |
| In addition, a parsing bug has been fixed that prevented C<*{é}> from | |
| implicitly quoting the name, but instead interpreted it as C<*{+é}>, which | |
| would cause a strict violation. | |
| C<*{"*a::b"}> automatically strips off the * if it is followed by an ASCII | |
| letter. That has been extended to all Unicode identifier characters. | |
| One-character non-ASCII non-punctuation variables (like C<$é>) are now | |
| subject to "Used only once" warnings. They used to be exempt, as they | |
| were treated as punctuation variables. | |
| Also, single-character Unicode punctuation variables (like C<$‰>) are now | |
| supported [perl #69032]. | |
| =head3 Improved ability to mix locales and Unicode, including UTF-8 locales | |
| An optional parameter has been added to C<use locale> | |
| use locale ':not_characters'; | |
| which tells Perl to use all but the C<LC_CTYPE> and C<LC_COLLATE> | |
| portions of the current locale. Instead, the character set is assumed | |
| to be Unicode. This lets locales and Unicode be seamlessly mixed, | |
| including the increasingly frequent UTF-8 locales. When using this | |
| hybrid form of locales, the C<:locale> layer to the L<open> pragma can | |
| be used to interface with the file system, and there are CPAN modules | |
| available for ARGV and environment variable conversions. | |
| Full details are in L<perllocale>. | |
| =head3 New function C<fc> and corresponding escape sequence C<\F> for Unicode foldcase | |
| Unicode foldcase is an extension to lowercase that gives better results | |
| when comparing two strings case-insensitively. It has long been used | |
| internally in regular expression C</i> matching. Now it is available | |
| explicitly through the new C<fc> function call (enabled by | |
| S<C<"use feature 'fc'">>, or C<use v5.16>, or explicitly callable via | |
| C<CORE::fc>) or through the new C<\F> sequence in double-quotish | |
| strings. | |
| Full details are in L<perlfunc/fc>. | |
| =head3 The Unicode C<Script_Extensions> property is now supported. | |
| New in Unicode 6.0, this is an improved C<Script> property. Details | |
| are in L<perlunicode/Scripts>. | |
| =head2 XS Changes | |
| =head3 Improved typemaps for Some Builtin Types | |
| Most XS authors will know there is a longstanding bug in the | |
| OUTPUT typemap for T_AVREF (C<AV*>), T_HVREF (C<HV*>), T_CVREF (C<CV*>), | |
| and T_SVREF (C<SVREF> or C<\$foo>) that requires manually decrementing | |
| the reference count of the return value instead of the typemap taking | |
| care of this. For backwards-compatibility, this cannot be changed in the | |
| default typemaps. But we now provide additional typemaps | |
| C<T_AVREF_REFCOUNT_FIXED>, etc. that do not exhibit this bug. Using | |
| them in your extension is as simple as having one line in your | |
| C<TYPEMAP> section: | |
| HV* T_HVREF_REFCOUNT_FIXED | |
| =head3 C<is_utf8_char()> | |
| The XS-callable function C<is_utf8_char()>, when presented with | |
| malformed UTF-8 input, can read up to 12 bytes beyond the end of the | |
| string. This cannot be fixed without changing its API, and so its | |
| use is now deprecated. Use C<is_utf8_char_buf()> (described just below) | |
| instead. | |
| =head3 Added C<is_utf8_char_buf()> | |
| This function is designed to replace the deprecated L</is_utf8_char()> | |
| function. It includes an extra parameter to make sure it doesn't read | |
| past the end of the input buffer. | |
| =head3 Other C<is_utf8_foo()> functions, as well as C<utf8_to_foo()>, etc. | |
| Most other XS-callable functions that take UTF-8 encoded input | |
| implicitly assume that the UTF-8 is valid (not malformed) with respect to | |
| buffer length. Do not do things such as change a character's case or | |
| see if it is alphanumeric without first being sure that it is valid | |
| UTF-8. This can be safely done for a whole string by using one of the | |
| functions C<is_utf8_string()>, C<is_utf8_string_loc()>, and | |
| C<is_utf8_string_loclen()>. | |
| =head3 New Pad API | |
| Many new functions have been added to the API for manipulating lexical | |
| pads. See L<perlapi/Pad Data Structures> for more information. | |
| =head2 Changes to Special Variables | |
| =head3 C<$$> can be assigned to | |
| C<$$> was made read-only in Perl 5.8.0. But only sometimes: C<local $$> | |
| would make it writable again. Some CPAN modules were using C<local $$> or | |
| XS code to bypass the read-only check, so there is no reason to keep C<$$> | |
| read-only. (This change also allowed a bug to be fixed while maintaining | |
| backward compatibility.) | |
| =head3 C<$^X> converted to an absolute path on FreeBSD, OS X and Solaris | |
| C<$^X> is now converted to an absolute path on OS X, FreeBSD (without | |
| needing F</proc> mounted) and Solaris 10 and 11. This augments the | |
| previous approach of using F</proc> on Linux, FreeBSD, and NetBSD | |
| (in all cases, where mounted). | |
| This makes relocatable perl installations more useful on these platforms. | |
| (See "Relocatable @INC" in F<INSTALL>) | |
| =head2 Debugger Changes | |
| =head3 Features inside the debugger | |
| The current Perl's L<feature> bundle is now enabled for commands entered | |
| in the interactive debugger. | |
| =head3 New option for the debugger's B<t> command | |
| The B<t> command in the debugger, which toggles tracing mode, now | |
| accepts a numeric argument that determines how many levels of subroutine | |
| calls to trace. | |
| =head3 C<enable> and C<disable> | |
| The debugger now has C<disable> and C<enable> commands for disabling | |
| existing breakpoints and re-enabling them. See L<perldebug>. | |
| =head3 Breakpoints with file names | |
| The debugger's "b" command for setting breakpoints now lets a line | |
| number be prefixed with a file name. See | |
| L<perldebug/"b [file]:[line] [condition]">. | |
| =head2 The C<CORE> Namespace | |
| =head3 The C<CORE::> prefix | |
| The C<CORE::> prefix can now be used on keywords enabled by | |
| L<feature.pm|feature>, even outside the scope of C<use feature>. | |
| =head3 Subroutines in the C<CORE> namespace | |
| Many Perl keywords are now available as subroutines in the CORE namespace. | |
| This lets them be aliased: | |
| BEGIN { *entangle = \&CORE::tie } | |
| entangle $variable, $package, @args; | |
| And for prototypes to be bypassed: | |
| sub mytie(\[%$*@]$@) { | |
| my ($ref, $pack, @args) = @_; | |
| ... do something ... | |
| goto &CORE::tie; | |
| } | |
| Some of these cannot be called through references or via C<&foo> syntax, | |
| but must be called as barewords. | |
| See L<CORE> for details. | |
| =head2 Other Changes | |
| =head3 Anonymous handles | |
| Automatically generated file handles are now named __ANONIO__ when the | |
| variable name cannot be determined, rather than $__ANONIO__. | |
| =head3 Autoloaded sort Subroutines | |
| Custom sort subroutines can now be autoloaded [perl #30661]: | |
| sub AUTOLOAD { ... } | |
| @sorted = sort foo @list; # uses AUTOLOAD | |
| =head3 C<continue> no longer requires the "switch" feature | |
| The C<continue> keyword has two meanings. It can introduce a C<continue> | |
| block after a loop, or it can exit the current C<when> block. Up to now, | |
| the latter meaning was valid only with the "switch" feature enabled, and | |
| was a syntax error otherwise. Since the main purpose of feature.pm is to | |
| avoid conflicts with user-defined subroutines, there is no reason for | |
| C<continue> to depend on it. | |
| =head3 DTrace probes for interpreter phase change | |
| The C<phase-change> probes will fire when the interpreter's phase | |
| changes, which tracks the C<${^GLOBAL_PHASE}> variable. C<arg0> is | |
| the new phase name; C<arg1> is the old one. This is useful | |
| for limiting your instrumentation to one or more of: compile time, | |
| run time, or destruct time. | |
| =head3 C<__FILE__()> Syntax | |
| The C<__FILE__>, C<__LINE__> and C<__PACKAGE__> tokens can now be written | |
| with an empty pair of parentheses after them. This makes them parse the | |
| same way as C<time>, C<fork> and other built-in functions. | |
| =head3 The C<\$> prototype accepts any scalar lvalue | |
| The C<\$> and C<\[$]> subroutine prototypes now accept any scalar lvalue | |
| argument. Previously they accepted only scalars beginning with C<$> and | |
| hash and array elements. This change makes them consistent with the way | |
| the built-in C<read> and C<recv> functions (among others) parse their | |
| arguments. This means that one can override the built-in functions with | |
| custom subroutines that parse their arguments the same way. | |
| =head3 C<_> in subroutine prototypes | |
| The C<_> character in subroutine prototypes is now allowed before C<@> or | |
| C<%>. | |
| =head1 Security | |
| =head2 Use C<is_utf8_char_buf()> and not C<is_utf8_char()> | |
| The latter function is now deprecated because its API is insufficient to | |
| guarantee that it doesn't read (up to 12 bytes in the worst case) beyond | |
| the end of its input string. See | |
| L<is_utf8_char_buf()|/Added is_utf8_char_buf()>. | |
| =head2 Malformed UTF-8 input could cause attempts to read beyond the end of the buffer | |
| Two new XS-accessible functions, C<utf8_to_uvchr_buf()> and | |
| C<utf8_to_uvuni_buf()> are now available to prevent this, and the Perl | |
| core has been converted to use them. | |
| See L</Internal Changes>. | |
| =head2 C<File::Glob::bsd_glob()> memory error with GLOB_ALTDIRFUNC (CVE-2011-2728). | |
| Calling C<File::Glob::bsd_glob> with the unsupported flag | |
| GLOB_ALTDIRFUNC would cause an access violation / segfault. A Perl | |
| program that accepts a flags value from an external source could expose | |
| itself to denial of service or arbitrary code execution attacks. There | |
| are no known exploits in the wild. The problem has been corrected by | |
| explicitly disabling all unsupported flags and setting unused function | |
| pointers to null. Bug reported by Clément Lecigne. (5.14.2) | |
| =head2 Privileges are now set correctly when assigning to C<$(> | |
| A hypothetical bug (probably unexploitable in practice) because the | |
| incorrect setting of the effective group ID while setting C<$(> has been | |
| fixed. The bug would have affected only systems that have C<setresgid()> | |
| but not C<setregid()>, but no such systems are known to exist. | |
| =head1 Deprecations | |
| =head2 Don't read the Unicode data base files in F<lib/unicore> | |
| It is now deprecated to directly read the Unicode data base files. | |
| These are stored in the F<lib/unicore> directory. Instead, you should | |
| use the new functions in L<Unicode::UCD>. These provide a stable API, | |
| and give complete information. | |
| Perl may at some point in the future change or remove these files. The | |
| file which applications were most likely to have used is | |
| F<lib/unicore/ToDigit.pl>. L<Unicode::UCD/prop_invmap()> can be used to | |
| get at its data instead. | |
| =head2 XS functions C<is_utf8_char()>, C<utf8_to_uvchr()> and | |
| C<utf8_to_uvuni()> | |
| This function is deprecated because it could read beyond the end of the | |
| input string. Use the new L<is_utf8_char_buf()|/Added is_utf8_char_buf()>, | |
| C<utf8_to_uvchr_buf()> and C<utf8_to_uvuni_buf()> instead. | |
| =head1 Future Deprecations | |
| This section serves as a notice of features that are I<likely> to be | |
| removed or L<deprecated|perlpolicy/deprecated> in the next release of | |
| perl (5.18.0). If your code depends on these features, you should | |
| contact the Perl 5 Porters via the L<mailing | |
| list|http://lists.perl.org/list/perl5-porters.html> or L<perlbug> to | |
| explain your use case and inform the deprecation process. | |
| =head2 Core Modules | |
| These modules may be marked as deprecated I<from the core>. This only | |
| means that they will no longer be installed by default with the core | |
| distribution, but will remain available on the CPAN. | |
| =over | |
| =item * | |
| CPANPLUS | |
| =item * | |
| Filter::Simple | |
| =item * | |
| PerlIO::mmap | |
| =item * | |
| Pod::LaTeX | |
| =item * | |
| Pod::Parser | |
| =item * | |
| SelfLoader | |
| =item * | |
| Text::Soundex | |
| =item * | |
| Thread.pm | |
| =back | |
| =head2 Platforms with no supporting programmers | |
| These platforms will probably have their | |
| special build support removed during the | |
| 5.17.0 development series. | |
| =over | |
| =item * | |
| BeOS | |
| =item * | |
| djgpp | |
| =item * | |
| dgux | |
| =item * | |
| EPOC | |
| =item * | |
| MPE/iX | |
| =item * | |
| Rhapsody | |
| =item * | |
| UTS | |
| =item * | |
| VM/ESA | |
| =back | |
| =head2 Other Future Deprecations | |
| =over | |
| =item * | |
| Swapping of $< and $> | |
| For more information about this future deprecation, see L<the relevant RT | |
| ticket|https://github.com/Perl/perl5/issues/11547>. | |
| =item * | |
| sfio, stdio | |
| Perl supports being built without PerlIO proper, using a stdio or sfio | |
| wrapper instead. A perl build like this will not support IO layers and | |
| thus Unicode IO, making it rather handicapped. | |
| PerlIO supports a C<stdio> layer if stdio use is desired, and similarly a | |
| sfio layer could be produced. | |
| =item * | |
| Unescaped literal C<< "{" >> in regular expressions. | |
| Starting with v5.20, it is planned to require a literal C<"{"> to be | |
| escaped, for example by preceding it with a backslash. In v5.18, a | |
| deprecated warning message will be emitted for all such uses. | |
| This affects only patterns that are to match a literal C<"{">. Other | |
| uses of this character, such as part of a quantifier or sequence as in | |
| those below, are completely unaffected: | |
| /foo{3,5}/ | |
| /\p{Alphabetic}/ | |
| /\N{DIGIT ZERO} | |
| Removing this will permit extensions to Perl's pattern syntax and better | |
| error checking for existing syntax. See L<perlre/Quantifiers> for an | |
| example. | |
| =item * | |
| Revamping C<< "\Q" >> semantics in double-quotish strings when combined with other escapes. | |
| There are several bugs and inconsistencies involving combinations | |
| of C<\Q> and escapes like C<\x>, C<\L>, etc., within a C<\Q...\E> pair. | |
| These need to be fixed, and doing so will necessarily change current | |
| behavior. The changes have not yet been settled. | |
| =back | |
| =head1 Incompatible Changes | |
| =head2 Special blocks called in void context | |
| Special blocks (C<BEGIN>, C<CHECK>, C<INIT>, C<UNITCHECK>, C<END>) are now | |
| called in void context. This avoids wasteful copying of the result of the | |
| last statement [perl #108794]. | |
| =head2 The C<overloading> pragma and regexp objects | |
| With C<no overloading>, regular expression objects returned by C<qr//> are | |
| now stringified as "Regexp=REGEXP(0xbe600d)" instead of the regular | |
| expression itself [perl #108780]. | |
| =head2 Two XS typemap Entries removed | |
| Two presumably unused XS typemap entries have been removed from the | |
| core typemap: T_DATAUNIT and T_CALLBACK. If you are, against all odds, | |
| a user of these, please see the instructions on how to restore them | |
| in L<perlxstypemap>. | |
| =head2 Unicode 6.1 has incompatibilities with Unicode 6.0 | |
| These are detailed in L</Supports (almost) Unicode 6.1> above. | |
| You can compile this version of Perl to use Unicode 6.0. See | |
| L<perlunicode/Hacking Perl to work on earlier Unicode versions (for very serious hackers only)>. | |
| =head2 Borland compiler | |
| All support for the Borland compiler has been dropped. The code had not | |
| worked for a long time anyway. | |
| =head2 Certain deprecated Unicode properties are no longer supported by default | |
| Perl should never have exposed certain Unicode properties that are used | |
| by Unicode internally and not meant to be publicly available. Use of | |
| these has generated deprecated warning messages since Perl 5.12. The | |
| removed properties are Other_Alphabetic, | |
| Other_Default_Ignorable_Code_Point, Other_Grapheme_Extend, | |
| Other_ID_Continue, Other_ID_Start, Other_Lowercase, Other_Math, and | |
| Other_Uppercase. | |
| Perl may be recompiled to include any or all of them; instructions are | |
| given in | |
| L<perluniprops/Unicode character properties that are NOT accepted by Perl>. | |
| =head2 Dereferencing IO thingies as typeglobs | |
| The C<*{...}> operator, when passed a reference to an IO thingy (as in | |
| C<*{*STDIN{IO}}>), creates a new typeglob containing just that IO object. | |
| Previously, it would stringify as an empty string, but some operators would | |
| treat it as undefined, producing an "uninitialized" warning. | |
| Now it stringifies as __ANONIO__ [perl #96326]. | |
| =head2 User-defined case-changing operations | |
| This feature was deprecated in Perl 5.14, and has now been removed. | |
| The CPAN module L<Unicode::Casing> provides better functionality without | |
| the drawbacks that this feature had, as are detailed in the 5.14 | |
| documentation: | |
| L<http://perldoc.perl.org/5.14.0/perlunicode.html#User-Defined-Case-Mappings-%28for-serious-hackers-only%29> | |
| =head2 XSUBs are now 'static' | |
| XSUB C functions are now 'static', that is, they are not visible from | |
| outside the compilation unit. Users can use the new C<XS_EXTERNAL(name)> | |
| and C<XS_INTERNAL(name)> macros to pick the desired linking behavior. | |
| The ordinary C<XS(name)> declaration for XSUBs will continue to declare | |
| non-'static' XSUBs for compatibility, but the XS compiler, | |
| L<ExtUtils::ParseXS> (C<xsubpp>) will emit 'static' XSUBs by default. | |
| L<ExtUtils::ParseXS>'s behavior can be reconfigured from XS using the | |
| C<EXPORT_XSUB_SYMBOLS> keyword. See L<perlxs> for details. | |
| =head2 Weakening read-only references | |
| Weakening read-only references is no longer permitted. It should never | |
| have worked anyway, and could sometimes result in crashes. | |
| =head2 Tying scalars that hold typeglobs | |
| Attempting to tie a scalar after a typeglob was assigned to it would | |
| instead tie the handle in the typeglob's IO slot. This meant that it was | |
| impossible to tie the scalar itself. Similar problems affected C<tied> and | |
| C<untie>: C<tied $scalar> would return false on a tied scalar if the last | |
| thing returned was a typeglob, and C<untie $scalar> on such a tied scalar | |
| would do nothing. | |
| We fixed this problem before Perl 5.14.0, but it caused problems with some | |
| CPAN modules, so we put in a deprecation cycle instead. | |
| Now the deprecation has been removed and this bug has been fixed. So | |
| C<tie $scalar> will always tie the scalar, not the handle it holds. To tie | |
| the handle, use C<tie *$scalar> (with an explicit asterisk). The same | |
| applies to C<tied *$scalar> and C<untie *$scalar>. | |
| =head2 IPC::Open3 no longer provides C<xfork()>, C<xclose_on_exec()> | |
| and C<xpipe_anon()> | |
| All three functions were private, undocumented, and unexported. They do | |
| not appear to be used by any code on CPAN. Two have been inlined and one | |
| deleted entirely. | |
| =head2 C<$$> no longer caches PID | |
| Previously, if one called fork(3) from C, Perl's | |
| notion of C<$$> could go out of sync with what getpid() returns. By always | |
| fetching the value of C<$$> via getpid(), this potential bug is eliminated. | |
| Code that depends on the caching behavior will break. As described in | |
| L<Core Enhancements|/C<$$> can be assigned to>, | |
| C<$$> is now writable, but it will be reset during a | |
| fork. | |
| =head2 C<$$> and C<getppid()> no longer emulate POSIX semantics under LinuxThreads | |
| The POSIX emulation of C<$$> and C<getppid()> under the obsolete | |
| LinuxThreads implementation has been removed. | |
| This only impacts users of Linux 2.4 and | |
| users of Debian GNU/kFreeBSD up to and including 6.0, not the vast | |
| majority of Linux installations that use NPTL threads. | |
| This means that C<getppid()>, like C<$$>, is now always guaranteed to | |
| return the OS's idea of the current state of the process, not perl's | |
| cached version of it. | |
| See the documentation for L<$$|perlvar/$$> for details. | |
| =head2 C<< $< >>, C<< $> >>, C<$(> and C<$)> are no longer cached | |
| Similarly to the changes to C<$$> and C<getppid()>, the internal | |
| caching of C<< $< >>, C<< $> >>, C<$(> and C<$)> has been removed. | |
| When we cached these values our idea of what they were would drift out | |
| of sync with reality if someone (e.g., someone embedding perl) called | |
| C<sete?[ug]id()> without updating C<PL_e?[ug]id>. Having to deal with | |
| this complexity wasn't worth it given how cheap the C<gete?[ug]id()> | |
| system call is. | |
| This change will break a handful of CPAN modules that use the XS-level | |
| C<PL_uid>, C<PL_gid>, C<PL_euid> or C<PL_egid> variables. | |
| The fix for those breakages is to use C<PerlProc_gete?[ug]id()> to | |
| retrieve them (e.g., C<PerlProc_getuid()>), and not to assign to | |
| C<PL_e?[ug]id> if you change the UID/GID/EUID/EGID. There is no longer | |
| any need to do so since perl will always retrieve the up-to-date | |
| version of those values from the OS. | |
| =head2 Which Non-ASCII characters get quoted by C<quotemeta> and C<\Q> has changed | |
| This is unlikely to result in a real problem, as Perl does not attach | |
| special meaning to any non-ASCII character, so it is currently | |
| irrelevant which are quoted or not. This change fixes bug [perl #77654] and | |
| brings Perl's behavior more into line with Unicode's recommendations. | |
| See L<perlfunc/quotemeta>. | |
| =head1 Performance Enhancements | |
| =over | |
| =item * | |
| Improved performance for Unicode properties in regular expressions | |
| =for comment Can this be compacted some? -- rjbs, 2012-02-20 | |
| Matching a code point against a Unicode property is now done via a | |
| binary search instead of linear. This means for example that the worst | |
| case for a 1000 item property is 10 probes instead of 1000. This | |
| inefficiency has been compensated for in the past by permanently storing | |
| in a hash the results of a given probe plus the results for the adjacent | |
| 64 code points, under the theory that near-by code points are likely to | |
| be searched for. A separate hash was used for each mention of a Unicode | |
| property in each regular expression. Thus, C<qr/\p{foo}abc\p{foo}/> | |
| would generate two hashes. Any probes in one instance would be unknown | |
| to the other, and the hashes could expand separately to be quite large | |
| if the regular expression were used on many different widely-separated | |
| code points. | |
| Now, however, there is just one hash shared by all instances of a given | |
| property. This means that if C<\p{foo}> is matched against "A" in one | |
| regular expression in a thread, the result will be known immediately to | |
| all regular expressions, and the relentless march of using up memory is | |
| slowed considerably. | |
| =item * | |
| Version declarations with the C<use> keyword (e.g., C<use 5.012>) are now | |
| faster, as they enable features without loading F<feature.pm>. | |
| =item * | |
| C<local $_> is faster now, as it no longer iterates through magic that it | |
| is not going to copy anyway. | |
| =item * | |
| Perl 5.12.0 sped up the destruction of objects whose classes define | |
| empty C<DESTROY> methods (to prevent autoloading), by simply not | |
| calling such empty methods. This release takes this optimization a | |
| step further, by not calling any C<DESTROY> method that begins with a | |
| C<return> statement. This can be useful for destructors that are only | |
| used for debugging: | |
| use constant DEBUG => 1; | |
| sub DESTROY { return unless DEBUG; ... } | |
| Constant-folding will reduce the first statement to C<return;> if DEBUG | |
| is set to 0, triggering this optimization. | |
| =item * | |
| Assigning to a variable that holds a typeglob or copy-on-write scalar | |
| is now much faster. Previously the typeglob would be stringified or | |
| the copy-on-write scalar would be copied before being clobbered. | |
| =item * | |
| Assignment to C<substr> in void context is now more than twice its | |
| previous speed. Instead of creating and returning a special lvalue | |
| scalar that is then assigned to, C<substr> modifies the original string | |
| itself. | |
| =item * | |
| C<substr> no longer calculates a value to return when called in void | |
| context. | |
| =item * | |
| Due to changes in L<File::Glob>, Perl's C<glob> function and its C<< | |
| <...> >> equivalent are now much faster. The splitting of the pattern | |
| into words has been rewritten in C, resulting in speed-ups of 20% for | |
| some cases. | |
| This does not affect C<glob> on VMS, as it does not use File::Glob. | |
| =item * | |
| The short-circuiting operators C<&&>, C<||>, and C<//>, when chained | |
| (such as C<$a || $b || $c>), are now considerably faster to short-circuit, | |
| due to reduced optree traversal. | |
| =item * | |
| The implementation of C<s///r> makes one fewer copy of the scalar's value. | |
| =item * | |
| Recursive calls to lvalue subroutines in lvalue scalar context use less | |
| memory. | |
| =back | |
| =head1 Modules and Pragmata | |
| =head2 Deprecated Modules | |
| =over | |
| =item L<Version::Requirements> | |
| Version::Requirements is now DEPRECATED, use L<CPAN::Meta::Requirements>, | |
| which is a drop-in replacement. It will be deleted from perl.git blead | |
| in v5.17.0. | |
| =back | |
| =head2 New Modules and Pragmata | |
| =over 4 | |
| =item * | |
| L<arybase> -- this new module implements the C<$[> variable. | |
| =item * | |
| L<PerlIO::mmap> 0.010 has been added to the Perl core. | |
| The C<mmap> PerlIO layer is no longer implemented by perl itself, but has | |
| been moved out into the new L<PerlIO::mmap> module. | |
| =back | |
| =head2 Updated Modules and Pragmata | |
| This is only an overview of selected module updates. For a complete list of | |
| updates, run: | |
| $ corelist --diff 5.14.0 5.16.0 | |
| You can substitute your favorite version in place of 5.14.0, too. | |
| =over 4 | |
| =item * | |
| L<Archive::Extract> has been upgraded from version 0.48 to 0.58. | |
| Includes a fix for FreeBSD to only use C<unzip> if it is located in | |
| C</usr/local/bin>, as FreeBSD 9.0 will ship with a limited C<unzip> in | |
| C</usr/bin>. | |
| =item * | |
| L<Archive::Tar> has been upgraded from version 1.76 to 1.82. | |
| Adjustments to handle files >8gb (>0777777777777 octal) and a feature | |
| to return the MD5SUM of files in the archive. | |
| =item * | |
| L<base> has been upgraded from version 2.16 to 2.18. | |
| C<base> no longer sets a module's C<$VERSION> to "-1" when a module it | |
| loads does not define a C<$VERSION>. This change has been made because | |
| "-1" is not a valid version number under the new "lax" criteria used | |
| internally by C<UNIVERSAL::VERSION>. (See L<version> for more on "lax" | |
| version criteria.) | |
| C<base> no longer internally skips loading modules it has already loaded | |
| and instead relies on C<require> to inspect C<%INC>. This fixes a bug | |
| when C<base> is used with code that clear C<%INC> to force a module to | |
| be reloaded. | |
| =item * | |
| L<Carp> has been upgraded from version 1.20 to 1.26. | |
| It now includes last read filehandle info and puts a dot after the file | |
| and line number, just like errors from C<die> [perl #106538]. | |
| =item * | |
| L<charnames> has been updated from version 1.18 to 1.30. | |
| C<charnames> can now be invoked with a new option, C<:loose>, | |
| which is like the existing C<:full> option, but enables Unicode loose | |
| name matching. Details are in L<charnames/LOOSE MATCHES>. | |
| =item * | |
| L<B::Deparse> has been upgraded from version 1.03 to 1.14. This fixes | |
| numerous deparsing bugs. | |
| =item * | |
| L<CGI> has been upgraded from version 3.52 to 3.59. | |
| It uses the public and documented FCGI.pm API in CGI::Fast. CGI::Fast was | |
| using an FCGI API that was deprecated and removed from documentation | |
| more than ten years ago. Usage of this deprecated API with FCGI E<gt>= | |
| 0.70 or FCGI E<lt>= 0.73 introduces a security issue. | |
| L<https://rt.cpan.org/Public/Bug/Display.html?id=68380> | |
| L<http://web.nvd.nist.gov/view/vuln/detail?vulnId=CVE-2011-2766> | |
| Things that may break your code: | |
| C<url()> was fixed to return C<PATH_INFO> when it is explicitly requested | |
| with either the C<path=E<gt>1> or C<path_info=E<gt>1> flag. | |
| If your code is running under mod_rewrite (or compatible) and you are | |
| calling C<self_url()> or you are calling C<url()> and passing | |
| C<path_info=E<gt>1>, these methods will actually be returning | |
| C<PATH_INFO> now, as you have explicitly requested or C<self_url()> | |
| has requested on your behalf. | |
| The C<PATH_INFO> has been omitted in such URLs since the issue was | |
| introduced in the 3.12 release in December, 2005. | |
| This bug is so old your application may have come to depend on it or | |
| workaround it. Check for application before upgrading to this release. | |
| Examples of affected method calls: | |
| $q->url(-absolute => 1, -query => 1, -path_info => 1); | |
| $q->url(-path=>1); | |
| $q->url(-full=>1,-path=>1); | |
| $q->url(-rewrite=>1,-path=>1); | |
| $q->self_url(); | |
| We no longer read from STDIN when the Content-Length is not set, | |
| preventing requests with no Content-Length from sometimes freezing. | |
| This is consistent with the CGI RFC 3875, and is also consistent with | |
| CGI::Simple. However, the old behavior may have been expected by some | |
| command-line uses of CGI.pm. | |
| In addition, the DELETE HTTP verb is now supported. | |
| =item * | |
| L<Compress::Zlib> has been upgraded from version 2.035 to 2.048. | |
| IO::Compress::Zip and IO::Uncompress::Unzip now have support for LZMA | |
| (method 14). There is a fix for a CRC issue in IO::Compress::Unzip and | |
| it supports Streamed Stored context now. And fixed a Zip64 issue in | |
| IO::Compress::Zip when the content size was exactly 0xFFFFFFFF. | |
| =item * | |
| L<Digest::SHA> has been upgraded from version 5.61 to 5.71. | |
| Added BITS mode to the addfile method and shasum. This makes | |
| partial-byte inputs possible via files/STDIN and lets shasum check | |
| all 8074 NIST Msg vectors, where previously special programming was | |
| required to do this. | |
| =item * | |
| L<Encode> has been upgraded from version 2.42 to 2.44. | |
| Missing aliases added, a deep recursion error fixed and various | |
| documentation updates. | |
| Addressed 'decode_xs n-byte heap-overflow' security bug in Unicode.xs | |
| (CVE-2011-2939). (5.14.2) | |
| =item * | |
| L<ExtUtils::CBuilder> updated from version 0.280203 to 0.280206. | |
| The new version appends CFLAGS and LDFLAGS to their Config.pm | |
| counterparts. | |
| =item * | |
| L<ExtUtils::ParseXS> has been upgraded from version 2.2210 to 3.16. | |
| Much of L<ExtUtils::ParseXS>, the module behind the XS compiler C<xsubpp>, | |
| was rewritten and cleaned up. It has been made somewhat more extensible | |
| and now finally uses strictures. | |
| The typemap logic has been moved into a separate module, | |
| L<ExtUtils::Typemaps>. See L</New Modules and Pragmata>, above. | |
| For a complete set of changes, please see the ExtUtils::ParseXS | |
| changelog, available on the CPAN. | |
| =item * | |
| L<File::Glob> has been upgraded from version 1.12 to 1.17. | |
| On Windows, tilde (~) expansion now checks the C<USERPROFILE> environment | |
| variable, after checking C<HOME>. | |
| It has a new C<:bsd_glob> export tag, intended to replace C<:glob>. Like | |
| C<:glob> it overrides C<glob> with a function that does not split the glob | |
| pattern into words, but, unlike C<:glob>, it iterates properly in scalar | |
| context, instead of returning the last file. | |
| There are other changes affecting Perl's own C<glob> operator (which uses | |
| File::Glob internally, except on VMS). See L</Performance Enhancements> | |
| and L</Selected Bug Fixes>. | |
| =item * | |
| L<FindBin> updated from version 1.50 to 1.51. | |
| It no longer returns a wrong result if a script of the same name as the | |
| current one exists in the path and is executable. | |
| =item * | |
| L<HTTP::Tiny> has been upgraded from version 0.012 to 0.017. | |
| Added support for using C<$ENV{http_proxy}> to set the default proxy host. | |
| Adds additional shorthand methods for all common HTTP verbs, | |
| a C<post_form()> method for POST-ing x-www-form-urlencoded data and | |
| a C<www_form_urlencode()> utility method. | |
| =item * | |
| L<IO> has been upgraded from version 1.25_04 to 1.25_06, and L<IO::Handle> | |
| from version 1.31 to 1.33. | |
| Together, these upgrades fix a problem with IO::Handle's C<getline> and | |
| C<getlines> methods. When these methods are called on the special ARGV | |
| handle, the next file is automatically opened, as happens with the built-in | |
| C<E<lt>E<gt>> and C<readline> functions. But, unlike the built-ins, these | |
| methods were not respecting the caller's use of the L<open> pragma and | |
| applying the appropriate I/O layers to the newly-opened file | |
| [rt.cpan.org #66474]. | |
| =item * | |
| L<IPC::Cmd> has been upgraded from version 0.70 to 0.76. | |
| Capturing of command output (both C<STDOUT> and C<STDERR>) is now supported | |
| using L<IPC::Open3> on MSWin32 without requiring L<IPC::Run>. | |
| =item * | |
| L<IPC::Open3> has been upgraded from version 1.09 to 1.12. | |
| Fixes a bug which prevented use of C<open3> on Windows when C<*STDIN>, | |
| C<*STDOUT> or C<*STDERR> had been localized. | |
| Fixes a bug which prevented duplicating numeric file descriptors on Windows. | |
| C<open3> with "-" for the program name works once more. This was broken in | |
| version 1.06 (and hence in Perl 5.14.0) [perl #95748]. | |
| =item * | |
| L<Locale::Codes> has been upgraded from version 3.16 to 3.21. | |
| Added Language Extension codes (langext) and Language Variation codes (langvar) | |
| as defined in the IANA language registry. | |
| Added language codes from ISO 639-5 | |
| Added language/script codes from the IANA language subtag registry | |
| Fixed an uninitialized value warning [rt.cpan.org #67438]. | |
| Fixed the return value for the all_XXX_codes and all_XXX_names functions | |
| [rt.cpan.org #69100]. | |
| Reorganized modules to move Locale::MODULE to Locale::Codes::MODULE to allow | |
| for cleaner future additions. The original four modules (Locale::Language, | |
| Locale::Currency, Locale::Country, Locale::Script) will continue to work, but | |
| all new sets of codes will be added in the Locale::Codes namespace. | |
| The code2XXX, XXX2code, all_XXX_codes, and all_XXX_names functions now | |
| support retired codes. All codesets may be specified by a constant or | |
| by their name now. Previously, they were specified only by a constant. | |
| The alias_code function exists for backward compatibility. It has been | |
| replaced by rename_country_code. The alias_code function will be | |
| removed some time after September, 2013. | |
| All work is now done in the central module (Locale::Codes). Previously, | |
| some was still done in the wrapper modules (Locale::Codes::*). Added | |
| Language Family codes (langfam) as defined in ISO 639-5. | |
| =item * | |
| L<Math::BigFloat> has been upgraded from version 1.993 to 1.997. | |
| The C<numify> method has been corrected to return a normalized Perl number | |
| (the result of C<0 + $thing>), instead of a string [rt.cpan.org #66732]. | |
| =item * | |
| L<Math::BigInt> has been upgraded from version 1.994 to 1.998. | |
| It provides a new C<bsgn> method that complements the C<babs> method. | |
| It fixes the internal C<objectify> function's handling of "foreign objects" | |
| so they are converted to the appropriate class (Math::BigInt or | |
| Math::BigFloat). | |
| =item * | |
| L<Math::BigRat> has been upgraded from version 0.2602 to 0.2603. | |
| C<int()> on a Math::BigRat object containing -1/2 now creates a | |
| Math::BigInt containing 0, rather than -0. L<Math::BigInt> does not even | |
| support negative zero, so the resulting object was actually malformed | |
| [perl #95530]. | |
| =item * | |
| L<Math::Complex> has been upgraded from version 1.56 to 1.59 | |
| and L<Math::Trig> from version 1.2 to 1.22. | |
| Fixes include: correct copy constructor usage; fix polarwise formatting with | |
| numeric format specifier; and more stable C<great_circle_direction> algorithm. | |
| =item * | |
| L<Module::CoreList> has been upgraded from version 2.51 to 2.66. | |
| The C<corelist> utility now understands the C<-r> option for displaying | |
| Perl release dates and the C<--diff> option to print the set of modlib | |
| changes between two perl distributions. | |
| =item * | |
| L<Module::Metadata> has been upgraded from version 1.000004 to 1.000009. | |
| Adds C<provides> method to generate a CPAN META provides data structure | |
| correctly; use of C<package_versions_from_directory> is discouraged. | |
| =item * | |
| L<ODBM_File> has been upgraded from version 1.10 to 1.12. | |
| The XS code is now compiled with C<PERL_NO_GET_CONTEXT>, which will aid | |
| performance under ithreads. | |
| =item * | |
| L<open> has been upgraded from version 1.08 to 1.10. | |
| It no longer turns off layers on standard handles when invoked without the | |
| ":std" directive. Similarly, when invoked I<with> the ":std" directive, it | |
| now clears layers on STDERR before applying the new ones, and not just on | |
| STDIN and STDOUT [perl #92728]. | |
| =item * | |
| L<overload> has been upgraded from version 1.13 to 1.18. | |
| C<overload::Overloaded> no longer calls C<can> on the class, but uses | |
| another means to determine whether the object has overloading. It was | |
| never correct for it to call C<can>, as overloading does not respect | |
| AUTOLOAD. So classes that autoload methods and implement C<can> no longer | |
| have to account for overloading [perl #40333]. | |
| A warning is now produced for invalid arguments. See L</New Diagnostics>. | |
| =item * | |
| L<PerlIO::scalar> has been upgraded from version 0.11 to 0.14. | |
| (This is the module that implements C<< open $fh, '>', \$scalar >>.) | |
| It fixes a problem with C<< open my $fh, ">", \$scalar >> not working if | |
| C<$scalar> is a copy-on-write scalar. (5.14.2) | |
| It also fixes a hang that occurs with C<readline> or C<< <$fh> >> if a | |
| typeglob has been assigned to $scalar [perl #92258]. | |
| It no longer assumes during C<seek> that $scalar is a string internally. | |
| If it didn't crash, it was close to doing so [perl #92706]. Also, the | |
| internal print routine no longer assumes that the position set by C<seek> | |
| is valid, but extends the string to that position, filling the intervening | |
| bytes (between the old length and the seek position) with nulls | |
| [perl #78980]. | |
| Printing to an in-memory handle now works if the $scalar holds a reference, | |
| stringifying the reference before modifying it. References used to be | |
| treated as empty strings. | |
| Printing to an in-memory handle no longer crashes if the $scalar happens to | |
| hold a number internally, but no string buffer. | |
| Printing to an in-memory handle no longer creates scalars that confuse | |
| the regular expression engine [perl #108398]. | |
| =item * | |
| L<Pod::Functions> has been upgraded from version 1.04 to 1.05. | |
| F<Functions.pm> is now generated at perl build time from annotations in | |
| F<perlfunc.pod>. This will ensure that L<Pod::Functions> and L<perlfunc> | |
| remain in synchronisation. | |
| =item * | |
| L<Pod::Html> has been upgraded from version 1.11 to 1.1502. | |
| This is an extensive rewrite of Pod::Html to use L<Pod::Simple> under | |
| the hood. The output has changed significantly. | |
| =item * | |
| L<Pod::Perldoc> has been upgraded from version 3.15_03 to 3.17. | |
| It corrects the search paths on VMS [perl #90640]. (5.14.1) | |
| The B<-v> option now fetches the right section for C<$0>. | |
| This upgrade has numerous significant fixes. Consult its changelog on | |
| the CPAN for more information. | |
| =item * | |
| L<POSIX> has been upgraded from version 1.24 to 1.30. | |
| L<POSIX> no longer uses L<AutoLoader>. Any code which was relying on this | |
| implementation detail was buggy, and may fail because of this change. | |
| The module's Perl code has been considerably simplified, roughly halving | |
| the number of lines, with no change in functionality. The XS code has | |
| been refactored to reduce the size of the shared object by about 12%, | |
| with no change in functionality. More POSIX functions now have tests. | |
| C<sigsuspend> and C<pause> now run signal handlers before returning, as the | |
| whole point of these two functions is to wait until a signal has | |
| arrived, and then return I<after> it has been triggered. Delayed, or | |
| "safe", signals were preventing that from happening, possibly resulting in | |
| race conditions [perl #107216]. | |
| C<POSIX::sleep> is now a direct call into the underlying OS C<sleep> | |
| function, instead of being a Perl wrapper on C<CORE::sleep>. | |
| C<POSIX::dup2> now returns the correct value on Win32 (I<i.e.>, the file | |
| descriptor). C<POSIX::SigSet> C<sigsuspend> and C<sigpending> and | |
| C<POSIX::pause> now dispatch safe signals immediately before returning to | |
| their caller. | |
| C<POSIX::Termios::setattr> now defaults the third argument to C<TCSANOW>, | |
| instead of 0. On most platforms C<TCSANOW> is defined to be 0, but on some | |
| 0 is not a valid parameter, which caused a call with defaults to fail. | |
| =item * | |
| L<Socket> has been upgraded from version 1.94 to 2.001. | |
| It has new functions and constants for handling IPv6 sockets: | |
| pack_ipv6_mreq | |
| unpack_ipv6_mreq | |
| IPV6_ADD_MEMBERSHIP | |
| IPV6_DROP_MEMBERSHIP | |
| IPV6_MTU | |
| IPV6_MTU_DISCOVER | |
| IPV6_MULTICAST_HOPS | |
| IPV6_MULTICAST_IF | |
| IPV6_MULTICAST_LOOP | |
| IPV6_UNICAST_HOPS | |
| IPV6_V6ONLY | |
| =item * | |
| L<Storable> has been upgraded from version 2.27 to 2.34. | |
| It no longer turns copy-on-write scalars into read-only scalars when | |
| freezing and thawing. | |
| =item * | |
| L<Sys::Syslog> has been upgraded from version 0.27 to 0.29. | |
| This upgrade closes many outstanding bugs. | |
| =item * | |
| L<Term::ANSIColor> has been upgraded from version 3.00 to 3.01. | |
| Only interpret an initial array reference as a list of colors, not any initial | |
| reference, allowing the colored function to work properly on objects with | |
| stringification defined. | |
| =item * | |
| L<Term::ReadLine> has been upgraded from version 1.07 to 1.09. | |
| Term::ReadLine now supports any event loop, including unpublished ones and | |
| simple L<IO::Select>, loops without the need to rewrite existing code for | |
| any particular framework [perl #108470]. | |
| =item * | |
| L<threads::shared> has been upgraded from version 1.37 to 1.40. | |
| Destructors on shared objects used to be ignored sometimes if the objects | |
| were referenced only by shared data structures. This has been mostly | |
| fixed, but destructors may still be ignored if the objects still exist at | |
| global destruction time [perl #98204]. | |
| =item * | |
| L<Unicode::Collate> has been upgraded from version 0.73 to 0.89. | |
| Updated to CLDR 1.9.1 | |
| Locales updated to CLDR 2.0: mk, mt, nb, nn, ro, ru, sk, sr, sv, uk, | |
| zh__pinyin, zh__stroke | |
| Newly supported locales: bn, fa, ml, mr, or, pa, sa, si, si__dictionary, | |
| sr_Latn, sv__reformed, ta, te, th, ur, wae. | |
| Tailored compatibility ideographs as well as unified ideographs for the | |
| locales: ja, ko, zh__big5han, zh__gb2312han, zh__pinyin, zh__stroke. | |
| Locale/*.pl files are now searched for in @INC. | |
| =item * | |
| L<Unicode::Normalize> has been upgraded from version 1.10 to 1.14. | |
| Fixes for the removal of F<unicore/CompositionExclusions.txt> from core. | |
| =item * | |
| L<Unicode::UCD> has been upgraded from version 0.32 to 0.43. | |
| This adds four new functions: C<prop_aliases()> and | |
| C<prop_value_aliases()>, which are used to find all Unicode-approved | |
| synonyms for property names, or to convert from one name to another; | |
| C<prop_invlist> which returns all code points matching a given | |
| Unicode binary property; and C<prop_invmap> which returns the complete | |
| specification of a given Unicode property. | |
| =item * | |
| L<Win32API::File> has been upgraded from version 0.1101 to 0.1200. | |
| Added SetStdHandle and GetStdHandle functions | |
| =back | |
| =head2 Removed Modules and Pragmata | |
| As promised in Perl 5.14.0's release notes, the following modules have | |
| been removed from the core distribution, and if needed should be installed | |
| from CPAN instead. | |
| =over | |
| =item * | |
| L<Devel::DProf> has been removed from the Perl core. Prior version was | |
| 20110228.00. | |
| =item * | |
| L<Shell> has been removed from the Perl core. Prior version was 0.72_01. | |
| =item * | |
| Several old perl4-style libraries which have been deprecated with 5.14 | |
| are now removed: | |
| abbrev.pl assert.pl bigfloat.pl bigint.pl bigrat.pl cacheout.pl | |
| complete.pl ctime.pl dotsh.pl exceptions.pl fastcwd.pl flush.pl | |
| getcwd.pl getopt.pl getopts.pl hostname.pl importenv.pl | |
| lib/find{,depth}.pl look.pl newgetopt.pl open2.pl open3.pl | |
| pwd.pl shellwords.pl stat.pl tainted.pl termcap.pl timelocal.pl | |
| They can be found on CPAN as L<Perl4::CoreLibs>. | |
| =back | |
| =head1 Documentation | |
| =head2 New Documentation | |
| =head3 L<perldtrace> | |
| L<perldtrace> describes Perl's DTrace support, listing the provided probes | |
| and gives examples of their use. | |
| =head3 L<perlexperiment> | |
| This document is intended to provide a list of experimental features in | |
| Perl. It is still a work in progress. | |
| =head3 L<perlootut> | |
| This a new OO tutorial. It focuses on basic OO concepts, and then recommends | |
| that readers choose an OO framework from CPAN. | |
| =head3 L<perlxstypemap> | |
| The new manual describes the XS typemapping mechanism in unprecedented | |
| detail and combines new documentation with information extracted from | |
| L<perlxs> and the previously unofficial list of all core typemaps. | |
| =head2 Changes to Existing Documentation | |
| =head3 L<perlapi> | |
| =over 4 | |
| =item * | |
| The HV API has long accepted negative lengths to show that the key is | |
| in UTF8. This is now documented. | |
| =item * | |
| The C<boolSV()> macro is now documented. | |
| =back | |
| =head3 L<perlfunc> | |
| =over 4 | |
| =item * | |
| C<dbmopen> treats a 0 mode as a special case, that prevents a nonexistent | |
| file from being created. This has been the case since Perl 5.000, but was | |
| never documented anywhere. Now the perlfunc entry mentions it | |
| [perl #90064]. | |
| =item * | |
| As an accident of history, C<open $fh, '<:', ...> applies the default | |
| layers for the platform (C<:raw> on Unix, C<:crlf> on Windows), ignoring | |
| whatever is declared by L<open.pm|open>. This seems such a useful feature | |
| it has been documented in L<perlfunc|perlfunc/open> and L<open>. | |
| =item * | |
| The entry for C<split> has been rewritten. It is now far clearer than | |
| before. | |
| =back | |
| =head3 L<perlguts> | |
| =over 4 | |
| =item * | |
| A new section, L<Autoloading with XSUBs|perlguts/Autoloading with XSUBs>, | |
| has been added, which explains the two APIs for accessing the name of the | |
| autoloaded sub. | |
| =item * | |
| Some function descriptions in L<perlguts> were confusing, as it was | |
| not clear whether they referred to the function above or below the | |
| description. This has been clarified [perl #91790]. | |
| =back | |
| =head3 L<perlobj> | |
| =over 4 | |
| =item * | |
| This document has been rewritten from scratch, and its coverage of various OO | |
| concepts has been expanded. | |
| =back | |
| =head3 L<perlop> | |
| =over 4 | |
| =item * | |
| Documentation of the smartmatch operator has been reworked and moved from | |
| perlsyn to perlop where it belongs. | |
| It has also been corrected for the case of C<undef> on the left-hand | |
| side. The list of different smart match behaviors had an item in the | |
| wrong place. | |
| =item * | |
| Documentation of the ellipsis statement (C<...>) has been reworked and | |
| moved from perlop to perlsyn. | |
| =item * | |
| The explanation of bitwise operators has been expanded to explain how they | |
| work on Unicode strings (5.14.1). | |
| =item * | |
| More examples for C<m//g> have been added (5.14.1). | |
| =item * | |
| The C<<< <<\FOO >>> here-doc syntax has been documented (5.14.1). | |
| =back | |
| =head3 L<perlpragma> | |
| =over 4 | |
| =item * | |
| There is now a standard convention for naming keys in the C<%^H>, | |
| documented under L<Key naming|perlpragma/Key naming>. | |
| =back | |
| =head3 L<perlsec/Laundering and Detecting Tainted Data> | |
| =over 4 | |
| =item * | |
| The example function for checking for taintedness contained a subtle | |
| error. C<$@> needs to be localized to prevent its changing this | |
| global's value outside the function. The preferred method to check for | |
| this remains L<Scalar::Util/tainted>. | |
| =back | |
| =head3 L<perllol> | |
| =over | |
| =item * | |
| L<perllol> has been expanded with examples using the new C<push $scalar> | |
| syntax introduced in Perl 5.14.0 (5.14.1). | |
| =back | |
| =head3 L<perlmod> | |
| =over | |
| =item * | |
| L<perlmod> now states explicitly that some types of explicit symbol table | |
| manipulation are not supported. This codifies what was effectively already | |
| the case [perl #78074]. | |
| =back | |
| =head3 L<perlpodstyle> | |
| =over 4 | |
| =item * | |
| The tips on which formatting codes to use have been corrected and greatly | |
| expanded. | |
| =item * | |
| There are now a couple of example one-liners for previewing POD files after | |
| they have been edited. | |
| =back | |
| =head3 L<perlre> | |
| =over | |
| =item * | |
| The C<(*COMMIT)> directive is now listed in the right section | |
| (L<Verbs without an argument|perlre/Verbs without an argument>). | |
| =back | |
| =head3 L<perlrun> | |
| =over | |
| =item * | |
| L<perlrun> has undergone a significant clean-up. Most notably, the | |
| B<-0x...> form of the B<-0> flag has been clarified, and the final section | |
| on environment variables has been corrected and expanded (5.14.1). | |
| =back | |
| =head3 L<perlsub> | |
| =over | |
| =item * | |
| The ($;) prototype syntax, which has existed for rather a long time, is now | |
| documented in L<perlsub>. It lets a unary function have the same | |
| precedence as a list operator. | |
| =back | |
| =head3 L<perltie> | |
| =over | |
| =item * | |
| The required syntax for tying handles has been documented. | |
| =back | |
| =head3 L<perlvar> | |
| =over | |
| =item * | |
| The documentation for L<$!|perlvar/$!> has been corrected and clarified. | |
| It used to state that $! could be C<undef>, which is not the case. It was | |
| also unclear whether system calls set C's C<errno> or Perl's C<$!> | |
| [perl #91614]. | |
| =item * | |
| Documentation for L<$$|perlvar/$$> has been amended with additional | |
| cautions regarding changing the process ID. | |
| =back | |
| =head3 Other Changes | |
| =over 4 | |
| =item * | |
| L<perlxs> was extended with documentation on inline typemaps. | |
| =item * | |
| L<perlref> has a new L<Circular References|perlref/Circular References> | |
| section explaining how circularities may not be freed and how to solve that | |
| with weak references. | |
| =item * | |
| Parts of L<perlapi> were clarified, and Perl equivalents of some C | |
| functions have been added as an additional mode of exposition. | |
| =item * | |
| A few parts of L<perlre> and L<perlrecharclass> were clarified. | |
| =back | |
| =head2 Removed Documentation | |
| =head3 Old OO Documentation | |
| The old OO tutorials, perltoot, perltooc, and perlboot, have been | |
| removed. The perlbot (bag of object tricks) document has been removed | |
| as well. | |
| =head3 Development Deltas | |
| The perldelta files for development releases are no longer packaged with | |
| perl. These can still be found in the perl source code repository. | |
| =head1 Diagnostics | |
| The following additions or changes have been made to diagnostic output, | |
| including warnings and fatal error messages. For the complete list of | |
| diagnostic messages, see L<perldiag>. | |
| =head2 New Diagnostics | |
| =head3 New Errors | |
| =over 4 | |
| =item * | |
| L<Cannot set tied @DB::args|perldiag/"Cannot set tied @DB::args"> | |
| This error occurs when C<caller> tries to set C<@DB::args> but finds it | |
| tied. Before this error was added, it used to crash instead. | |
| =item * | |
| L<Cannot tie unreifiable array|perldiag/"Cannot tie unreifiable array"> | |
| This error is part of a safety check that the C<tie> operator does before | |
| tying a special array like C<@_>. You should never see this message. | |
| =item * | |
| L<&CORE::%s cannot be called directly|perldiag/"&CORE::%s cannot be called directly"> | |
| This occurs when a subroutine in the C<CORE::> namespace is called | |
| with C<&foo> syntax or through a reference. Some subroutines | |
| in this package cannot yet be called that way, but must be | |
| called as barewords. See L</Subroutines in the C<CORE> namespace>, above. | |
| =item * | |
| L<Source filters apply only to byte streams|perldiag/"Source filters apply only to byte streams"> | |
| This new error occurs when you try to activate a source filter (usually by | |
| loading a source filter module) within a string passed to C<eval> under the | |
| C<unicode_eval> feature. | |
| =back | |
| =head3 New Warnings | |
| =over 4 | |
| =item * | |
| L<defined(@array) is deprecated|perldiag/"defined(@array) is deprecated"> | |
| The long-deprecated C<defined(@array)> now also warns for package variables. | |
| Previously it issued a warning for lexical variables only. | |
| =item * | |
| L<length() used on %s|perldiag/length() used on %s> | |
| This new warning occurs when C<length> is used on an array or hash, instead | |
| of C<scalar(@array)> or C<scalar(keys %hash)>. | |
| =item * | |
| L<lvalue attribute %s already-defined subroutine|perldiag/"lvalue attribute %s already-defined subroutine"> | |
| L<attributes.pm|attributes> now emits this warning when the :lvalue | |
| attribute is applied to a Perl subroutine that has already been defined, as | |
| doing so can have unexpected side-effects. | |
| =item * | |
| L<overload arg '%s' is invalid|perldiag/"overload arg '%s' is invalid"> | |
| This warning, in the "overload" category, is produced when the overload | |
| pragma is given an argument it doesn't recognize, presumably a mistyped | |
| operator. | |
| =item * | |
| L<$[ used in %s (did you mean $] ?)|perldiag/"$[ used in %s (did you mean $] ?)"> | |
| This new warning exists to catch the mistaken use of C<$[> in version | |
| checks. C<$]>, not C<$[>, contains the version number. | |
| =item * | |
| L<Useless assignment to a temporary|perldiag/"Useless assignment to a temporary"> | |
| Assigning to a temporary scalar returned | |
| from an lvalue subroutine now produces this | |
| warning [perl #31946]. | |
| =item * | |
| L<Useless use of \E|perldiag/"Useless use of \E"> | |
| C<\E> does nothing unless preceded by C<\Q>, C<\L> or C<\U>. | |
| =back | |
| =head2 Removed Errors | |
| =over | |
| =item * | |
| "sort is now a reserved word" | |
| This error used to occur when C<sort> was called without arguments, | |
| followed by C<;> or C<)>. (E.g., C<sort;> would die, but C<{sort}> was | |
| OK.) This error message was added in Perl 3 to catch code like | |
| C<close(sort)> which would no longer work. More than two decades later, | |
| this message is no longer appropriate. Now C<sort> without arguments is | |
| always allowed, and returns an empty list, as it did in those cases | |
| where it was already allowed [perl #90030]. | |
| =back | |
| =head2 Changes to Existing Diagnostics | |
| =over 4 | |
| =item * | |
| The "Applying pattern match..." or similar warning produced when an | |
| array or hash is on the left-hand side of the C<=~> operator now | |
| mentions the name of the variable. | |
| =item * | |
| The "Attempt to free non-existent shared string" has had the spelling | |
| of "non-existent" corrected to "nonexistent". It was already listed | |
| with the correct spelling in L<perldiag>. | |
| =item * | |
| The error messages for using C<default> and C<when> outside a | |
| topicalizer have been standardized to match the messages for C<continue> | |
| and loop controls. They now read 'Can't "default" outside a | |
| topicalizer' and 'Can't "when" outside a topicalizer'. They both used | |
| to be 'Can't use when() outside a topicalizer' [perl #91514]. | |
| =item * | |
| The message, "Code point 0x%X is not Unicode, no properties match it; | |
| all inverse properties do" has been changed to "Code point 0x%X is not | |
| Unicode, all \p{} matches fail; all \P{} matches succeed". | |
| =item * | |
| Redefinition warnings for constant subroutines used to be mandatory, | |
| even occurring under C<no warnings>. Now they respect the L<warnings> | |
| pragma. | |
| =item * | |
| The "glob failed" warning message is now suppressible via C<no warnings> | |
| [perl #111656]. | |
| =item * | |
| The L<Invalid version format|perldiag/"Invalid version format (%s)"> | |
| error message now says "negative version number" within the parentheses, | |
| rather than "non-numeric data", for negative numbers. | |
| =item * | |
| The two warnings | |
| L<Possible attempt to put comments in qw() list|perldiag/"Possible attempt to put comments in qw() list"> | |
| and | |
| L<Possible attempt to separate words with commas|perldiag/"Possible attempt to separate words with commas"> | |
| are no longer mutually exclusive: the same C<qw> construct may produce | |
| both. | |
| =item * | |
| The uninitialized warning for C<y///r> when C<$_> is implicit and | |
| undefined now mentions the variable name, just like the non-/r variation | |
| of the operator. | |
| =item * | |
| The 'Use of "foo" without parentheses is ambiguous' warning has been | |
| extended to apply also to user-defined subroutines with a (;$) | |
| prototype, and not just to built-in functions. | |
| =item * | |
| Warnings that mention the names of lexical (C<my>) variables with | |
| Unicode characters in them now respect the presence or absence of the | |
| C<:utf8> layer on the output handle, instead of outputting UTF8 | |
| regardless. Also, the correct names are included in the strings passed | |
| to C<$SIG{__WARN__}> handlers, rather than the raw UTF8 bytes. | |
| =back | |
| =head1 Utility Changes | |
| =head3 L<h2ph> | |
| =over 4 | |
| =item * | |
| L<h2ph> used to generate code of the form | |
| unless(defined(&FOO)) { | |
| sub FOO () {42;} | |
| } | |
| But the subroutine is a compile-time declaration, and is hence unaffected | |
| by the condition. It has now been corrected to emit a string C<eval> | |
| around the subroutine [perl #99368]. | |
| =back | |
| =head3 L<splain> | |
| =over 4 | |
| =item * | |
| F<splain> no longer emits backtraces with the first line number repeated. | |
| This: | |
| Uncaught exception from user code: | |
| Cannot fwiddle the fwuddle at -e line 1. | |
| at -e line 1 | |
| main::baz() called at -e line 1 | |
| main::bar() called at -e line 1 | |
| main::foo() called at -e line 1 | |
| has become this: | |
| Uncaught exception from user code: | |
| Cannot fwiddle the fwuddle at -e line 1. | |
| main::baz() called at -e line 1 | |
| main::bar() called at -e line 1 | |
| main::foo() called at -e line 1 | |
| =item * | |
| Some error messages consist of multiple lines that are listed as separate | |
| entries in L<perldiag>. splain has been taught to find the separate | |
| entries in these cases, instead of simply failing to find the message. | |
| =back | |
| =head3 L<zipdetails> | |
| =over 4 | |
| =item * | |
| This is a new utility, included as part of an | |
| L<IO::Compress::Base> upgrade. | |
| L<zipdetails> displays information about the internal record structure | |
| of the zip file. It is not concerned with displaying any details of | |
| the compressed data stored in the zip file. | |
| =back | |
| =head1 Configuration and Compilation | |
| =over 4 | |
| =item * | |
| F<regexp.h> has been modified for compatibility with GCC's B<-Werror> | |
| option, as used by some projects that include perl's header files (5.14.1). | |
| =item * | |
| C<USE_LOCALE{,_COLLATE,_CTYPE,_NUMERIC}> have been added the output of perl -V | |
| as they have affect the behavior of the interpreter binary (albeit | |
| in only a small area). | |
| =item * | |
| The code and tests for L<IPC::Open2> have been moved from F<ext/IPC-Open2> | |
| into F<ext/IPC-Open3>, as C<IPC::Open2::open2()> is implemented as a thin | |
| wrapper around C<IPC::Open3::_open3()>, and hence is very tightly coupled to | |
| it. | |
| =item * | |
| The magic types and magic vtables are now generated from data in a new script | |
| F<regen/mg_vtable.pl>, instead of being maintained by hand. As different | |
| EBCDIC variants can't agree on the code point for '~', the character to code | |
| point conversion is done at build time by F<generate_uudmap> to a new generated | |
| header F<mg_data.h>. C<PL_vtbl_bm> and C<PL_vtbl_fm> are now defined by the | |
| pre-processor as C<PL_vtbl_regexp>, instead of being distinct C variables. | |
| C<PL_vtbl_sig> has been removed. | |
| =item * | |
| Building with C<-DPERL_GLOBAL_STRUCT> works again. This configuration is not | |
| generally used. | |
| =item * | |
| Perl configured with I<MAD> now correctly frees C<MADPROP> structures when | |
| OPs are freed. C<MADPROP>s are now allocated with C<PerlMemShared_malloc()> | |
| =item * | |
| F<makedef.pl> has been refactored. This should have no noticeable affect on | |
| any of the platforms that use it as part of their build (AIX, VMS, Win32). | |
| =item * | |
| C<useperlio> can no longer be disabled. | |
| =item * | |
| The file F<global.sym> is no longer needed, and has been removed. It | |
| contained a list of all exported functions, one of the files generated by | |
| F<regen/embed.pl> from data in F<embed.fnc> and F<regen/opcodes>. The code | |
| has been refactored so that the only user of F<global.sym>, F<makedef.pl>, | |
| now reads F<embed.fnc> and F<regen/opcodes> directly, removing the need to | |
| store the list of exported functions in an intermediate file. | |
| As F<global.sym> was never installed, this change should not be visible | |
| outside the build process. | |
| =item * | |
| F<pod/buildtoc>, used by the build process to build L<perltoc>, has been | |
| refactored and simplified. It now contains only code to build L<perltoc>; | |
| the code to regenerate Makefiles has been moved to F<Porting/pod_rules.pl>. | |
| It's a bug if this change has any material effect on the build process. | |
| =item * | |
| F<pod/roffitall> is now built by F<pod/buildtoc>, instead of being | |
| shipped with the distribution. Its list of manpages is now generated | |
| (and therefore current). See also RT #103202 for an unresolved related | |
| issue. | |
| =item * | |
| The man page for C<XS::Typemap> is no longer installed. C<XS::Typemap> | |
| is a test module which is not installed, hence installing its | |
| documentation makes no sense. | |
| =item * | |
| The -Dusesitecustomize and -Duserelocatableinc options now work | |
| together properly. | |
| =back | |
| =head1 Platform Support | |
| =head2 Platform-Specific Notes | |
| =head3 Cygwin | |
| =over 4 | |
| =item * | |
| Since version 1.7, Cygwin supports native UTF-8 paths. If Perl is built | |
| under that environment, directory and filenames will be UTF-8 encoded. | |
| =item * | |
| Cygwin does not initialize all original Win32 environment variables. See | |
| F<README.cygwin> for a discussion of the newly-added | |
| C<Cygwin::sync_winenv()> function [perl #110190] and for | |
| further links. | |
| =back | |
| =head3 HP-UX | |
| =over 4 | |
| =item * | |
| HP-UX PA-RISC/64 now supports gcc-4.x | |
| A fix to correct the socketsize now makes the test suite pass on HP-UX | |
| PA-RISC for 64bitall builds. (5.14.2) | |
| =back | |
| =head3 VMS | |
| =over 4 | |
| =item * | |
| Remove unnecessary includes, fix miscellaneous compiler warnings and | |
| close some unclosed comments on F<vms/vms.c>. | |
| =item * | |
| Remove sockadapt layer from the VMS build. | |
| =item * | |
| Explicit support for VMS versions before v7.0 and DEC C versions | |
| before v6.0 has been removed. | |
| =item * | |
| Since Perl 5.10.1, the home-grown C<stat> wrapper has been unable to | |
| distinguish between a directory name containing an underscore and an | |
| otherwise-identical filename containing a dot in the same position | |
| (e.g., t/test_pl as a directory and t/test.pl as a file). This problem | |
| has been corrected. | |
| =item * | |
| The build on VMS now permits names of the resulting symbols in C code for | |
| Perl longer than 31 characters. Symbols like | |
| C<Perl__it_was_the_best_of_times_it_was_the_worst_of_times> can now be | |
| created freely without causing the VMS linker to seize up. | |
| =back | |
| =head3 GNU/Hurd | |
| =over 4 | |
| =item * | |
| Numerous build and test failures on GNU/Hurd have been resolved with hints | |
| for building DBM modules, detection of the library search path, and enabling | |
| of large file support. | |
| =back | |
| =head3 OpenVOS | |
| =over 4 | |
| =item * | |
| Perl is now built with dynamic linking on OpenVOS, the minimum supported | |
| version of which is now Release 17.1.0. | |
| =back | |
| =head3 SunOS | |
| The CC workshop C++ compiler is now detected and used on systems that ship | |
| without cc. | |
| =head1 Internal Changes | |
| =over 4 | |
| =item * | |
| The compiled representation of formats is now stored via the C<mg_ptr> of | |
| their C<PERL_MAGIC_fm>. Previously it was stored in the string buffer, | |
| beyond C<SvLEN()>, the regular end of the string. C<SvCOMPILED()> and | |
| C<SvCOMPILED_{on,off}()> now exist solely for compatibility for XS code. | |
| The first is always 0, the other two now no-ops. (5.14.1) | |
| =item * | |
| Some global variables have been marked C<const>, members in the interpreter | |
| structure have been re-ordered, and the opcodes have been re-ordered. The | |
| op C<OP_AELEMFAST> has been split into C<OP_AELEMFAST> and C<OP_AELEMFAST_LEX>. | |
| =item * | |
| When empting a hash of its elements (e.g., via undef(%h), or %h=()), HvARRAY | |
| field is no longer temporarily zeroed. Any destructors called on the freed | |
| elements see the remaining elements. Thus, %h=() becomes more like | |
| C<delete $h{$_} for keys %h>. | |
| =item * | |
| Boyer-Moore compiled scalars are now PVMGs, and the Boyer-Moore tables are now | |
| stored via the mg_ptr of their C<PERL_MAGIC_bm>. | |
| Previously they were PVGVs, with the tables stored in | |
| the string buffer, beyond C<SvLEN()>. This eliminates | |
| the last place where the core stores data beyond C<SvLEN()>. | |
| =item * | |
| Simplified logic in C<Perl_sv_magic()> introduces a small change of | |
| behavior for error cases involving unknown magic types. Previously, if | |
| C<Perl_sv_magic()> was passed a magic type unknown to it, it would | |
| =over | |
| =item 1. | |
| Croak "Modification of a read-only value attempted" if read only | |
| =item 2. | |
| Return without error if the SV happened to already have this magic | |
| =item 3. | |
| otherwise croak "Don't know how to handle magic of type \\%o" | |
| =back | |
| Now it will always croak "Don't know how to handle magic of type \\%o", even | |
| on read-only values, or SVs which already have the unknown magic type. | |
| =item * | |
| The experimental C<fetch_cop_label> function has been renamed to | |
| C<cop_fetch_label>. | |
| =item * | |
| The C<cop_store_label> function has been added to the API, but is | |
| experimental. | |
| =item * | |
| F<embedvar.h> has been simplified, and one level of macro indirection for | |
| PL_* variables has been removed for the default (non-multiplicity) | |
| configuration. PERLVAR*() macros now directly expand their arguments to | |
| tokens such as C<PL_defgv>, instead of expanding to C<PL_Idefgv>, with | |
| F<embedvar.h> defining a macro to map C<PL_Idefgv> to C<PL_defgv>. XS code | |
| which has unwarranted chumminess with the implementation may need updating. | |
| =item * | |
| An API has been added to explicitly choose whether to export XSUB | |
| symbols. More detail can be found in the comments for commit e64345f8. | |
| =item * | |
| The C<is_gv_magical_sv> function has been eliminated and merged with | |
| C<gv_fetchpvn_flags>. It used to be called to determine whether a GV | |
| should be autovivified in rvalue context. Now it has been replaced with a | |
| new C<GV_ADDMG> flag (not part of the API). | |
| =item * | |
| The returned code point from the function C<utf8n_to_uvuni()> | |
| when the input is malformed UTF-8, malformations are allowed, and | |
| C<utf8> warnings are off is now the Unicode REPLACEMENT CHARACTER | |
| whenever the malformation is such that no well-defined code point can be | |
| computed. Previously the returned value was essentially garbage. The | |
| only malformations that have well-defined values are a zero-length | |
| string (0 is the return), and overlong UTF-8 sequences. | |
| =item * | |
| Padlists are now marked C<AvREAL>; i.e., reference-counted. They have | |
| always been reference-counted, but were not marked real, because F<pad.c> | |
| did its own clean-up, instead of using the usual clean-up code in F<sv.c>. | |
| That caused problems in thread cloning, so now the C<AvREAL> flag is on, | |
| but is turned off in F<pad.c> right before the padlist is freed (after | |
| F<pad.c> has done its custom freeing of the pads). | |
| =item * | |
| All C files that make up the Perl core have been converted to UTF-8. | |
| =item * | |
| These new functions have been added as part of the work on Unicode symbols: | |
| HvNAMELEN | |
| HvNAMEUTF8 | |
| HvENAMELEN | |
| HvENAMEUTF8 | |
| gv_init_pv | |
| gv_init_pvn | |
| gv_init_pvsv | |
| gv_fetchmeth_pv | |
| gv_fetchmeth_pvn | |
| gv_fetchmeth_sv | |
| gv_fetchmeth_pv_autoload | |
| gv_fetchmeth_pvn_autoload | |
| gv_fetchmeth_sv_autoload | |
| gv_fetchmethod_pv_flags | |
| gv_fetchmethod_pvn_flags | |
| gv_fetchmethod_sv_flags | |
| gv_autoload_pv | |
| gv_autoload_pvn | |
| gv_autoload_sv | |
| newGVgen_flags | |
| sv_derived_from_pv | |
| sv_derived_from_pvn | |
| sv_derived_from_sv | |
| sv_does_pv | |
| sv_does_pvn | |
| sv_does_sv | |
| whichsig_pv | |
| whichsig_pvn | |
| whichsig_sv | |
| newCONSTSUB_flags | |
| The gv_fetchmethod_*_flags functions, like gv_fetchmethod_flags, are | |
| experimental and may change in a future release. | |
| =item * | |
| The following functions were added. These are I<not> part of the API: | |
| GvNAMEUTF8 | |
| GvENAMELEN | |
| GvENAME_HEK | |
| CopSTASH_flags | |
| CopSTASH_flags_set | |
| PmopSTASH_flags | |
| PmopSTASH_flags_set | |
| sv_sethek | |
| HEKfARG | |
| There is also a C<HEKf> macro corresponding to C<SVf>, for | |
| interpolating HEKs in formatted strings. | |
| =item * | |
| C<sv_catpvn_flags> takes a couple of new internal-only flags, | |
| C<SV_CATBYTES> and C<SV_CATUTF8>, which tell it whether the char array to | |
| be concatenated is UTF8. This allows for more efficient concatenation than | |
| creating temporary SVs to pass to C<sv_catsv>. | |
| =item * | |
| For XS AUTOLOAD subs, $AUTOLOAD is set once more, as it was in 5.6.0. This | |
| is in addition to setting C<SvPVX(cv)>, for compatibility with 5.8 to 5.14. | |
| See L<perlguts/Autoloading with XSUBs>. | |
| =item * | |
| Perl now checks whether the array (the linearized isa) returned by a MRO | |
| plugin begins with the name of the class itself, for which the array was | |
| created, instead of assuming that it does. This prevents the first element | |
| from being skipped during method lookup. It also means that | |
| C<mro::get_linear_isa> may return an array with one more element than the | |
| MRO plugin provided [perl #94306]. | |
| =item * | |
| C<PL_curstash> is now reference-counted. | |
| =item * | |
| There are now feature bundle hints in C<PL_hints> (C<$^H>) that version | |
| declarations use, to avoid having to load F<feature.pm>. One setting of | |
| the hint bits indicates a "custom" feature bundle, which means that the | |
| entries in C<%^H> still apply. F<feature.pm> uses that. | |
| The C<HINT_FEATURE_MASK> macro is defined in F<perl.h> along with other | |
| hints. Other macros for setting and testing features and bundles are in | |
| the new F<feature.h>. C<FEATURE_IS_ENABLED> (which has moved to | |
| F<feature.h>) is no longer used throughout the codebase, but more specific | |
| macros, e.g., C<FEATURE_SAY_IS_ENABLED>, that are defined in F<feature.h>. | |
| =item * | |
| F<lib/feature.pm> is now a generated file, created by the new | |
| F<regen/feature.pl> script, which also generates F<feature.h>. | |
| =item * | |
| Tied arrays are now always C<AvREAL>. If C<@_> or C<DB::args> is tied, it | |
| is reified first, to make sure this is always the case. | |
| =item * | |
| Two new functions C<utf8_to_uvchr_buf()> and C<utf8_to_uvuni_buf()> have | |
| been added. These are the same as C<utf8_to_uvchr> and | |
| C<utf8_to_uvuni> (which are now deprecated), but take an extra parameter | |
| that is used to guard against reading beyond the end of the input | |
| string. | |
| See L<perlapi/utf8_to_uvchr_buf> and L<perlapi/utf8_to_uvuni_buf>. | |
| =item * | |
| The regular expression engine now does TRIE case insensitive matches | |
| under Unicode. This may change the output of C<< use re 'debug'; >>, | |
| and will speed up various things. | |
| =item * | |
| There is a new C<wrap_op_checker()> function, which provides a thread-safe | |
| alternative to writing to C<PL_check> directly. | |
| =back | |
| =head1 Selected Bug Fixes | |
| =head2 Array and hash | |
| =over | |
| =item * | |
| A bug has been fixed that would cause a "Use of freed value in iteration" | |
| error if the next two hash elements that would be iterated over are | |
| deleted [perl #85026]. (5.14.1) | |
| =item * | |
| Deleting the current hash iterator (the hash element that would be returned | |
| by the next call to C<each>) in void context used not to free it | |
| [perl #85026]. | |
| =item * | |
| Deletion of methods via C<delete $Class::{method}> syntax used to update | |
| method caches if called in void context, but not scalar or list context. | |
| =item * | |
| When hash elements are deleted in void context, the internal hash entry is | |
| now freed before the value is freed, to prevent destructors called by that | |
| latter freeing from seeing the hash in an inconsistent state. It was | |
| possible to cause double-frees if the destructor freed the hash itself | |
| [perl #100340]. | |
| =item * | |
| A C<keys> optimization in Perl 5.12.0 to make it faster on empty hashes | |
| caused C<each> not to reset the iterator if called after the last element | |
| was deleted. | |
| =item * | |
| Freeing deeply nested hashes no longer crashes [perl #44225]. | |
| =item * | |
| It is possible from XS code to create hashes with elements that have no | |
| values. The hash element and slice operators used to crash | |
| when handling these in lvalue context. They now | |
| produce a "Modification of non-creatable hash value attempted" error | |
| message. | |
| =item * | |
| If list assignment to a hash or array triggered destructors that freed the | |
| hash or array itself, a crash would ensue. This is no longer the case | |
| [perl #107440]. | |
| =item * | |
| It used to be possible to free the typeglob of a localized array or hash | |
| (e.g., C<local @{"x"}; delete $::{x}>), resulting in a crash on scope exit. | |
| =item * | |
| Some core bugs affecting L<Hash::Util> have been fixed: locking a hash | |
| element that is a glob copy no longer causes the next assignment to it to | |
| corrupt the glob (5.14.2), and unlocking a hash element that holds a | |
| copy-on-write scalar no longer causes modifications to that scalar to | |
| modify other scalars that were sharing the same string buffer. | |
| =back | |
| =head2 C API fixes | |
| =over | |
| =item * | |
| The C<newHVhv> XS function now works on tied hashes, instead of crashing or | |
| returning an empty hash. | |
| =item * | |
| The C<SvIsCOW> C macro now returns false for read-only copies of typeglobs, | |
| such as those created by: | |
| $hash{elem} = *foo; | |
| Hash::Util::lock_value %hash, 'elem'; | |
| It used to return true. | |
| =item * | |
| The C<SvPVutf8> C function no longer tries to modify its argument, | |
| resulting in errors [perl #108994]. | |
| =item * | |
| C<SvPVutf8> now works properly with magical variables. | |
| =item * | |
| C<SvPVbyte> now works properly non-PVs. | |
| =item * | |
| When presented with malformed UTF-8 input, the XS-callable functions | |
| C<is_utf8_string()>, C<is_utf8_string_loc()>, and | |
| C<is_utf8_string_loclen()> could read beyond the end of the input | |
| string by up to 12 bytes. This no longer happens. [perl #32080]. | |
| However, currently, C<is_utf8_char()> still has this defect, see | |
| L</is_utf8_char()> above. | |
| =item * | |
| The C-level C<pregcomp> function could become confused about whether the | |
| pattern was in UTF8 if the pattern was an overloaded, tied, or otherwise | |
| magical scalar [perl #101940]. | |
| =back | |
| =head2 Compile-time hints | |
| =over | |
| =item * | |
| Tying C<%^H> no longer causes perl to crash or ignore the contents of | |
| C<%^H> when entering a compilation scope [perl #106282]. | |
| =item * | |
| C<eval $string> and C<require> used not to | |
| localize C<%^H> during compilation if it | |
| was empty at the time the C<eval> call itself was compiled. This could | |
| lead to scary side effects, like C<use re "/m"> enabling other flags that | |
| the surrounding code was trying to enable for its caller [perl #68750]. | |
| =item * | |
| C<eval $string> and C<require> no longer localize hints (C<$^H> and C<%^H>) | |
| at run time, but only during compilation of the $string or required file. | |
| This makes C<BEGIN { $^H{foo}=7 }> equivalent to | |
| C<BEGIN { eval '$^H{foo}=7' }> [perl #70151]. | |
| =item * | |
| Creating a BEGIN block from XS code (via C<newXS> or C<newATTRSUB>) would, | |
| on completion, make the hints of the current compiling code the current | |
| hints. This could cause warnings to occur in a non-warning scope. | |
| =back | |
| =head2 Copy-on-write scalars | |
| Copy-on-write or shared hash key scalars | |
| were introduced in 5.8.0, but most Perl code | |
| did not encounter them (they were used mostly internally). Perl | |
| 5.10.0 extended them, such that assigning C<__PACKAGE__> or a | |
| hash key to a scalar would make it copy-on-write. Several parts | |
| of Perl were not updated to account for them, but have now been fixed. | |
| =over | |
| =item * | |
| C<utf8::decode> had a nasty bug that would modify copy-on-write scalars' | |
| string buffers in place (i.e., skipping the copy). This could result in | |
| hashes having two elements with the same key [perl #91834]. (5.14.2) | |
| =item * | |
| Lvalue subroutines were not allowing COW scalars to be returned. This was | |
| fixed for lvalue scalar context in Perl 5.12.3 and 5.14.0, but list context | |
| was not fixed until this release. | |
| =item * | |
| Elements of restricted hashes (see the L<fields> pragma) containing | |
| copy-on-write values couldn't be deleted, nor could such hashes be cleared | |
| (C<%hash = ()>). (5.14.2) | |
| =item * | |
| Localizing a tied variable used to make it read-only if it contained a | |
| copy-on-write string. (5.14.2) | |
| =item * | |
| Assigning a copy-on-write string to a stash | |
| element no longer causes a double free. Regardless of this change, the | |
| results of such assignments are still undefined. | |
| =item * | |
| Assigning a copy-on-write string to a tied variable no longer stops that | |
| variable from being tied if it happens to be a PVMG or PVLV internally. | |
| =item * | |
| Doing a substitution on a tied variable returning a copy-on-write | |
| scalar used to cause an assertion failure or an "Attempt to free | |
| nonexistent shared string" warning. | |
| =item * | |
| This one is a regression from 5.12: In 5.14.0, the bitwise assignment | |
| operators C<|=>, C<^=> and C<&=> started leaving the left-hand side | |
| undefined if it happened to be a copy-on-write string [perl #108480]. | |
| =item * | |
| L<Storable>, L<Devel::Peek> and L<PerlIO::scalar> had similar problems. | |
| See L</Updated Modules and Pragmata>, above. | |
| =back | |
| =head2 The debugger | |
| =over | |
| =item * | |
| F<dumpvar.pl>, and therefore the C<x> command in the debugger, have been | |
| fixed to handle objects blessed into classes whose names contain "=". The | |
| contents of such objects used not to be dumped [perl #101814]. | |
| =item * | |
| The "R" command for restarting a debugger session has been fixed to work on | |
| Windows, or any other system lacking a C<POSIX::_SC_OPEN_MAX> constant | |
| [perl #87740]. | |
| =item * | |
| The C<#line 42 foo> directive used not to update the arrays of lines used | |
| by the debugger if it occurred in a string eval. This was partially fixed | |
| in 5.14, but it worked only for a single C<#line 42 foo> in each eval. Now | |
| it works for multiple. | |
| =item * | |
| When subroutine calls are intercepted by the debugger, the name of the | |
| subroutine or a reference to it is stored in C<$DB::sub>, for the debugger | |
| to access. Sometimes (such as C<$foo = *bar; undef *bar; &$foo>) | |
| C<$DB::sub> would be set to a name that could not be used to find the | |
| subroutine, and so the debugger's attempt to call it would fail. Now the | |
| check to see whether a reference is needed is more robust, so those | |
| problems should not happen anymore [rt.cpan.org #69862]. | |
| =item * | |
| Every subroutine has a filename associated with it that the debugger uses. | |
| The one associated with constant subroutines used to be misallocated when | |
| cloned under threads. Consequently, debugging threaded applications could | |
| result in memory corruption [perl #96126]. | |
| =back | |
| =head2 Dereferencing operators | |
| =over | |
| =item * | |
| C<defined(${"..."})>, C<defined(*{"..."})>, etc., used to | |
| return true for most, but not all built-in variables, if | |
| they had not been used yet. This bug affected C<${^GLOBAL_PHASE}> and | |
| C<${^UTF8CACHE}>, among others. It also used to return false if the | |
| package name was given as well (C<${"::!"}>) [perl #97978, #97492]. | |
| =item * | |
| Perl 5.10.0 introduced a similar bug: C<defined(*{"foo"})> where "foo" | |
| represents the name of a built-in global variable used to return false if | |
| the variable had never been used before, but only on the I<first> call. | |
| This, too, has been fixed. | |
| =item * | |
| Since 5.6.0, C<*{ ... }> has been inconsistent in how it treats undefined | |
| values. It would die in strict mode or lvalue context for most undefined | |
| values, but would be treated as the empty string (with a warning) for the | |
| specific scalar return by C<undef()> (C<&PL_sv_undef> internally). This | |
| has been corrected. C<undef()> is now treated like other undefined | |
| scalars, as in Perl 5.005. | |
| =back | |
| =head2 Filehandle, last-accessed | |
| Perl has an internal variable that stores the last filehandle to be | |
| accessed. It is used by C<$.> and by C<tell> and C<eof> without | |
| arguments. | |
| =over | |
| =item * | |
| It used to be possible to set this internal variable to a glob copy and | |
| then modify that glob copy to be something other than a glob, and still | |
| have the last-accessed filehandle associated with the variable after | |
| assigning a glob to it again: | |
| my $foo = *STDOUT; # $foo is a glob copy | |
| <$foo>; # $foo is now the last-accessed handle | |
| $foo = 3; # no longer a glob | |
| $foo = *STDERR; # still the last-accessed handle | |
| Now the C<$foo = 3> assignment unsets that internal variable, so there | |
| is no last-accessed filehandle, just as if C<< <$foo> >> had never | |
| happened. | |
| This also prevents some unrelated handle from becoming the last-accessed | |
| handle if $foo falls out of scope and the same internal SV gets used for | |
| another handle [perl #97988]. | |
| =item * | |
| A regression in 5.14 caused these statements not to set that internal | |
| variable: | |
| my $fh = *STDOUT; | |
| tell $fh; | |
| eof $fh; | |
| seek $fh, 0,0; | |
| tell *$fh; | |
| eof *$fh; | |
| seek *$fh, 0,0; | |
| readline *$fh; | |
| This is now fixed, but C<tell *{ *$fh }> still has the problem, and it | |
| is not clear how to fix it [perl #106536]. | |
| =back | |
| =head2 Filetests and C<stat> | |
| The term "filetests" refers to the operators that consist of a hyphen | |
| followed by a single letter: C<-r>, C<-x>, C<-M>, etc. The term "stacked" | |
| when applied to filetests means followed by another filetest operator | |
| sharing the same operand, as in C<-r -x -w $fooo>. | |
| =over | |
| =item * | |
| C<stat> produces more consistent warnings. It no longer warns for "_" | |
| [perl #71002] and no longer skips the warning at times for other unopened | |
| handles. It no longer warns about an unopened handle when the operating | |
| system's C<fstat> function fails. | |
| =item * | |
| C<stat> would sometimes return negative numbers for large inode numbers, | |
| because it was using the wrong internal C type. [perl #84590] | |
| =item * | |
| C<lstat> is documented to fall back to C<stat> (with a warning) when given | |
| a filehandle. When passed an IO reference, it was actually doing the | |
| equivalent of S<C<stat _>> and ignoring the handle. | |
| =item * | |
| C<-T _> with no preceding C<stat> used to produce a | |
| confusing "uninitialized" warning, even though there | |
| is no visible uninitialized value to speak of. | |
| =item * | |
| C<-T>, C<-B>, C<-l> and C<-t> now work | |
| when stacked with other filetest operators | |
| [perl #77388]. | |
| =item * | |
| In 5.14.0, filetest ops (C<-r>, C<-x>, etc.) started calling FETCH on a | |
| tied argument belonging to the previous argument to a list operator, if | |
| called with a bareword argument or no argument at all. This has been | |
| fixed, so C<push @foo, $tied, -r> no longer calls FETCH on C<$tied>. | |
| =item * | |
| In Perl 5.6, C<-l> followed by anything other than a bareword would treat | |
| its argument as a file name. That was changed in 5.8 for glob references | |
| (C<\*foo>), but not for globs themselves (C<*foo>). C<-l> started | |
| returning C<undef> for glob references without setting the last | |
| stat buffer that the "_" handle uses, but only if warnings | |
| were turned on. With warnings off, it was the same as 5.6. | |
| In other words, it was simply buggy and inconsistent. Now the 5.6 | |
| behavior has been restored. | |
| =item * | |
| C<-l> followed by a bareword no longer "eats" the previous argument to | |
| the list operator in whose argument list it resides. Hence, | |
| C<print "bar", -l foo> now actually prints "bar", because C<-l> | |
| on longer eats it. | |
| =item * | |
| Perl keeps several internal variables to keep track of the last stat | |
| buffer, from which file(handle) it originated, what type it was, and | |
| whether the last stat succeeded. | |
| There were various cases where these could get out of synch, resulting in | |
| inconsistent or erratic behavior in edge cases (every mention of C<-T> | |
| applies to C<-B> as well): | |
| =over | |
| =item * | |
| C<-T I<HANDLE>>, even though it does a C<stat>, was not resetting the last | |
| stat type, so an C<lstat _> following it would merrily return the wrong | |
| results. Also, it was not setting the success status. | |
| =item * | |
| Freeing the handle last used by C<stat> or a filetest could result in | |
| S<C<-T _>> using an unrelated handle. | |
| =item * | |
| C<stat> with an IO reference would not reset the stat type or record the | |
| filehandle for S<C<-T _>> to use. | |
| =item * | |
| Fatal warnings could cause the stat buffer not to be reset | |
| for a filetest operator on an unopened filehandle or C<-l> on any handle. | |
| Fatal warnings also stopped C<-T> from setting C<$!>. | |
| =item * | |
| When the last stat was on an unreadable file, C<-T _> is supposed to | |
| return C<undef>, leaving the last stat buffer unchanged. But it was | |
| setting the stat type, causing C<lstat _> to stop working. | |
| =item * | |
| C<-T I<FILENAME>> was not resetting the internal stat buffers for | |
| unreadable files. | |
| =back | |
| These have all been fixed. | |
| =back | |
| =head2 Formats | |
| =over | |
| =item * | |
| Several edge cases have been fixed with formats and C<formline>; | |
| in particular, where the format itself is potentially variable (such as | |
| with ties and overloading), and where the format and data differ in their | |
| encoding. In both these cases, it used to possible for the output to be | |
| corrupted [perl #91032]. | |
| =item * | |
| C<formline> no longer converts its argument into a string in-place. So | |
| passing a reference to C<formline> no longer destroys the reference | |
| [perl #79532]. | |
| =item * | |
| Assignment to C<$^A> (the format output accumulator) now recalculates | |
| the number of lines output. | |
| =back | |
| =head2 C<given> and C<when> | |
| =over | |
| =item * | |
| C<given> was not scoping its implicit $_ properly, resulting in memory | |
| leaks or "Variable is not available" warnings [perl #94682]. | |
| =item * | |
| C<given> was not calling set-magic on the implicit lexical C<$_> that it | |
| uses. This meant, for example, that C<pos> would be remembered from one | |
| execution of the same C<given> block to the next, even if the input were a | |
| different variable [perl #84526]. | |
| =item * | |
| C<when> blocks are now capable of returning variables declared inside the | |
| enclosing C<given> block [perl #93548]. | |
| =back | |
| =head2 The C<glob> operator | |
| =over | |
| =item * | |
| On OSes other than VMS, Perl's C<glob> operator (and the C<< <...> >> form) | |
| use L<File::Glob> underneath. L<File::Glob> splits the pattern into words, | |
| before feeding each word to its C<bsd_glob> function. | |
| There were several inconsistencies in the way the split was done. Now | |
| quotation marks (' and ") are always treated as shell-style word delimiters | |
| (that allow whitespace as part of a word) and backslashes are always | |
| preserved, unless they exist to escape quotation marks. Before, those | |
| would only sometimes be the case, depending on whether the pattern | |
| contained whitespace. Also, escaped whitespace at the end of the pattern | |
| is no longer stripped [perl #40470]. | |
| =item * | |
| C<CORE::glob> now works as a way to call the default globbing function. It | |
| used to respect overrides, despite the C<CORE::> prefix. | |
| =item * | |
| Under miniperl (used to configure modules when perl itself is built), | |
| C<glob> now clears %ENV before calling csh, since the latter croaks on some | |
| systems if it does not like the contents of the LS_COLORS environment | |
| variable [perl #98662]. | |
| =back | |
| =head2 Lvalue subroutines | |
| =over | |
| =item * | |
| Explicit return now returns the actual argument passed to return, instead | |
| of copying it [perl #72724, #72706]. | |
| =item * | |
| Lvalue subroutines used to enforce lvalue syntax (i.e., whatever can go on | |
| the left-hand side of C<=>) for the last statement and the arguments to | |
| return. Since lvalue subroutines are not always called in lvalue context, | |
| this restriction has been lifted. | |
| =item * | |
| Lvalue subroutines are less restrictive about what values can be returned. | |
| It used to croak on values returned by C<shift> and C<delete> and from | |
| other subroutines, but no longer does so [perl #71172]. | |
| =item * | |
| Empty lvalue subroutines (C<sub :lvalue {}>) used to return C<@_> in list | |
| context. All subroutines used to do this, but regular subs were fixed in | |
| Perl 5.8.2. Now lvalue subroutines have been likewise fixed. | |
| =item * | |
| Autovivification now works on values returned from lvalue subroutines | |
| [perl #7946], as does returning C<keys> in lvalue context. | |
| =item * | |
| Lvalue subroutines used to copy their return values in rvalue context. Not | |
| only was this a waste of CPU cycles, but it also caused bugs. A C<($)> | |
| prototype would cause an lvalue sub to copy its return value [perl #51408], | |
| and C<while(lvalue_sub() =~ m/.../g) { ... }> would loop endlessly | |
| [perl #78680]. | |
| =item * | |
| When called in potential lvalue context | |
| (e.g., subroutine arguments or a list | |
| passed to C<for>), lvalue subroutines used to copy | |
| any read-only value that was returned. E.g., C< sub :lvalue { $] } > | |
| would not return C<$]>, but a copy of it. | |
| =item * | |
| When called in potential lvalue context, an lvalue subroutine returning | |
| arrays or hashes used to bind the arrays or hashes to scalar variables, | |
| resulting in bugs. This was fixed in 5.14.0 if an array were the first | |
| thing returned from the subroutine (but not for C<$scalar, @array> or | |
| hashes being returned). Now a more general fix has been applied | |
| [perl #23790]. | |
| =item * | |
| Method calls whose arguments were all surrounded with C<my()> or C<our()> | |
| (as in C<< $object->method(my($a,$b)) >>) used to force lvalue context on | |
| the subroutine. This would prevent lvalue methods from returning certain | |
| values. | |
| =item * | |
| Lvalue sub calls that are not determined to be such at compile time | |
| (C<&$name> or &{"name"}) are no longer exempt from strict refs if they | |
| occur in the last statement of an lvalue subroutine [perl #102486]. | |
| =item * | |
| Sub calls whose subs are not visible at compile time, if | |
| they occurred in the last statement of an lvalue subroutine, | |
| would reject non-lvalue subroutines and die with "Can't modify non-lvalue | |
| subroutine call" [perl #102486]. | |
| Non-lvalue sub calls whose subs I<are> visible at compile time exhibited | |
| the opposite bug. If the call occurred in the last statement of an lvalue | |
| subroutine, there would be no error when the lvalue sub was called in | |
| lvalue context. Perl would blindly assign to the temporary value returned | |
| by the non-lvalue subroutine. | |
| =item * | |
| C<AUTOLOAD> routines used to take precedence over the actual sub being | |
| called (i.e., when autoloading wasn't needed), for sub calls in lvalue or | |
| potential lvalue context, if the subroutine was not visible at compile | |
| time. | |
| =item * | |
| Applying the C<:lvalue> attribute to an XSUB or to an aliased subroutine | |
| stub with C<< sub foo :lvalue; >> syntax stopped working in Perl 5.12. | |
| This has been fixed. | |
| =item * | |
| Applying the :lvalue attribute to subroutine that is already defined does | |
| not work properly, as the attribute changes the way the sub is compiled. | |
| Hence, Perl 5.12 began warning when an attempt is made to apply the | |
| attribute to an already defined sub. In such cases, the attribute is | |
| discarded. | |
| But the change in 5.12 missed the case where custom attributes are also | |
| present: that case still silently and ineffectively applied the attribute. | |
| That omission has now been corrected. C<sub foo :lvalue :Whatever> (when | |
| C<foo> is already defined) now warns about the :lvalue attribute, and does | |
| not apply it. | |
| =item * | |
| A bug affecting lvalue context propagation through nested lvalue subroutine | |
| calls has been fixed. Previously, returning a value in nested rvalue | |
| context would be treated as lvalue context by the inner subroutine call, | |
| resulting in some values (such as read-only values) being rejected. | |
| =back | |
| =head2 Overloading | |
| =over | |
| =item * | |
| Arithmetic assignment (C<$left += $right>) involving overloaded objects | |
| that rely on the 'nomethod' override no longer segfault when the left | |
| operand is not overloaded. | |
| =item * | |
| Errors that occur when methods cannot be found during overloading now | |
| mention the correct package name, as they did in 5.8.x, instead of | |
| erroneously mentioning the "overload" package, as they have since 5.10.0. | |
| =item * | |
| Undefining C<%overload::> no longer causes a crash. | |
| =back | |
| =head2 Prototypes of built-in keywords | |
| =over | |
| =item * | |
| The C<prototype> function no longer dies for the C<__FILE__>, C<__LINE__> | |
| and C<__PACKAGE__> directives. It now returns an empty-string prototype | |
| for them, because they are syntactically indistinguishable from nullary | |
| functions like C<time>. | |
| =item * | |
| C<prototype> now returns C<undef> for all overridable infix operators, | |
| such as C<eq>, which are not callable in any way resembling functions. | |
| It used to return incorrect prototypes for some and die for others | |
| [perl #94984]. | |
| =item * | |
| The prototypes of several built-in functions--C<getprotobynumber>, C<lock>, | |
| C<not> and C<select>--have been corrected, or at least are now closer to | |
| reality than before. | |
| =back | |
| =head2 Regular expressions | |
| =for comment Is it possible to merge some of these items? | |
| =over 4 | |
| =item * | |
| C</[[:ascii:]]/> and C</[[:blank:]]/> now use locale rules under | |
| C<use locale> when the platform supports that. Previously, they used | |
| the platform's native character set. | |
| =item * | |
| C<m/[[:ascii:]]/i> and C</\p{ASCII}/i> now match identically (when not | |
| under a differing locale). This fixes a regression introduced in 5.14 | |
| in which the first expression could match characters outside of ASCII, | |
| such as the KELVIN SIGN. | |
| =item * | |
| C</.*/g> would sometimes refuse to match at the end of a string that ends | |
| with "\n". This has been fixed [perl #109206]. | |
| =item * | |
| Starting with 5.12.0, Perl used to get its internal bookkeeping muddled up | |
| after assigning C<${ qr// }> to a hash element and locking it with | |
| L<Hash::Util>. This could result in double frees, crashes, or erratic | |
| behavior. | |
| =item * | |
| The new (in 5.14.0) regular expression modifier C</a> when repeated like | |
| C</aa> forbids the characters outside the ASCII range that match | |
| characters inside that range from matching under C</i>. This did not | |
| work under some circumstances, all involving alternation, such as: | |
| "\N{KELVIN SIGN}" =~ /k|foo/iaa; | |
| succeeded inappropriately. This is now fixed. | |
| =item * | |
| 5.14.0 introduced some memory leaks in regular expression character | |
| classes such as C<[\w\s]>, which have now been fixed. (5.14.1) | |
| =item * | |
| An edge case in regular expression matching could potentially loop. | |
| This happened only under C</i> in bracketed character classes that have | |
| characters with multi-character folds, and the target string to match | |
| against includes the first portion of the fold, followed by another | |
| character that has a multi-character fold that begins with the remaining | |
| portion of the fold, plus some more. | |
| "s\N{U+DF}" =~ /[\x{DF}foo]/i | |
| is one such case. C<\xDF> folds to C<"ss">. (5.14.1) | |
| =item * | |
| A few characters in regular expression pattern matches did not | |
| match correctly in some circumstances, all involving C</i>. The | |
| affected characters are: | |
| COMBINING GREEK YPOGEGRAMMENI, | |
| GREEK CAPITAL LETTER IOTA, | |
| GREEK CAPITAL LETTER UPSILON, | |
| GREEK PROSGEGRAMMENI, | |
| GREEK SMALL LETTER IOTA WITH DIALYTIKA AND OXIA, | |
| GREEK SMALL LETTER IOTA WITH DIALYTIKA AND TONOS, | |
| GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND OXIA, | |
| GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND TONOS, | |
| LATIN SMALL LETTER LONG S, | |
| LATIN SMALL LIGATURE LONG S T, | |
| and | |
| LATIN SMALL LIGATURE ST. | |
| =item * | |
| A memory leak regression in regular expression compilation | |
| under threading has been fixed. | |
| =item * | |
| A regression introduced in 5.14.0 has | |
| been fixed. This involved an inverted | |
| bracketed character class in a regular expression that consisted solely | |
| of a Unicode property. That property wasn't getting inverted outside the | |
| Latin1 range. | |
| =item * | |
| Three problematic Unicode characters now work better in regex pattern matching under C</i>. | |
| In the past, three Unicode characters: | |
| LATIN SMALL LETTER SHARP S, | |
| GREEK SMALL LETTER IOTA WITH DIALYTIKA AND TONOS, | |
| and | |
| GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND TONOS, | |
| along with the sequences that they fold to | |
| (including "ss" for LATIN SMALL LETTER SHARP S), | |
| did not properly match under C</i>. 5.14.0 fixed some of these cases, | |
| but introduced others, including a panic when one of the characters or | |
| sequences was used in the C<(?(DEFINE)> regular expression predicate. | |
| The known bugs that were introduced in 5.14 have now been fixed; as well | |
| as some other edge cases that have never worked until now. These all | |
| involve using the characters and sequences outside bracketed character | |
| classes under C</i>. This closes [perl #98546]. | |
| There remain known problems when using certain characters with | |
| multi-character folds inside bracketed character classes, including such | |
| constructs as C<qr/[\N{LATIN SMALL LETTER SHARP}a-z]/i>. These | |
| remaining bugs are addressed in [perl #89774]. | |
| =item * | |
| RT #78266: The regex engine has been leaking memory when accessing | |
| named captures that weren't matched as part of a regex ever since 5.10 | |
| when they were introduced; e.g., this would consume over a hundred MB of | |
| memory: | |
| for (1..10_000_000) { | |
| if ("foo" =~ /(foo|(?<capture>bar))?/) { | |
| my $capture = $+{capture} | |
| } | |
| } | |
| system "ps -o rss $$"' | |
| =item * | |
| In 5.14, C</[[:lower:]]/i> and C</[[:upper:]]/i> no longer matched the | |
| opposite case. This has been fixed [perl #101970]. | |
| =item * | |
| A regular expression match with an overloaded object on the right-hand side | |
| would sometimes stringify the object too many times. | |
| =item * | |
| A regression has been fixed that was introduced in 5.14, in C</i> | |
| regular expression matching, in which a match improperly fails if the | |
| pattern is in UTF-8, the target string is not, and a Latin-1 character | |
| precedes a character in the string that should match the pattern. | |
| [perl #101710] | |
| =item * | |
| In case-insensitive regular expression pattern matching, no longer on | |
| UTF-8 encoded strings does the scan for the start of match look only at | |
| the first possible position. This caused matches such as | |
| C<"f\x{FB00}" =~ /ff/i> to fail. | |
| =item * | |
| The regexp optimizer no longer crashes on debugging builds when merging | |
| fixed-string nodes with inconvenient contents. | |
| =item * | |
| A panic involving the combination of the regular expression modifiers | |
| C</aa> and the C<\b> escape sequence introduced in 5.14.0 has been | |
| fixed [perl #95964]. (5.14.2) | |
| =item * | |
| The combination of the regular expression modifiers C</aa> and the C<\b> | |
| and C<\B> escape sequences did not work properly on UTF-8 encoded | |
| strings. All non-ASCII characters under C</aa> should be treated as | |
| non-word characters, but what was happening was that Unicode rules were | |
| used to determine wordness/non-wordness for non-ASCII characters. This | |
| is now fixed [perl #95968]. | |
| =item * | |
| C<< (?foo: ...) >> no longer loses passed in character set. | |
| =item * | |
| The trie optimization used to have problems with alternations containing | |
| an empty C<(?:)>, causing C<< "x" =~ /\A(?>(?:(?:)A|B|C?x))\z/ >> not to | |
| match, whereas it should [perl #111842]. | |
| =item * | |
| Use of lexical (C<my>) variables in code blocks embedded in regular | |
| expressions will no longer result in memory corruption or crashes. | |
| Nevertheless, these code blocks are still experimental, as there are still | |
| problems with the wrong variables being closed over (in loops for instance) | |
| and with abnormal exiting (e.g., C<die>) causing memory corruption. | |
| =item * | |
| The C<\h>, C<\H>, C<\v> and C<\V> regular expression metacharacters used to | |
| cause a panic error message when trying to match at the end of the | |
| string [perl #96354]. | |
| =item * | |
| The abbreviations for four C1 control characters C<MW> C<PM>, C<RI>, and | |
| C<ST> were previously unrecognized by C<\N{}>, vianame(), and | |
| string_vianame(). | |
| =item * | |
| Mentioning a variable named "&" other than C<$&> (i.e., C<@&> or C<%&>) no | |
| longer stops C<$&> from working. The same applies to variables named "'" | |
| and "`" [perl #24237]. | |
| =item * | |
| Creating a C<UNIVERSAL::AUTOLOAD> sub no longer stops C<%+>, C<%-> and | |
| C<%!> from working some of the time [perl #105024]. | |
| =back | |
| =head2 Smartmatching | |
| =over | |
| =item * | |
| C<~~> now correctly handles the precedence of Any~~Object, and is not tricked | |
| by an overloaded object on the left-hand side. | |
| =item * | |
| In Perl 5.14.0, C<$tainted ~~ @array> stopped working properly. Sometimes | |
| it would erroneously fail (when C<$tainted> contained a string that occurs | |
| in the array I<after> the first element) or erroneously succeed (when | |
| C<undef> occurred after the first element) [perl #93590]. | |
| =back | |
| =head2 The C<sort> operator | |
| =over | |
| =item * | |
| C<sort> was not treating C<sub {}> and C<sub {()}> as equivalent when | |
| such a sub was provided as the comparison routine. It used to croak on | |
| C<sub {()}>. | |
| =item * | |
| C<sort> now works once more with custom sort routines that are XSUBs. It | |
| stopped working in 5.10.0. | |
| =item * | |
| C<sort> with a constant for a custom sort routine, although it produces | |
| unsorted results, no longer crashes. It started crashing in 5.10.0. | |
| =item * | |
| Warnings emitted by C<sort> when a custom comparison routine returns a | |
| non-numeric value now contain "in sort" and show the line number of the | |
| C<sort> operator, rather than the last line of the comparison routine. The | |
| warnings also now occur only if warnings are enabled in the scope where | |
| C<sort> occurs. Previously the warnings would occur if enabled in the | |
| comparison routine's scope. | |
| =item * | |
| C<< sort { $a <=> $b } >>, which is optimized internally, now produces | |
| "uninitialized" warnings for NaNs (not-a-number values), since C<< <=> >> | |
| returns C<undef> for those. This brings it in line with | |
| S<C<< sort { 1; $a <=> $b } >>> and other more complex cases, which are not | |
| optimized [perl #94390]. | |
| =back | |
| =head2 The C<substr> operator | |
| =over | |
| =item * | |
| Tied (and otherwise magical) variables are no longer exempt from the | |
| "Attempt to use reference as lvalue in substr" warning. | |
| =item * | |
| That warning now occurs when the returned lvalue is assigned to, not | |
| when C<substr> itself is called. This makes a difference only if the | |
| return value of C<substr> is referenced and later assigned to. | |
| =item * | |
| Passing a substring of a read-only value or a typeglob to a function | |
| (potential lvalue context) no longer causes an immediate "Can't coerce" | |
| or "Modification of a read-only value" error. That error occurs only | |
| if the passed value is assigned to. | |
| The same thing happens with the "substr outside of string" error. If | |
| the lvalue is only read from, not written to, it is now just a warning, as | |
| with rvalue C<substr>. | |
| =item * | |
| C<substr> assignments no longer call FETCH twice if the first argument | |
| is a tied variable, just once. | |
| =back | |
| =head2 Support for embedded nulls | |
| Some parts of Perl did not work correctly with nulls (C<chr 0>) embedded in | |
| strings. That meant that, for instance, C<< $m = "a\0b"; foo->$m >> would | |
| call the "a" method, instead of the actual method name contained in $m. | |
| These parts of perl have been fixed to support nulls: | |
| =over | |
| =item * | |
| Method names | |
| =item * | |
| Typeglob names (including filehandle and subroutine names) | |
| =item * | |
| Package names, including the return value of C<ref()> | |
| =item * | |
| Typeglob elements (C<*foo{"THING\0stuff"}>) | |
| =item * | |
| Signal names | |
| =item * | |
| Various warnings and error messages that mention variable names or values, | |
| methods, etc. | |
| =back | |
| One side effect of these changes is that blessing into "\0" no longer | |
| causes C<ref()> to return false. | |
| =head2 Threading bugs | |
| =over | |
| =item * | |
| Typeglobs returned from threads are no longer cloned if the parent thread | |
| already has a glob with the same name. This means that returned | |
| subroutines will now assign to the right package variables [perl #107366]. | |
| =item * | |
| Some cases of threads crashing due to memory allocation during cloning have | |
| been fixed [perl #90006]. | |
| =item * | |
| Thread joining would sometimes emit "Attempt to free unreferenced scalar" | |
| warnings if C<caller> had been used from the C<DB> package before thread | |
| creation [perl #98092]. | |
| =item * | |
| Locking a subroutine (via C<lock &sub>) is no longer a compile-time error | |
| for regular subs. For lvalue subroutines, it no longer tries to return the | |
| sub as a scalar, resulting in strange side effects like C<ref \$_> | |
| returning "CODE" in some instances. | |
| C<lock &sub> is now a run-time error if L<threads::shared> is loaded (a | |
| no-op otherwise), but that may be rectified in a future version. | |
| =back | |
| =head2 Tied variables | |
| =over | |
| =item * | |
| Various cases in which FETCH was being ignored or called too many times | |
| have been fixed: | |
| =over | |
| =item * | |
| C<PerlIO::get_layers> [perl #97956] | |
| =item * | |
| C<$tied =~ y/a/b/>, C<chop $tied> and C<chomp $tied> when $tied holds a | |
| reference. | |
| =item * | |
| When calling C<local $_> [perl #105912] | |
| =item * | |
| Four-argument C<select> | |
| =item * | |
| A tied buffer passed to C<sysread> | |
| =item * | |
| C<< $tied .= <> >> | |
| =item * | |
| Three-argument C<open>, the third being a tied file handle | |
| (as in C<< open $fh, ">&", $tied >>) | |
| =item * | |
| C<sort> with a reference to a tied glob for the comparison routine. | |
| =item * | |
| C<..> and C<...> in list context [perl #53554]. | |
| =item * | |
| C<${$tied}>, C<@{$tied}>, C<%{$tied}> and C<*{$tied}> where the tied | |
| variable returns a string (C<&{}> was unaffected) | |
| =item * | |
| C<defined ${ $tied_variable }> | |
| =item * | |
| Various functions that take a filehandle argument in rvalue context | |
| (C<close>, C<readline>, etc.) [perl #97482] | |
| =item * | |
| Some cases of dereferencing a complex expression, such as | |
| C<${ (), $tied } = 1>, used to call C<FETCH> multiple times, but now call | |
| it once. | |
| =item * | |
| C<$tied-E<gt>method> where $tied returns a package name--even resulting in | |
| a failure to call the method, due to memory corruption | |
| =item * | |
| Assignments like C<*$tied = \&{"..."}> and C<*glob = $tied> | |
| =item * | |
| C<chdir>, C<chmod>, C<chown>, C<utime>, C<truncate>, C<stat>, C<lstat> and | |
| the filetest ops (C<-r>, C<-x>, etc.) | |
| =back | |
| =item * | |
| C<caller> sets C<@DB::args> to the subroutine arguments when called from | |
| the DB package. It used to crash when doing so if C<@DB::args> happened to | |
| be tied. Now it croaks instead. | |
| =item * | |
| Tying an element of %ENV or C<%^H> and then deleting that element would | |
| result in a call to the tie object's DELETE method, even though tying the | |
| element itself is supposed to be equivalent to tying a scalar (the element | |
| is, of course, a scalar) [perl #67490]. | |
| =item * | |
| When Perl autovivifies an element of a tied array or hash (which entails | |
| calling STORE with a new reference), it now calls FETCH immediately after | |
| the STORE, instead of assuming that FETCH would have returned the same | |
| reference. This can make it easier to implement tied objects [perl #35865, #43011]. | |
| =item * | |
| Four-argument C<select> no longer produces its "Non-string passed as | |
| bitmask" warning on tied or tainted variables that are strings. | |
| =item * | |
| Localizing a tied scalar that returns a typeglob no longer stops it from | |
| being tied till the end of the scope. | |
| =item * | |
| Attempting to C<goto> out of a tied handle method used to cause memory | |
| corruption or crashes. Now it produces an error message instead | |
| [perl #8611]. | |
| =item * | |
| A bug has been fixed that occurs when a tied variable is used as a | |
| subroutine reference: if the last thing assigned to or returned from the | |
| variable was a reference or typeglob, the C<\&$tied> could either crash or | |
| return the wrong subroutine. The reference case is a regression introduced | |
| in Perl 5.10.0. For typeglobs, it has probably never worked till now. | |
| =back | |
| =head2 Version objects and vstrings | |
| =over | |
| =item * | |
| The bitwise complement operator (and possibly other operators, too) when | |
| passed a vstring would leave vstring magic attached to the return value, | |
| even though the string had changed. This meant that | |
| C<< version->new(~v1.2.3) >> would create a version looking like "v1.2.3" | |
| even though the string passed to C<< version->new >> was actually | |
| "\376\375\374". This also caused L<B::Deparse> to deparse C<~v1.2.3> | |
| incorrectly, without the C<~> [perl #29070]. | |
| =item * | |
| Assigning a vstring to a magic (e.g., tied, C<$!>) variable and then | |
| assigning something else used to blow away all magic. This meant that | |
| tied variables would come undone, C<$!> would stop getting updated on | |
| failed system calls, C<$|> would stop setting autoflush, and other | |
| mischief would take place. This has been fixed. | |
| =item * | |
| C<< version->new("version") >> and C<printf "%vd", "version"> no longer | |
| crash [perl #102586]. | |
| =item * | |
| Version comparisons, such as those that happen implicitly with C<use | |
| v5.43>, no longer cause locale settings to change [perl #105784]. | |
| =item * | |
| Version objects no longer cause memory leaks in boolean context | |
| [perl #109762]. | |
| =back | |
| =head2 Warnings, redefinition | |
| =over | |
| =item * | |
| Subroutines from the C<autouse> namespace are once more exempt from | |
| redefinition warnings. This used to work in 5.005, but was broken in | |
| 5.6 for most subroutines. For subs created via XS that redefine | |
| subroutines from the C<autouse> package, this stopped working in 5.10. | |
| =item * | |
| New XSUBs now produce redefinition warnings if they overwrite existing | |
| subs, as they did in 5.8.x. (The C<autouse> logic was reversed in | |
| 5.10-14. Only subroutines from the C<autouse> namespace would warn | |
| when clobbered.) | |
| =item * | |
| C<newCONSTSUB> used to use compile-time warning hints, instead of | |
| run-time hints. The following code should never produce a redefinition | |
| warning, but it used to, if C<newCONSTSUB> redefined an existing | |
| subroutine: | |
| use warnings; | |
| BEGIN { | |
| no warnings; | |
| some_XS_function_that_calls_new_CONSTSUB(); | |
| } | |
| =item * | |
| Redefinition warnings for constant subroutines are on by default (what | |
| are known as severe warnings in L<perldiag>). This occurred only | |
| when it was a glob assignment or declaration of a Perl subroutine that | |
| caused the warning. If the creation of XSUBs triggered the warning, it | |
| was not a default warning. This has been corrected. | |
| =item * | |
| The internal check to see whether a redefinition warning should occur | |
| used to emit "uninitialized" warnings in cases like this: | |
| use warnings "uninitialized"; | |
| use constant {u => undef, v => undef}; | |
| sub foo(){u} | |
| sub foo(){v} | |
| =back | |
| =head2 Warnings, "Uninitialized" | |
| =over | |
| =item * | |
| Various functions that take a filehandle argument in rvalue context | |
| (C<close>, C<readline>, etc.) used to warn twice for an undefined handle | |
| [perl #97482]. | |
| =item * | |
| C<dbmopen> now only warns once, rather than three times, if the mode | |
| argument is C<undef> [perl #90064]. | |
| =item * | |
| The C<+=> operator does not usually warn when the left-hand side is | |
| C<undef>, but it was doing so for tied variables. This has been fixed | |
| [perl #44895]. | |
| =item * | |
| A bug fix in Perl 5.14 introduced a new bug, causing "uninitialized" | |
| warnings to report the wrong variable if the operator in question had | |
| two operands and one was C<%{...}> or C<@{...}>. This has been fixed | |
| [perl #103766]. | |
| =item * | |
| C<..> and C<...> in list context now mention the name of the variable in | |
| "uninitialized" warnings for string (as opposed to numeric) ranges. | |
| =back | |
| =head2 Weak references | |
| =over | |
| =item * | |
| Weakening the first argument to an automatically-invoked C<DESTROY> method | |
| could result in erroneous "DESTROY created new reference" errors or | |
| crashes. Now it is an error to weaken a read-only reference. | |
| =item * | |
| Weak references to lexical hashes going out of scope were not going stale | |
| (becoming undefined), but continued to point to the hash. | |
| =item * | |
| Weak references to lexical variables going out of scope are now broken | |
| before any magical methods (e.g., DESTROY on a tie object) are called. | |
| This prevents such methods from modifying the variable that will be seen | |
| the next time the scope is entered. | |
| =item * | |
| Creating a weak reference to an @ISA array or accessing the array index | |
| (C<$#ISA>) could result in confused internal bookkeeping for elements | |
| later added to the @ISA array. For instance, creating a weak | |
| reference to the element itself could push that weak reference on to @ISA; | |
| and elements added after use of C<$#ISA> would be ignored by method lookup | |
| [perl #85670]. | |
| =back | |
| =head2 Other notable fixes | |
| =over | |
| =item * | |
| C<quotemeta> now quotes consistently the same non-ASCII characters under | |
| C<use feature 'unicode_strings'>, regardless of whether the string is | |
| encoded in UTF-8 or not, hence fixing the last vestiges (we hope) of the | |
| notorious L<perlunicode/The "Unicode Bug">. [perl #77654]. | |
| Which of these code points is quoted has changed, based on Unicode's | |
| recommendations. See L<perlfunc/quotemeta> for details. | |
| =item * | |
| C<study> is now a no-op, presumably fixing all outstanding bugs related to | |
| study causing regex matches to behave incorrectly! | |
| =item * | |
| When one writes C<open foo || die>, which used to work in Perl 4, a | |
| "Precedence problem" warning is produced. This warning used erroneously to | |
| apply to fully-qualified bareword handle names not followed by C<||>. This | |
| has been corrected. | |
| =item * | |
| After package aliasing (C<*foo:: = *bar::>), C<select> with 0 or 1 argument | |
| would sometimes return a name that could not be used to refer to the | |
| filehandle, or sometimes it would return C<undef> even when a filehandle | |
| was selected. Now it returns a typeglob reference in such cases. | |
| =item * | |
| C<PerlIO::get_layers> no longer ignores some arguments that it thinks are | |
| numeric, while treating others as filehandle names. It is now consistent | |
| for flat scalars (i.e., not references). | |
| =item * | |
| Unrecognized switches on C<#!> line | |
| If a switch, such as B<-x>, that cannot occur on the C<#!> line is used | |
| there, perl dies with "Can't emulate...". | |
| It used to produce the same message for switches that perl did not | |
| recognize at all, whether on the command line or the C<#!> line. | |
| Now it produces the "Unrecognized switch" error message [perl #104288]. | |
| =item * | |
| C<system> now temporarily blocks the SIGCHLD signal handler, to prevent the | |
| signal handler from stealing the exit status [perl #105700]. | |
| =item * | |
| The %n formatting code for C<printf> and C<sprintf>, which causes the number | |
| of characters to be assigned to the next argument, now actually | |
| assigns the number of characters, instead of the number of bytes. | |
| It also works now with special lvalue functions like C<substr> and with | |
| nonexistent hash and array elements [perl #3471, #103492]. | |
| =item * | |
| Perl skips copying values returned from a subroutine, for the sake of | |
| speed, if doing so would make no observable difference. Because of faulty | |
| logic, this would happen with the | |
| result of C<delete>, C<shift> or C<splice>, even if the result was | |
| referenced elsewhere. It also did so with tied variables about to be freed | |
| [perl #91844, #95548]. | |
| =item * | |
| C<utf8::decode> now refuses to modify read-only scalars [perl #91850]. | |
| =item * | |
| Freeing $_ inside a C<grep> or C<map> block, a code block embedded in a | |
| regular expression, or an @INC filter (a subroutine returned by a | |
| subroutine in @INC) used to result in double frees or crashes | |
| [perl #91880, #92254, #92256]. | |
| =item * | |
| C<eval> returns C<undef> in scalar context or an empty list in list | |
| context when there is a run-time error. When C<eval> was passed a | |
| string in list context and a syntax error occurred, it used to return a | |
| list containing a single undefined element. Now it returns an empty | |
| list in list context for all errors [perl #80630]. | |
| =item * | |
| C<goto &func> no longer crashes, but produces an error message, when | |
| the unwinding of the current subroutine's scope fires a destructor that | |
| undefines the subroutine being "goneto" [perl #99850]. | |
| =item * | |
| Perl now holds an extra reference count on the package that code is | |
| currently compiling in. This means that the following code no longer | |
| crashes [perl #101486]: | |
| package Foo; | |
| BEGIN {*Foo:: = *Bar::} | |
| sub foo; | |
| =item * | |
| The C<x> repetition operator no longer crashes on 64-bit builds with large | |
| repeat counts [perl #94560]. | |
| =item * | |
| Calling C<require> on an implicit C<$_> when C<*CORE::GLOBAL::require> has | |
| been overridden does not segfault anymore, and C<$_> is now passed to the | |
| overriding subroutine [perl #78260]. | |
| =item * | |
| C<use> and C<require> are no longer affected by the I/O layers active in | |
| the caller's scope (enabled by L<open.pm|open>) [perl #96008]. | |
| =item * | |
| C<our $::é; $é> (which is invalid) no longer produces the "Compilation | |
| error at lib/utf8_heavy.pl..." error message, which it started emitting in | |
| 5.10.0 [perl #99984]. | |
| =item * | |
| On 64-bit systems, C<read()> now understands large string offsets beyond | |
| the 32-bit range. | |
| =item * | |
| Errors that occur when processing subroutine attributes no longer cause the | |
| subroutine's op tree to leak. | |
| =item * | |
| Passing the same constant subroutine to both C<index> and C<formline> no | |
| longer causes one or the other to fail [perl #89218]. (5.14.1) | |
| =item * | |
| List assignment to lexical variables declared with attributes in the same | |
| statement (C<my ($x,@y) : blimp = (72,94)>) stopped working in Perl 5.8.0. | |
| It has now been fixed. | |
| =item * | |
| Perl 5.10.0 introduced some faulty logic that made "U*" in the middle of | |
| a pack template equivalent to "U0" if the input string was empty. This has | |
| been fixed [perl #90160]. (5.14.2) | |
| =item * | |
| Destructors on objects were not called during global destruction on objects | |
| that were not referenced by any scalars. This could happen if an array | |
| element were blessed (e.g., C<bless \$a[0]>) or if a closure referenced a | |
| blessed variable (C<bless \my @a; sub foo { @a }>). | |
| Now there is an extra pass during global destruction to fire destructors on | |
| any objects that might be left after the usual passes that check for | |
| objects referenced by scalars [perl #36347]. | |
| =item * | |
| Fixed a case where it was possible that a freed buffer may have been read | |
| from when parsing a here document [perl #90128]. (5.14.1) | |
| =item * | |
| C<each(I<ARRAY>)> is now wrapped in C<defined(...)>, like C<each(I<HASH>)>, | |
| inside a C<while> condition [perl #90888]. | |
| =item * | |
| A problem with context propagation when a C<do> block is an argument to | |
| C<return> has been fixed. It used to cause C<undef> to be returned in | |
| certain cases of a C<return> inside an C<if> block which itself is followed by | |
| another C<return>. | |
| =item * | |
| Calling C<index> with a tainted constant no longer causes constants in | |
| subsequently compiled code to become tainted [perl #64804]. | |
| =item * | |
| Infinite loops like C<1 while 1> used to stop C<strict 'subs'> mode from | |
| working for the rest of the block. | |
| =item * | |
| For list assignments like C<($a,$b) = ($b,$a)>, Perl has to make a copy of | |
| the items on the right-hand side before assignment them to the left. For | |
| efficiency's sake, it assigns the values on the right straight to the items | |
| on the left if no one variable is mentioned on both sides, as in C<($a,$b) = | |
| ($c,$d)>. The logic for determining when it can cheat was faulty, in that | |
| C<&&> and C<||> on the right-hand side could fool it. So C<($a,$b) = | |
| $some_true_value && ($b,$a)> would end up assigning the value of C<$b> to | |
| both scalars. | |
| =item * | |
| Perl no longer tries to apply lvalue context to the string in | |
| C<("string", $variable) ||= 1> (which used to be an error). Since the | |
| left-hand side of C<||=> is evaluated in scalar context, that's a scalar | |
| comma operator, which gives all but the last item void context. There is | |
| no such thing as void lvalue context, so it was a mistake for Perl to try | |
| to force it [perl #96942]. | |
| =item * | |
| C<caller> no longer leaks memory when called from the DB package if | |
| C<@DB::args> was assigned to after the first call to C<caller>. L<Carp> | |
| was triggering this bug [perl #97010]. (5.14.2) | |
| =item * | |
| C<close> and similar filehandle functions, when called on built-in global | |
| variables (like C<$+>), used to die if the variable happened to hold the | |
| undefined value, instead of producing the usual "Use of uninitialized | |
| value" warning. | |
| =item * | |
| When autovivified file handles were introduced in Perl 5.6.0, C<readline> | |
| was inadvertently made to autovivify when called as C<readline($foo)> (but | |
| not as C<E<lt>$fooE<gt>>). It has now been fixed never to autovivify. | |
| =item * | |
| Calling an undefined anonymous subroutine (e.g., what $x holds after | |
| C<undef &{$x = sub{}}>) used to cause a "Not a CODE reference" error, which | |
| has been corrected to "Undefined subroutine called" [perl #71154]. | |
| =item * | |
| Causing C<@DB::args> to be freed between uses of C<caller> no longer | |
| results in a crash [perl #93320]. | |
| =item * | |
| C<setpgrp($foo)> used to be equivalent to C<($foo, setpgrp)>, because | |
| C<setpgrp> was ignoring its argument if there was just one. Now it is | |
| equivalent to C<setpgrp($foo,0)>. | |
| =item * | |
| C<shmread> was not setting the scalar flags correctly when reading from | |
| shared memory, causing the existing cached numeric representation in the | |
| scalar to persist [perl #98480]. | |
| =item * | |
| C<++> and C<--> now work on copies of globs, instead of dying. | |
| =item * | |
| C<splice()> doesn't warn when truncating | |
| You can now limit the size of an array using C<splice(@a,MAX_LEN)> without | |
| worrying about warnings. | |
| =item * | |
| C<< $$ >> is no longer tainted. Since this value comes directly from | |
| C<< getpid() >>, it is always safe. | |
| =item * | |
| The parser no longer leaks a filehandle if STDIN was closed before parsing | |
| started [perl #37033]. | |
| =item * | |
| C<< die; >> with a non-reference, non-string, or magical (e.g., tainted) | |
| value in $@ now properly propagates that value [perl #111654]. | |
| =back | |
| =head1 Known Problems | |
| =over 4 | |
| =item * | |
| On Solaris, we have two kinds of failure. | |
| If F<make> is Sun's F<make>, we get an error about a badly formed macro | |
| assignment in the F<Makefile>. That happens when F<./Configure> tries to | |
| make depends. F<Configure> then exits 0, but further F<make>-ing fails. | |
| If F<make> is F<gmake>, F<Configure> completes, then we get errors related | |
| to F</usr/include/stdbool.h> | |
| =item * | |
| On Win32, a number of tests hang unless STDERR is redirected. The cause of | |
| this is still under investigation. | |
| =item * | |
| When building as root with a umask that prevents files from being | |
| other-readable, F<t/op/filetest.t> will fail. This is a test bug, not a | |
| bug in perl's behavior. | |
| =item * | |
| Configuring with a recent gcc and link-time-optimization, such as | |
| C<Configure -Doptimize='-O2 -flto'> fails | |
| because the optimizer optimizes away some of Configure's tests. A | |
| workaround is to omit the C<-flto> flag when running Configure, but add | |
| it back in while actually building, something like | |
| sh Configure -Doptimize=-O2 | |
| make OPTIMIZE='-O2 -flto' | |
| =item * | |
| The following CPAN modules have test failures with perl 5.16. Patches have | |
| been submitted for all of these, so hopefully there will be new releases | |
| soon: | |
| =over | |
| =item * | |
| L<Date::Pcalc> version 6.1 | |
| =item * | |
| L<Module::CPANTS::Analyse> version 0.85 | |
| This fails due to problems in L<Module::Find> 0.10 and L<File::MMagic> | |
| 1.27. | |
| =item * | |
| L<PerlIO::Util> version 0.72 | |
| =back | |
| =back | |
| =head1 Acknowledgements | |
| Perl 5.16.0 represents approximately 12 months of development since Perl | |
| 5.14.0 and contains approximately 590,000 lines of changes across 2,500 | |
| files from 139 authors. | |
| Perl continues to flourish into its third decade thanks to a vibrant | |
| community of users and developers. The following people are known to | |
| have contributed the improvements that became Perl 5.16.0: | |
| Aaron Crane, Abhijit Menon-Sen, Abigail, Alan Haggai Alavi, Alberto | |
| Simões, Alexandr Ciornii, Andreas König, Andy Dougherty, Aristotle | |
| Pagaltzis, Bo Johansson, Bo Lindbergh, Breno G. de Oliveira, brian d | |
| foy, Brian Fraser, Brian Greenfield, Carl Hayter, Chas. Owens, | |
| Chia-liang Kao, Chip Salzenberg, Chris 'BinGOs' Williams, Christian | |
| Hansen, Christopher J. Madsen, chromatic, Claes Jacobsson, Claudio | |
| Ramirez, Craig A. Berry, Damian Conway, Daniel Kahn Gillmor, Darin | |
| McBride, Dave Rolsky, David Cantrell, David Golden, David Leadbeater, | |
| David Mitchell, Dee Newcum, Dennis Kaarsemaker, Dominic Hargreaves, | |
| Douglas Christopher Wilson, Eric Brine, Father Chrysostomos, Florian | |
| Ragwitz, Frederic Briere, George Greer, Gerard Goossen, Gisle Aas, | |
| H.Merijn Brand, Hojung Youn, Ian Goodacre, James E Keenan, Jan Dubois, | |
| Jerry D. Hedden, Jesse Luehrs, Jesse Vincent, Jilles Tjoelker, Jim | |
| Cromie, Jim Meyering, Joel Berger, Johan Vromans, Johannes Plunien, John | |
| Hawkinson, John P. Linderman, John Peacock, Joshua ben Jore, Juerd | |
| Waalboer, Karl Williamson, Karthik Rajagopalan, Keith Thompson, Kevin J. | |
| Woolley, Kevin Ryde, Laurent Dami, Leo Lapworth, Leon Brocard, Leon | |
| Timmermans, Louis Strous, Lukas Mai, Marc Green, Marcel Grünauer, Mark | |
| A. Stratman, Mark Dootson, Mark Jason Dominus, Martin Hasch, Matthew | |
| Horsfall, Max Maischein, Michael G Schwern, Michael Witten, Mike | |
| Sheldrake, Moritz Lenz, Nicholas Clark, Niko Tyni, Nuno Carvalho, Pau | |
| Amma, Paul Evans, Paul Green, Paul Johnson, Perlover, Peter John Acklam, | |
| Peter Martini, Peter Scott, Phil Monsen, Pino Toscano, Rafael | |
| Garcia-Suarez, Rainer Tammer, Reini Urban, Ricardo Signes, Robin Barker, | |
| Rodolfo Carvalho, Salvador Fandiño, Sam Kimbrel, Samuel Thibault, Shawn | |
| M Moore, Shigeya Suzuki, Shirakata Kentaro, Shlomi Fish, Sisyphus, | |
| Slaven Rezic, Spiros Denaxas, Steffen Müller, Steffen Schwigon, Stephen | |
| Bennett, Stephen Oberholtzer, Stevan Little, Steve Hay, Steve Peters, | |
| Thomas Sibley, Thorsten Glaser, Timothe Litt, Todd Rinaldo, Tom | |
| Christiansen, Tom Hukins, Tony Cook, Vadim Konovalov, Vincent Pit, | |
| Vladimir Timofeev, Walt Mankowski, Yves Orton, Zefram, Zsbán Ambrus, | |
| Ævar Arnfjörð Bjarmason. | |
| The list above is almost certainly incomplete as it is automatically | |
| generated from version control history. In particular, it does not | |
| include the names of the (very much appreciated) contributors who | |
| reported issues to the Perl bug tracker. | |
| Many of the changes included in this version originated in the CPAN | |
| modules included in Perl's core. We're grateful to the entire CPAN | |
| community for helping Perl to flourish. | |
| For a more complete list of all of Perl's historical contributors, | |
| please see the F<AUTHORS> file in the Perl source distribution. | |
| =head1 Reporting Bugs | |
| If you find what you think is a bug, you might check the articles | |
| recently posted to the comp.lang.perl.misc newsgroup and the perl | |
| bug database at L<http://rt.perl.org/perlbug/>. There may also be | |
| information at L<http://www.perl.org/>, the Perl Home Page. | |
| If you believe you have an unreported bug, please run the L<perlbug> | |
| program included with your release. Be sure to trim your bug down | |
| to a tiny but sufficient test case. Your bug report, along with the | |
| output of C<perl -V>, will be sent off to [email protected] to be | |
| analysed by the Perl porting team. | |
| If the bug you are reporting has security implications, which make it | |
| inappropriate to send to a publicly archived mailing list, then please | |
| send it to [email protected]. This points to a closed | |
| subscription unarchived mailing list, which includes all core | |
| committers, who will be able to help assess the impact of issues, figure | |
| out a resolution, and help co-ordinate the release of patches to | |
| mitigate or fix the problem across all platforms on which Perl is | |
| supported. Please use this address only for security issues in the Perl | |
| core, not for modules independently distributed on CPAN. | |
| =head1 SEE ALSO | |
| The F<Changes> file for an explanation of how to view exhaustive details | |
| on what changed. | |
| The F<INSTALL> file for how to build Perl. | |
| The F<README> file for general stuff. | |
| The F<Artistic> and F<Copying> files for copyright information. | |
| =cut | |