[3.14] gh-135676: Simplify docs on lexing names (GH-140464) by StanFromIreland · Pull Request #142015 · python/cpython

StanFromIreland · Nov 27, 2025

This simplifies the Lexical Analysis section on Names (but keeps it technically correct) by putting all the info about non-ASCII characters in a separate (and very technical) section.

It uses a mental model where the parser doesn't handle Unicode complexity “immediately”, but:

parses any non-ASCII character (outside strings/comments) as part of a name, since these can't (yet) be e.g. operators
normalizes the name
validates the name, using the xid_start/xid_continue sets

(cherry picked from commit 2ff8608)

Issue: Reword the Lexical Analysis chapter of the docs #135676

📚 Documentation preview 📚: https://cpython-previews--142015.org.readthedocs.build/

This simplifies the Lexical Analysis section on Names (but keeps it technically correct) by putting all the info about non-ASCII characters in a separate (and very technical) section. It uses a mental model where the parser doesn't handle Unicode complexity “immediately”, but: - parses any non-ASCII character (outside strings/comments) as part of a name, since these can't (yet) be e.g. operators - normalizes the name - validates the name, using the xid_start/xid_continue sets (cherry picked from commit 2ff8608) Co-authored-by: Petr Viktorin <encukou@gmail.com> Co-authored-by: Stan Ulbrych <89152624+StanFromIreland@users.noreply.github.com> Co-authored-by: Blaise Pabon <blaise@gmail.com> Co-authored-by: Micha Albert <info@micha.zone> Co-authored-by: KeithTheEE <kmurrayis@gmail.com>

StanFromIreland · Nov 27, 2025

Doc/reference/lexical_analysis.rst

+.. _PropList.txt: https://www.unicode.org/Public/16.0.0/ucd/PropList.txt
+.. _DerivedCoreProperties.txt: https://www.unicode.org/Public/16.0.0/ucd/DerivedCoreProperties.txt


(un-)updated these to UCD 16.0.0.

encukou · Dec 3, 2025

Thank you!

StanFromIreland requested review from AA-Turner and willingc as code owners November 27, 2025 12:31

bedevere-app bot mentioned this pull request Nov 27, 2025

gh-135676: Simplify docs on lexing names #140464

Merged

bedevere-app bot added skip news awaiting review labels Nov 27, 2025

StanFromIreland assigned encukou Nov 27, 2025

bedevere-app bot added the docs Documentation in the Doc dir label Nov 27, 2025

github-project-automation bot added this to Docs PRs Nov 27, 2025

github-project-automation bot moved this to Todo in Docs PRs Nov 27, 2025

bedevere-app bot mentioned this pull request Nov 27, 2025

Reword the Lexical Analysis chapter of the docs #135676

Open

StanFromIreland commented Nov 27, 2025

View reviewed changes

StanFromIreland requested a review from encukou December 1, 2025 18:52

encukou merged commit 79245a4 into python:3.14 Dec 3, 2025
36 checks passed

bedevere-app bot removed the awaiting review label Dec 3, 2025

github-project-automation bot moved this from Todo to Done in Docs PRs Dec 3, 2025

StanFromIreland deleted the backport-2ff8608-3.14 branch December 3, 2025 13:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[3.14] gh-135676: Simplify docs on lexing names (GH-140464)#142015

[3.14] gh-135676: Simplify docs on lexing names (GH-140464)#142015
encukou merged 1 commit intopython:3.14python/cpython:3.14from
StanFromIreland:backport-2ff8608-3.14StanFromIreland/cpython:backport-2ff8608-3.14Copy head branch name to clipboard

StanFromIreland commented Nov 27, 2025 •

edited by github-actions bot

Loading

Uh oh!

StanFromIreland Nov 27, 2025

Uh oh!

encukou commented Dec 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		.. _PropList.txt: https://www.unicode.org/Public/16.0.0/ucd/PropList.txt
		.. _DerivedCoreProperties.txt: https://www.unicode.org/Public/16.0.0/ucd/DerivedCoreProperties.txt

Search code, repositories, users, issues, pull requests...

Uh oh!

Conversation

StanFromIreland commented Nov 27, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

StanFromIreland Nov 27, 2025

Choose a reason for hiding this comment

Uh oh!

encukou commented Dec 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

StanFromIreland commented Nov 27, 2025 •

edited by github-actions bot

Loading