Skip to content

Commit

Permalink
Prep 3.3.0 (#69)
Browse files Browse the repository at this point in the history
Update version. Update release date.  Update documentation to talk a little about the differences in Backrefs' Unicode properties and Regex's Unicode properties.
  • Loading branch information
facelessuser authored Feb 27, 2018
1 parent 25f9c05 commit f6eea85
Show file tree
Hide file tree
Showing 3 changed files with 9 additions and 3 deletions.
2 changes: 1 addition & 1 deletion backrefs/__init__.py
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
"""Backrefs package."""

# (major, minor, micro, release type, pre-release build, post-release build)
version_info = (3, 2, 1, 'final', 0, 0)
version_info = (3, 3, 0, 'final', 0, 0)


def _version():
Expand Down
2 changes: 2 additions & 0 deletions docs/src/markdown/changelog.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,8 @@

## 3.3.0

Feb 27, 2018

- **NEW**: Add support for `Indic_Positional_Category`\\`Indic_Matra_Category` and `Indic_Syllabic_Category` properties.

## 3.2.1
Expand Down
8 changes: 6 additions & 2 deletions docs/src/markdown/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -276,7 +276,11 @@ Back\ References | Description

## Unicode Properties

A number of various Unicode properties are supported in Backrefs, but only for Re as Regex already has its own implementation of Unicode properties. Some properties may not be available on certain Python versions due the Unicode build.
A number of various Unicode properties are supported in Backrefs, but only for Re as Regex already has its own implementation of Unicode properties. Some properties may not be available on certain Python versions due the included Unicode build.

It is important to note that Backrefs handles Unicode properties by transforming them to character classes with all the associated characters: `\p{Cs}` --> `[\ud800\udb7f-\udb80\udbff-\udc00\udfff]`. Because of this, Backrefs can create really large regular expressions that the underlying engine must walk through. In short, Re with Backrefs will never be as efficient or fast as using Regex's Unicode properties, but it is very useful when you need or want to use Re.

Also, keep in mind that there are most likely some differences between Regex's Unicode Properties and Backrefs' Unicode properties. One notable difference is Regex does not currently implement `script_extensions` while Backrefs' does and uses them as the default when specifying them in the form `\p{IsScriptValue}` or `\p{ScriptValue}` just like Perl does. See [Special Syntax Exceptions](#special-syntax-exceptions) for more info.

Supported\ Properties | Aliases
------------------------------------------- | -------
Expand Down Expand Up @@ -332,7 +336,7 @@ There are a number of binary properties. In general, binary properties are speci
General Category, Script, Blocks, and Binary all can be specified by their value alone: `\p{value}`, but they will be evaluated in the following order to resolve name conflicts as some the same value that is used in Script may be used in Blocks etc.

1. General Category
2. Script
2. Script (with Script Extensions on Python 3+)
3. Blocks
4. Binary

Expand Down

0 comments on commit f6eea85

Please sign in to comment.