Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Switch internal representation to UTF8 #365

Merged
merged 38 commits into from
Sep 8, 2021
Merged
Show file tree
Hide file tree
Changes from 37 commits
Commits
Show all changes
38 commits
Select commit Hold shift + click to select a range
fd65ba2
Enable asserts in ord{2,3,4} and inline
Bodigrim Jul 14, 2021
0df0173
Use a primitive to resize MutableByteArray
Bodigrim May 22, 2021
c10dd8f
Implement utf8LengthByLeader via bit magic
Bodigrim May 22, 2021
250babf
Make copyI/copyM/new branchless
Bodigrim May 23, 2021
8476760
Change semantics of copyI: pass length, not end offset
Bodigrim May 23, 2021
22ce9c7
Shrink mutable arrays whenever possible
Bodigrim May 23, 2021
c99cc6b
Speed up words
Bodigrim Jun 22, 2021
de17480
Use AVX/SSE instructions for length/take/drop
Bodigrim May 22, 2021
b41d141
More rewrite rules
Bodigrim Jun 22, 2021
b160244
Speed up encodeUtf8 for strict and lazy Text
Bodigrim Jun 25, 2021
8b5bc09
Speed up reverse
Bodigrim Jun 25, 2021
b78ece1
Improve replicateChar
Bodigrim Jun 25, 2021
4970b72
Define iterArray
Bodigrim Jul 14, 2021
172a2ae
Implement mapAccum{L,R}
Bodigrim Jul 14, 2021
dac5ab7
Implement filter
Bodigrim Jul 17, 2021
d3e772a
Experiment with case conversions
Bodigrim Jul 18, 2021
ddf3455
Speed up instance Eq for lazy Text
Bodigrim Aug 2, 2021
81b1b50
Move equal into Data.Text.Internal.Lazy
Bodigrim Aug 2, 2021
1ab411b
Speed up replicate for lazy Text
Bodigrim Aug 2, 2021
c68f838
Add memchr to utils.c
Bodigrim Aug 30, 2021
92efd57
Speed up strict and lazy search
Bodigrim Aug 2, 2021
67e8943
Employ lexicographical comparison for compare
Bodigrim Aug 2, 2021
6a4e792
Speed up lines and unlines
Bodigrim Aug 30, 2021
a5b2dee
Speed up strict and lazy reading of numbers
Bodigrim Aug 7, 2021
79d9da0
Implement Data.Text.map explicitly
Bodigrim Aug 24, 2021
1a70117
Speed up Data.Text.intersperse
Bodigrim Aug 25, 2021
adbfc2d
Implement copy from/to pointer via primops
Bodigrim Sep 6, 2021
fc95ea4
Reimplement decodeASCII and decodeLatin1 to share C code
Bodigrim Sep 6, 2021
c0fd443
Use native (and naive) UTF8 decoder
Bodigrim Aug 21, 2021
32c76d1
Redesign concat
Bodigrim Aug 26, 2021
012612a
Use simdutf for UTF8 validation
Bodigrim Aug 1, 2021
bcc4dc6
Use GHC 8.10.5 for Windows build, because of issues with TH and simdutf
Bodigrim Jun 18, 2021
20b901d
Avoid reconstructing chars in commonPrefixes
Bodigrim Sep 4, 2021
87755a0
Switch internal representation to UTF-8
Bodigrim May 21, 2021
d3beb94
Rename constructors in Data.Array to highlight compatibility issues i…
Bodigrim Aug 24, 2021
fd8cf06
Make utf8Length branchless
Bodigrim May 22, 2021
fd49707
Bump version and update changelog
Bodigrim Sep 6, 2021
4e066ac
Tweak documentation
Bodigrim Sep 7, 2021
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion .github/workflows/windows_and_macOS.yml
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,7 @@ jobs:
strategy:
matrix:
os: ['windows-latest', 'macOS-latest']
ghc: ['9.0']
ghc: ['8.10.5']
fail-fast: false
steps:
- uses: actions/checkout@v2
Expand Down
2 changes: 1 addition & 1 deletion README.markdown
Original file line number Diff line number Diff line change
Expand Up @@ -29,4 +29,4 @@ based on the stream fusion framework developed by Roman Leshchinskiy,
Duncan Coutts, and Don Stewart.

The core library was fleshed out, debugged, and tested by Bryan
O'Sullivan <[email protected]>, and he is the current maintainer.
O'Sullivan. Transition from UTF-16 to UTF-8 is by Andrew Lelechenko.
346 changes: 0 additions & 346 deletions cbits/cbits.c

This file was deleted.

Loading