Commit 445251a
ARROW-9991: [C++] Split kernels for strings/binary
Contains:
* `split_pattern` kernel with max_split and reverse option
* `ascii_split_whitespace` similar to Python's `bytes.split`
* `utf8_split_whitespace` similar to Python's `str.split`
It should be easy to add new split methods, e.g. a regex one in the future.
Closes #8271 from maartenbreddels/ARROW-9991
Authored-by: Maarten A. Breddels <maartenbreddels@gmail.com>
Signed-off-by: Antoine Pitrou <antoine@python.org>1 parent 0e13e28 commit 445251a
11 files changed
Lines changed: 714 additions & 0 deletions
File tree
- cpp/src/arrow
- compute
- kernels
- util
- docs/source/cpp
- python/pyarrow
- includes
- tests
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
49 | 49 | | |
50 | 50 | | |
51 | 51 | | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
52 | 71 | | |
53 | 72 | | |
54 | 73 | | |
| |||
0 commit comments