-
feat: add jamba 1.5 tokenizers (mini and large) (#96)
-
feat: add jamba 1.5 tokenizers (mini and large)
-
test: fix value error on conftest.py
-
refactor: rename file name
-
chore: fix path
-
fix: enter (e052b22)
-
chore(release): v0.11.3 [skip ci] (5be5b59)
-
chore(deps-dev): bump authlib from 1.2.1 to 1.3.1 (#91)
Bumps authlib from 1.2.1 to 1.3.1.
updated-dependencies:
- dependency-name: authlib dependency-type: indirect ...
Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> (fe24518)
- chore(deps-dev): bump black from 23.3.0 to 24.3.0 (#90)
Bumps black from 23.3.0 to 24.3.0.
updated-dependencies:
- dependency-name: black dependency-type: direct:development ...
Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> (5257f3d)
- chore(deps-dev): bump python-semantic-release from 8.3.0 to 8.7.0 (#80)
Bumps python-semantic-release from 8.3.0 to 8.7.0.
updated-dependencies:
- dependency-name: python-semantic-release dependency-type: direct:development update-type: version-update:semver-minor ...
Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Asaf Gardin <[email protected]>
Co-authored-by: Asaf Joseph Gardin <[email protected]> (376894f)
- ci: allow running tests and semantic checks on external prs (#93) (c08604d)
- fix(deps): bump requests from 2.31.0 to 2.32.2 (#92)
Bumps requests from 2.31.0 to 2.32.2.
updated-dependencies:
- dependency-name: requests dependency-type: indirect ...
Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Asaf Joseph Gardin <[email protected]> (3216898)
- chore(release): v0.11.2 [skip ci] (57e712e)
-
fix: add missing space tokens init on jurassic (#89)
-
fix: missing space tokens init on jurassic, fix import in example
-
test: add test for space_tokens init (4ec43a0)
- chore(release): v0.11.1 [skip ci] (07bf0cd)
-
fix: Added backwards compatibility for jamba tokenizer (#88)
-
fix: Added backwards compatibility for jamba tokenizer
-
fix: Added to async
Co-authored-by: asafg <[email protected]> (9ad3475)
- chore(release): v0.11.0 [skip ci] (0df321c)
-
feat: Async tokenizer (#86)
-
feat: support async, wip
-
feat: fix and add tests, examples, update readme
-
fix: poetry lock
-
fix: anyio -> aiofiles
-
fix: try 3.8
-
fix: remove 3.7 from tests
-
fix: poetry lock
-
fix: add 3.7 back
-
fix: poetry lock
-
fix: poetry.lock
-
ci: pipenv
-
fix: pipenv
-
fix: pipenv
-
fix: pyproject
-
fix: lock
-
fix: version
-
fix: Removed aiofiles
-
ci: update python version,
-
fix: switch from aiofiles to anyio, remove redundant comments
-
chore: poetry lock
-
fix: disable initializing async classes directly, cr comments
-
test: fix import
-
ci: add asyncio-mode to test workflow
-
fix: to_thread -> run_in_executor
-
ci: add asyncio
-
fix: cr comments
-
fix: cr comments
Co-authored-by: asafg <[email protected]> (3006cda)
- chore(release): v0.10.0 [skip ci] (1178ba7)
- feat: remove python 3.7 support (#87) (58482ab)
- chore(release): v0.9.1 [skip ci] (a260d76)
- fix: depend on less restrictive version of tokenizers (#85) (eab6a14)
- chore(release): v0.9.0 [skip ci] (974df9e)
- feat: Jamba instruct tokenizer (#84) (88ff9af)
- chore(release): v0.8.2 [skip ci] (1146741)
-
ci: add Python 3.12 to test matrix (#82)
-
ci: add Python 3.12 to test matrix
-
chore: use sentencepiece 0.2.0 or higher
-
fix: update poetry.lock (8084117)
- fix: docs (#83) (c26949a)
- Update issue templates (86ea6e7)
- chore(release): v0.8.1 [skip ci] (fcacbf8)
- fix: re-ordered parameters in ctor to avoid a breaking change (#79) (6c1b608)
-
chore(release): v0.8.0 [skip ci] (c8b54df)
-
chore(deps-dev): bump pytest from 7.2.1 to 7.4.4 (#75)
Bumps pytest from 7.2.1 to 7.4.4.
updated-dependencies:
- dependency-name: pytest dependency-type: direct:development update-type: version-update:semver-minor ...
Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: asafgardin <[email protected]> (081dda3)
-
feat: Add start_of_line to decode (#77)
-
feat: Add start_of_line param to decode
-
test: added unittest with start_of_line=True and False (182a8d1)
- chore(release): v0.7.0 [skip ci] (26f34b2)
-
feat: Init tokenizer from filehandle (#76)
-
feat: allow creating JurassicTokenizer from model file handle
-
fix: Add default for model_path and model_file_handle
-
feat: Add JurassicTokenizer.from_file_path classmethod
-
fix: remove model_path=None in JurassicTokenizer.from_file_handle
-
fix: rename _assert_exactly_one to _validate_init and make it not static
-
refactor: semantics
-
test: Added tests
Co-authored-by: Asaf Gardin <[email protected]> (dcb73a7)
-
chore(release): v0.6.0 [skip ci] (7b8348d)
-
chore: add test case for encode with is_start=False (#74)
-
chore: add test case for encode with is_start=False
-
fix: split is_start=False to a different testcase (77c0a39)
-
feat: Add decode with offsets (#73)
-
feat: Add decode_with_offsets() to JurassicTokenizer
-
refactor: remove kwargs from decode_with_offsets since it's not used
-
chore: Add unittest for decode and for offsets
-
fix: test only decode_with_offsets
-
fix: dummy for returned offsets in decode_with_offsets (a5a7bb4)
-
feat: Add the is_start parameter to JurassicTokenizer.encode() (#72)
-
feat: Add the is_start parameter to JurassicTokenizer.encode()
-
refactor: take 'is_start' from kwargs (296bda5)
- chore(release): v0.5.0 [skip ci] (96f384f)
-
feat: Add more special tokens (#71)
-
fix: commitizen tag starts with "v"
-
feat: add eos_id
-
feat: Add newline_id
-
fix: typo "_newline_piece" instead of "newline_piece"
-
fix: newline_id already existed as "private". Just make it "public"
-
fix: forgot to rename everywhere (9a9e1a8)
- fix: commitizen tag starts with "v" (#70) (cf495ad)
- chore(release): v0.4.0 [skip ci] (b761edc)
- feat: add pad_id and bos_id to jurassic_tokenizer (#69) (ffb2ce3)
- chore(release): v0.3.11 [skip ci] (5280149)
- fix: BaseTokenizer in init (#68) (3cc71e7)
-
chore(release): v0.3.10 [skip ci] (1601535)
-
chore(deps-dev): bump safety from 2.3.4 to 2.3.5 (#64)
Bumps safety from 2.3.4 to 2.3.5.
updated-dependencies:
- dependency-name: safety dependency-type: direct:development update-type: version-update:semver-patch ...
Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> (95696bb)
- chore(deps-dev): bump ruff from 0.0.285 to 0.1.8 (#63)
Bumps ruff from 0.0.285 to 0.1.8.
updated-dependencies:
- dependency-name: ruff dependency-type: direct:development update-type: version-update:semver-minor ...
Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> (81123d3)
- chore(deps-dev): bump black from 22.12.0 to 23.3.0 (#61)
Bumps black from 22.12.0 to 23.3.0.
updated-dependencies:
- dependency-name: black dependency-type: direct:development update-type: version-update:semver-major ...
Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> (7190d28)
- chore(deps-dev): bump safety from 2.3.4 to 2.3.5 (#60)
Bumps safety from 2.3.4 to 2.3.5.
updated-dependencies:
- dependency-name: safety dependency-type: direct:development update-type: version-update:semver-patch ...
Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> (2fa7bef)
- fix: JurassicTokenizer in init (#67) (253ae07)
-
refactor: Added all in init (#65)
-
refactor: Added all in init
-
fix: tests
-
refactor: added version to all (c0d9286)
-
refactor: sentencepiece version to support all patch versions (#66) (845008c)
-
chore(release): v0.3.9 [skip ci] (84f17da)
-
chore: add github badges (#58) (821455c)
-
chore(deps-dev): bump urllib3 from 2.0.4 to 2.0.7 (#57)
Bumps urllib3 from 2.0.4 to 2.0.7.
updated-dependencies:
- dependency-name: urllib3 dependency-type: indirect ...
Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> (93ef6d6)
-
fix: Modify badges (#59)
-
docs: fixed url
-
fix: inline
-
fix: README.md (20b7090)
- chore(release): v0.3.8 [skip ci] (645ff5e)
- fix: readme example (#56) (b713da8)
- chore(release): v0.3.7 [skip ci] (a181ae6)
-
ci: workflow dispatch for release (#54) (dbf5609)
-
ci: Automate pypi publish (#53)
-
ci: Automate pypi publish on new release
-
fix: Remove comment
-
fix: title of action
-
fix: title of action (7c04fda)
-
fix: Examples in readme (#55)
-
ci: workflow dispatch for release
-
docs: Updated readme with more examples
-
docs: Added docs to base class (94f3a3c)
- chore(release): v0.3.6 [skip ci] (550644d)
- docs: CODEOWNERS (#49) (1e6513b)
- fix: support ai21_tokenizer.version (#52) (13944eb)
- chore(release): v0.3.5 [skip ci] (217b14e)
-
fix: newline n and prettier (#47)
-
fix: newline n and prettier
-
fix: exclude
-
fix: exclude from pretty (f5c9204)
- chore(release): v0.3.4 [skip ci] (be67c79)
- fix: newline r (#46) (bc97ae1)
- chore(release): v0.3.3 [skip ci] (f6db520)
- ci: on push release (#43) (95bfc95)
- docs: readme (#44) (af23ac4)
- fix: setup (#45) (1b8c00f)
- chore(release): v0.3.2 [skip ci] (f3b2e73)
- ci: Added newline sequence (#41) (63a1898)
-
fix: Version path (#42)
-
fix: version
-
fix: version variables
-
fix: name (ee4d744)
- chore(release): v0.3.1 [skip ci] (2eebeb8)
- chore(release): v0.3.0 [skip ci] (0c4ada0)
-
docs: Release md update before publish (#36)
-
fix: Added support for both str and path
-
fix: rename package
-
fix: updated pre commits and added new one
-
docs: Updated docs
-
ci: down grade
-
docs: Added another example (18ccbeb)
-
docs: CONTRIBUTING.md (#35)
-
docs: CONTRIBUTING.md
-
ci: end_of_line fix
-
docs: inv test (e282440)
- feat: Added char for testing (#37) (40d3feb)
- fix: string example (#38) (833038c)
-
chore(release): v0.2.0 [skip ci] (8988faa)
-
chore(deps-dev): bump black from 22.12.0 to 23.3.0 (#32)
Bumps black from 22.12.0 to 23.3.0.
updated-dependencies:
- dependency-name: black dependency-type: direct:development update-type: version-update:semver-major ...
Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: asafgardin <[email protected]> (bb4986e)
-
feat: Tokenizer factory (#31)
-
feat: Added tokenizer abc and factory
-
fix: api to receive default and none
-
fix: example
-
fix: factory and tests
-
fix: rename base
-
fix: rename base class
-
fix: rename package
-
fix: example
-
fix: readme and tasks
-
docs: factory class
-
docs: renames
-
fix: directory hierarchy in tests
-
fix: rename package
-
chore(release): v0.1.2 [skip ci]
-
fix: rename package
-
ci: example
-
fix: assert in example
-
fix: src_path
Co-authored-by: github-actions <[email protected]> (e55cd1d)
- fix: token name (#34) (2b229b2)
-
chore(release): v0.1.2 [skip ci] (5b1dc14)
-
chore(deps-dev): bump safety from 2.3.4 to 2.3.5 (#28)
Bumps safety from 2.3.4 to 2.3.5.
updated-dependencies:
- dependency-name: safety dependency-type: direct:development update-type: version-update:semver-patch ...
Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> (28118ac)
- chore(deps-dev): bump pytest-mock from 3.10.0 to 3.11.1 (#24)
Bumps pytest-mock from 3.10.0 to 3.11.1.
updated-dependencies:
- dependency-name: pytest-mock dependency-type: direct:development update-type: version-update:semver-minor ...
Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: asafgardin <[email protected]> (0a36f21)
-
ci: Remove install from publish (#29)
-
ci: Removed install dependency
-
ci: changelog changes (f8e8392)
-
ci: dependabot pr limit (#27)
-
ci: dependabot pr limit
-
ci: dependabot pr limit (9b2c4f8)
- fix: workflow dispatch for release action (#33) (f81b4ab)
- chore(release): v0.1.1 [skip ci] (f5150f5)
-
fix: used PAT (#26)
-
test: write to main
-
fix: token
-
test: debug
-
test: debugging tokens
-
test: uncomment
-
test: write to main
-
fix: token
-
test: debug
-
test: debugging tokens
-
test: uncomment
-
fix: Changed to main (2ff12c9)
- chore(release): v0.1.0 [skip ci] (48607f9)
-
ci: Create dependabot.yml (#19)
-
ci: Create dependabot.yml
-
fix: commit-message prefix
-
fix: Added more config to dependabot action (23faaa8)
-
feat: Pypi publish (#18)
-
feat: Added setup.py
-
feat: Added publish.yaml (77ee751)
-
feat: test PAT (#9) (a10b6a4)
-
feat: Added semantic prs actions (#8) (afab5ff)
-
fix: Added permissions (#25)
-
fix: Added permissions
-
fix: permissions location
-
fix: verbose
-
fix: Removed bad input "root_options" (d684575)
-
fix: Change token (#17)
-
fix: token key
-
fix: token github
-
fix: token github cls
-
fix: token github cls (ec4f35b)
-
fix: Change token (#16)
-
fix: token key
-
fix: token github (d876a43)
-
fix: token key (#15) (0b76344)
-
fix: keys (#14) (b064ea6)
-
fix: Test token (#11)
-
feat: test PAT
-
feat: test github token
-
fix: PAT (94f64b6)
-
fix: Test token (#10)
-
feat: test PAT
-
feat: test github token (52484fe)
-
fix: root_options verbose (#6) (220ba5b)
-
fix: Release action test (#5)
-
fix: Added release step
-
fix: branch name for testing
-
fix: Removed comment
-
chore(release): v0.0.1 [skip ci]
-
fix: branch rename
-
fix: Removed CHANGELOG.md
Co-authored-by: github-actions <[email protected]> (fae5423)
-
test: Test token (#13)
-
feat: test PAT
-
feat: test github token
-
fix: PAT
-
test: Added test step
-
test: Added test step
-
test: Added token to use (d445e36)
-
test: Test token (#12)
-
feat: test PAT
-
feat: test github token
-
fix: PAT
-
test: Added test step
-
test: Added test step (9619c3e)
-
Add kwargs to functions (#7)
-
feat: added kwargs
-
test: Added tests (efecff9)
-
Release action (#4)
-
feat: Added release action
-
fix: Removed unnecessary code
-
fix: testing on branch
-
fix: removed node install
-
fix: Removed unnecessary step
-
fix: base-branch
-
fix: removed code
-
0.0.2
-
feat: python-semantic-release test
-
fix: branch name
-
fix: branch name in .toml
-
fix: change from branch to match
-
fix: Added release_action part
-
chore(release): v0.1.0 [skip ci]
-
refactor: removed CHANGELOG.md
-
fix: branch to main
-
feat: Added version.py to version_variable
-
feat: Upgraded python-semantic-release
-
feat: Added python-semantic-release
-
fix: Removed unnecessary file
-
fix: Changed version
Co-authored-by: github-action <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions <[email protected]> (694683a)
-
Add code (#2)
-
feat: Jurassic tokenizer
-
fix: remove is_start
-
fix: add types
-
fix: add types
-
chore: extracted utils
-
fix: simplified tokenizer even more
-
fix: simplified tokenizer even more
-
feat: Added tests
-
feat: exposed prop
Co-authored-by: Asaf Gardin <[email protected]> (6b80a05)