Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

오타 교정 사용시 사용자 사전에 추가한 복합 명사가 제대로 인식되지 않는 버그 #205

Closed
bab2min opened this issue Dec 16, 2024 · 0 comments

Comments

@bab2min
Copy link
Owner

bab2min commented Dec 16, 2024

재현가능한 kiwipiepy 기반 코드(kiwipiepy==0.20.2)

>>> kiwi = Kiwi()
>>> kiwi.add_user_word('함박 스테이크')
>>> kiwi.tokenize('함박 스테이크를 먹었습니다.')
[Token(form='함박 스테이크', tag='NNP', start=0, len=7),
 Token(form='를', tag='JKO', start=7, len=1),
 Token(form='먹', tag='VV', start=9, len=1),
 Token(form='었', tag='EP', start=10, len=1),
 Token(form='습니다', tag='EF', start=11, len=3),
 Token(form='.', tag='SF', start=14, len=1)]

>>> kiwi = Kiwi(typos='basic_with_continual')
>>> kiwi.add_user_word('함박 스테이크')
>>> kiwi.tokenize('함박 스테이크를 먹었습니다.')
[Token(form='함박', tag='NNG', start=0, len=2), ###
 Token(form='스테이크', tag='NNG', start=3, len=4), ###
 Token(form='를', tag='JKO', start=7, len=1),
 Token(form='먹', tag='VV', start=9, len=1),
 Token(form='었', tag='EP', start=10, len=1),
 Token(form='습니다', tag='EF', start=11, len=3),
 Token(form='.', tag='SF', start=14, len=1)]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant