結(jié)巴分詞會將整段數(shù)字分成一個詞匯
代碼示例如下
import sys
import jieba
#jieba.load_userdict('userdict.txt')
test_sent = '驗證碼123678'
tokenized = jieba.tokenize(test_sent)
print(type(tokenized))
print(f"{[t for t in tokenized]}")
print(type(tokenized))
輸出示例如下