CHANGES IN sentencepiece
VERSION 0.2.3
- fix R CMD check warning caused by the switch to snprintf in version 0.2.2.
- in third_party/protobuf-lite/strutil.cc:506:33: warning: argument to
‘sizeof’ in ‘int snprintf(char*, size_t, const char*, …)’ call is
the same expression as the destination; did you mean to provide an
explicit length? [-Wsizeof-pointer-memaccess]
 
- this part of third_party/protobuf-lite/strutil.cc was not used in
sentencepiece
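
The warning quoted above fires when sizeof is applied to a pointer that is also the snprintf destination, because that yields the size of the pointer and not the capacity of the buffer. A minimal sketch of the usual remedy, passing the length explicitly, follows. The helper names format_int and int_to_string are hypothetical and are not taken from strutil.cc.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdio>
#include <string>

// Hypothetical helper (not from strutil.cc): formats an integer into a
// caller-supplied buffer. `dest` is a pointer, so sizeof(dest) would be the
// pointer size rather than the buffer capacity; the capacity is therefore
// passed in explicitly as dest_size.
int format_int(char* dest, std::size_t dest_size, int value) {
  assert(dest != nullptr && dest_size > 0);
  // snprintf writes at most dest_size - 1 characters plus a terminating '\0'
  // and returns the length the untruncated output would have had.
  return std::snprintf(dest, dest_size, "%d", value);
}

// Wrapper that owns a real array, where sizeof does give the capacity.
std::string int_to_string(int value) {
  char buffer[32];
  format_int(buffer, sizeof(buffer), value);  // sizeof of an array: 32 bytes
  return std::string(buffer);
}
```

With a too-small buffer the output is truncated instead of overflowing, and the return value reports the length that would have been needed.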
 
 
CHANGES IN sentencepiece
VERSION 0.2.2
- use snprintf instead of sprintf to handle the R CMD check
deprecation note on M1 Mac
 
CHANGES IN sentencepiece
VERSION 0.2.1
- Fix for clang-UBSAN error
 
CHANGES IN sentencepiece
VERSION 0.2
- Fix wordpiece bug for 1-character words. (@jonthegeek, #4)
 
- Upgraded to sentencepiece release v0.1.96
 
CHANGES IN sentencepiece
VERSION 0.1.3
- Fix wordpiece bug for 1-character words. (@jonthegeek, #4)
 
- Fix Solaris installation issue related to incorrect usage of pointer
as a function
 
- Also download the binary model in sentencepiece_download_model as it
can be loaded with word2vec::read.wordvectors
 
- read_word2vec now uses word2vec::read.wordvectors from word2vec
>= 0.2.0
 
- added BPEembed and predict.BPEembed
 
- allow subword regularisation by adding the nbest and alpha options to
sentencepiece_encode; sentencepiece_decode changed accordingly
 
- Added txt_remove_
 
- Upgrade sentencepiece to release v0.1.91 commit
a32d7dc6ce6f383a65ad6e1cbe1983f94ab11932 which has subword
regularisation for BPE
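
As a rough illustration of the nbest and alpha options mentioned in this version, the sketch below assumes they follow the n-best sampling scheme of upstream sentencepiece, where one of the nbest candidate segmentations is drawn with probability proportional to P(x)^alpha. The function sampling_weights is hypothetical and is not part of the package.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Illustrative only: converts the log-probabilities of the n-best candidate
// segmentations into sampling weights proportional to exp(alpha * logprob),
// which equals P(x)^alpha, normalised to sum to 1.
std::vector<double> sampling_weights(const std::vector<double>& logprobs,
                                     double alpha) {
  assert(!logprobs.empty());
  // Subtract the largest scaled value before exponentiating so that exp()
  // never overflows.
  double max_scaled = alpha * logprobs[0];
  for (double lp : logprobs) max_scaled = std::max(max_scaled, alpha * lp);

  std::vector<double> weights;
  weights.reserve(logprobs.size());
  double total = 0.0;
  for (double lp : logprobs) {
    const double w = std::exp(alpha * lp - max_scaled);
    weights.push_back(w);
    total += w;
  }
  for (double& w : weights) w /= total;
  return weights;
}
```

Under this assumption, alpha = 0 makes all candidates equally likely, while larger alpha values concentrate the weight on the most probable segmentation.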
 
CHANGES IN sentencepiece
VERSION 0.1.2
- Fix Solaris installation issue caused by calling log on a uint64, which
is not defined on Solaris
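
One common way to avoid this kind of failure is to convert the integer to double before calling log, so that a floating-point overload is selected unambiguously. The sketch below shows that pattern; the helper log_count is hypothetical, and the actual fix in the package may differ.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>

// Hypothetical helper: natural logarithm of a 64-bit unsigned count.
// The explicit cast picks the double overload of std::log, which exists on
// every platform, instead of relying on an integer overload.
double log_count(std::uint64_t count) {
  assert(count > 0);  // log(0) is not a finite value
  return std::log(static_cast<double>(count));
}
```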
 
CHANGES IN sentencepiece
VERSION 0.1.1
- Added verbose argument in sentencepiece
 
CHANGES IN sentencepiece
VERSION 0.1.0
- Initial package based on https://github.com/google/sentencepiece
release v0.1.84 commit 2424d82d396b43b2556203c592e48a621ef10f3c
 
- Third-party code from
https://github.com/google/sentencepiece/tree/master/third_party is put
in src/absl, src/esaxx, src/darts_clone, src/protobuf-lite