aboutsummaryrefslogtreecommitdiff
path: root/v2/assets
diff options
context:
space:
mode:
authorGoogle Open Source <noreply+opensource@google.com>2022-02-16 16:48:21 -0800
committerBill Neubauer <wcn@google.com>2022-03-16 15:39:24 -0700
commit6086154fe860b90feacb4e06e4e6634a1c5d0a59 (patch)
tree9a82090d85432246d97bc4b98aa01ffc5322c674 /v2/assets
parentbbbfc18f4ca2a4ae16ea235940111e402761120f (diff)
downloadlicenseclassifier-6086154fe860b90feacb4e06e4e6634a1c5d0a59.tar.gz
Add the cjdict epilogue
PiperOrigin-RevId: 429172738
Diffstat (limited to 'v2/assets')
-rw-r--r--v2/assets/Supplement/BSD-3-Clause/cjdict.txt12
1 files changed, 12 insertions, 0 deletions
diff --git a/v2/assets/Supplement/BSD-3-Clause/cjdict.txt b/v2/assets/Supplement/BSD-3-Clause/cjdict.txt
new file mode 100644
index 0000000..d76d94b
--- /dev/null
+++ b/v2/assets/Supplement/BSD-3-Clause/cjdict.txt
@@ -0,0 +1,12 @@
+ # The word list in cjdict.txt are generated by combining three word lists
+ # listed below with further processing for compound word breaking. The
+ # frequency is generated with an iterative training against Google web
+ # corpora.
+ #
+ # * Libtabe (Chinese)
+ # - https://sourceforge.net/project/?group_id=1519
+ # - Its license terms and conditions are shown below.
+ #
+ # * IPADIC (Japanese)
+ # - http://chasen.aist-nara.ac.jp/chasen/distribution.html
+ # - Its license terms and conditions are shown below.