package wseg

A word identification system

Sources

0.3.0.tar.gz
sha256=5b80072200df6f6a502ede38198415b1595ae3e117f52aa1e307437705d1b143
md5=e56244d34e92bda9c7a9fdea99734748

Description

A full technical description of the MMSEG system is available at http://technology.chtsai.org/wseg/

Published: 29 Dec 2019

README

wseg

Usage

The test directory contains two plain-text dictionary files, one for characters and one for words. It also contains a test.ml program that shows how to build the dictionary and index with wseg and how to apply several rules to identify the words in a sentence. Run make runtest to try the demo.

char.dic contains 12640 Chinese characters and word.dic contains 157202 words, so you can extend the dictionaries or the demo for general use.
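
To give a feel for dictionary-based word identification, here is a minimal, self-contained sketch of simple forward maximum matching, the most basic matching strategy that MMSEG-style systems build on. It does not use the wseg API: the names utf8_chars, segment, and toy_dict are hypothetical, the four-word dictionary is a stand-in for word.dic, and the real library applies further disambiguation rules that this sketch omits.

```ocaml
(* Illustrative sketch only: NOT the wseg API. All names are hypothetical. *)

module StringSet = Set.Make (String)

(* Split a UTF-8 string into a list of characters, each kept as a string.
   Assumes the input is well-formed UTF-8. *)
let utf8_chars (s : string) : string list =
  let n = String.length s in
  (* Byte length of a character, read from its leading byte. *)
  let char_len c =
    let b = Char.code c in
    if b < 0x80 then 1
    else if b < 0xE0 then 2
    else if b < 0xF0 then 3
    else 4
  in
  let rec go i acc =
    if i >= n then List.rev acc
    else
      let len = min (char_len s.[i]) (n - i) in
      go (i + len) (String.sub s i len :: acc)
  in
  go 0 []

(* Simple forward maximum matching: at each position, take the longest
   dictionary word of at most [max_len] characters, falling back to a
   single character when nothing in the dictionary matches. *)
let segment ~(dict : StringSet.t) ~(max_len : int) (sentence : string)
    : string list =
  let chars = Array.of_list (utf8_chars sentence) in
  let n = Array.length chars in
  let join i len = String.concat "" (Array.to_list (Array.sub chars i len)) in
  let rec longest i len =
    if len <= 1 then 1
    else if StringSet.mem (join i len) dict then len
    else longest i (len - 1)
  in
  let rec go i acc =
    if i >= n then List.rev acc
    else
      let len = longest i (min max_len (n - i)) in
      go (i + len) (join i len :: acc)
  in
  go 0 []

(* Toy dictionary standing in for a real word list such as word.dic. *)
let toy_dict = StringSet.of_list [ "研究"; "研究生"; "生命"; "起源" ]

let () =
  segment ~dict:toy_dict ~max_len:3 "研究生命起源"
  |> String.concat " / "
  |> print_endline
```

With this toy dictionary the program prints 研究生 / 命 / 起源, because greedy matching commits to the longest first word. The intended reading is 研究 / 生命 / 起源, which is the kind of ambiguity that additional rules are meant to resolve.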

Dependencies (6)

  1. camomile >= "1.0" & < "2.0.0"
  2. trie >= "1.0"
  3. stdio >= "v0.9"
  4. base >= "v0.9"
  5. dune >= "1.4"
  6. ocaml >= "4.03.0"

Dev Dependencies

None

Used by (1)

  1. mmseg >= "transition"

Conflicts

None
